National Center for Biotechnology Information (NCBI)
welcome to the blast network service.
PEPTIDE SEQUENCE DATABASES
nr Non-redundant GenBank CDS translations+PDB+SwissProt+PIR
month All new or revised GenBank CDS translation+PDB+SwissProt+PIR
sequences released in the last 30 days
pdb PDB protein sequences
yeast Yeast (Saccharomyces cerevisiae) protein sequences.
kabat Kabat Sequences of Proteins of Immunological Interest
alu * Translations of Select Alu Repeats from REPBASE
swissprot SwissProt sequences
NUCLEOTIDE SEQUENCE DATABASES
nr Non-redundant GenBank+EMBL+DDBJ+PDB sequences
(but no EST, STS, GSS, or HTGS sequences)
month All new or revised GenBank+EMBL+DDBJ+PDB sequences released in
the last 30 days
yeast Yeast (Saccharomyces cerevisiae) genomic nucleotide sequences.
est + Non-redundant Database of GenBank+EMBL+DDBJ EST Division
sts + Non-redundant Database of GenBank+EMBL+DDBJ STS Division
htgs High Throughput Genomic Sequences
pdb PDB nucleotide sequences
vector Vector subset of GenBank
mito * Database of mitochondrial sequences, Rel. 1.0, July 1995
gss Genome Survey Sequences (includes single-pass genomic data, exon-
trapped sequences, and Alu PCR primers.
kabat Kabat Sequences of Nucleic Acid of Immunological Interest
epd Eukaryotic Promotor Database
alu *+ Select Alu Repeats from REPBASE
BLASTP 2.2.2 [Dec-14-2001]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= orf1 [1-39]
(37 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.322 0.137 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 17,996,896
Number of Sequences: 919193
Number of extensions: 297434
Number of successful extensions: 558
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 558
Number of HSP's gapped (non-prelim): 0
length of query: 37
length of database: 288,098,425
effective HSP length: 13
effective length of query: 24
effective length of database: 276,148,916
effective search space: 6627573984
effective search space used: 6627573984
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf2 [38-276]
(237 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.314 0.129 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 101,932,983
Number of Sequences: 919193
Number of extensions: 3377873
Number of successful extensions: 7910
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 7910
Number of HSP's gapped (non-prelim): 0
length of query: 237
length of database: 288,098,425
effective HSP length: 117
effective length of query: 120
effective length of database: 180,552,844
effective search space: 21666341280
effective search space used: 21666341280
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 89 (38.9 bits)
Query= orf3 [270-946]
(675 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_518974.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED... 205 1e-051
dbj|BAA89640.1| (AB036666) similar to terminase large subun... 203 6e-051
dbj|BAA89621.1| (AB036665) similar to terminase large subun... 201 3e-050
ref|NP_297798.1| (NC_002488) phage-related terminase large ... 115 2e-024
ref|NP_298001.1| (NC_002488) phage-related terminase large ... 96 1e-018
ref|NP_461544.1| (NC_003197) Gifsy-1 prophage: similar to t... 93 1e-017
ref|NP_310206.1| (NC_002695) putative terminase large subun... 92 1e-017
ref|NP_287368.1| (NC_002655) putative DNA packaging protein... 92 2e-017
ref|NP_040581.1| (NC_001416) DNA packaging protein [bacteri... 91 3e-017
sp|P36693|TERL_BPP21 TERMINASE LARGE SUBUNIT (GP2) >gi|5407... 89 2e-016
>ref|NP_518974.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD14555.1| (AL646061) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 660
Score = 205 bits (521), Expect = 1e-051
Identities = 178/621 (28%), Positives = 282/621 (44%), Gaps = 59/621 (9%)
Query: 29 LAPSEWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
L SEW++R R + + S PG + PY R ++DC P SP+ V K AQ+GAT
Sbjct: 26 LTVSEWSDRHRMLSSKASAEPGRWRTGRTPYLRAIMDCLSPTSPIERVVFMKAAQLGAT- 84
Query: 89 SVLENIIAYGIDYVKTASM-IFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGT 147
+ N I Y I + M ++ TV ++ KR I P++E S + A +
Sbjct: 85 EMGSNWIGYVIHHAPGPMMAVWPTV--EMAKRNSKQRIDPLIEESA----VLAERIAPAR 138
Query: 148 KR-SGATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLT 206
R SG T E+ GGVL GA S +RS PV +L DEV G+P V +GD + L
Sbjct: 139 SRDSGNTILAKEF-RGGVLVMTGANSAVGLRSMPVRYLFLDEVDGYPLDVEGEGDAISLA 197
Query: 207 ETRTNSFSATRKVLDLSTPLPAGNDTITQRFEKGDQRYWEVPCKHCGGFQRLYFRGNAKK 266
E RT +F A RK+ +STP AG TI + ++ DQR + VPC HC Q L F
Sbjct: 198 EARTRTF-ARRKIFIVSTPTIAGASTIEREYDASDQRRYFVPCPHCDHRQWLRFE----- 251
Query: 267 GQGRLIWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGHWVPTAKPSRPDFRSYHL 326
+L W T+G P++ Y+C C + K + +G W + +HL
Sbjct: 252 ---QLRW-TKG---EPETAAYICEACFEPIHEHHKAWMLAQGEWRAMVETGGRT-AGFHL 303
Query: 327 SAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYELKSNRVKPREV 386
S++Y+P RSW+EIA AW + +A ++ F N +LGE + ++ +
Sbjct: 304 SSLYSPVGWRSWREIAAAWESAVSKESGSAA---AIKTFRNTELGETW-VEEGEAPDWQR 359
Query: 387 YAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAPSADHSGYVPYV 446
RR+ Y +G VP + AG ++ + V V WA G ++
Sbjct: 360 LLERREDYPIGTVPAGGLLLSAGADVQKDRIEVSV-----------WAFG---RGKEAWL 405
Query: 447 IDYEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASELTDVVYEYCNDW 506
+++ + GD T WK+L E++ + + G +AR+ +D T Y +
Sbjct: 406 VEHRVLMGD--TARDAVWKRLAELVEEQWTHASGATMPLARLALDTGFATQEAYAFVRAC 463
Query: 507 GE-NVIPIRG--RDLPIKGAQIKY---FNRQMSEKGVEYLSVTVDLYKDRWSPTLRK--E 558
G+ V+ ++G R + G N + +G++ +V V + K + LRK +
Sbjct: 464 GDARVMAVKGTARGAALIGTPTAVDVTRNGKKLRRGIKVFTVAVGIAKLEFYNNLRKAAD 523
Query: 559 WSGQG---EMPRGMFNAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGGSRNEM 615
+ G P G + P I++++L++L E RD + I + W + RNE
Sbjct: 524 VAKDGATIAFPTGFVHLP-KIDAEFLQQLCAEQLITRRDRSGFPI--RAWQKM-RERNEA 579
Query: 616 WDTLIYNTAIFESMVLEACED 636
D +Y A + L+ E+
Sbjct: 580 LDCYVYARAAASAAGLDRFEE 600
>dbj|BAA89640.1| (AB036666) similar to terminase large subunit of phage lambda
[Wolbachia sp. wKue]
Length = 609
Score = 203 bits (516), Expect = 6e-051
Identities = 177/634 (27%), Positives = 280/634 (43%), Gaps = 83/634 (13%)
Query: 29 LAPSEWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
L SEWA R + + PG + + PY +E++D P SP V KGAQ+G T
Sbjct: 17 LKVSEWANEYRVLAPTAASEPGKWRTERTPYLKEIMDSLSPSSPTEKVVFMKGAQIGGTE 76
Query: 89 SVLENIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTK 148
+ N I Y ID M+ ++ KR P++E + +++ ++
Sbjct: 77 AG-NNWIGYIIDQTP-GPMLVVQPTVEMGKRWSKGRFAPLIESTPC---LKSKVKDPRSR 131
Query: 149 RSGATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTET 208
SG T + E+ GG V+ GA S +RS PV +L DE+ +P G +GDP+ L+
Sbjct: 132 DSGNTVQSKEFPGGIVVIT-GANSSVALRSMPVKYLFLDEIDAYPGDSGGEGDPVLLSIA 190
Query: 209 RTNSFSATRKVLDLSTPLPAGNDTITQRFEKGDQRYWEVPCKHCGGFQRLYFRGNAKKGQ 268
RTN+F A RK+ +STP G I + FE D+RY+ VPC HC +Q L +
Sbjct: 191 RTNTF-ARRKIFLVSTPTIHGISRIEKEFEATDKRYFFVPCPHCNYYQVLKW-------- 241
Query: 269 GRLIWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGHWVPTAKPSRPDFRSYHLSA 328
++ WE + YVC C G++ N K + G W PT + + + + +HLS+
Sbjct: 242 SQIKWENSDS----RTAHYVCTECSGKIENHQKTEMLDRGEWRPTNR-VKGEKKGFHLSS 296
Query: 329 MYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYELKSNRVKPREVYA 388
+Y+P SWQ+ +D +A + L+V+ N LGE + K ++++
Sbjct: 297 LYSPVGWYSWQQAV-------EDFLHAKESEQLLKVWINTTLGETWVDKGEVPDWKQLF- 348
Query: 389 HRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAPSADHSGYVPYVID 448
+RR+ + +G VP + AG VDVQ L V + W S + + ID
Sbjct: 349 NRREFFQIGTVPRREVVLTAG---------VDVQKDRLEVEVVAWGKSRE-----SWSID 394
Query: 449 YEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASELTDVVYEYC--NDW 506
Y+ EGD T GE W KL+E+++ G Y I+ + +DA T VY + +
Sbjct: 395 YQVFEGD--TGGGEVWGKLSELLNHHFIGENGLEYMISMMAVDAGYATQEVYNWVRGHQG 452
Query: 507 GENVIPIRG-------------RDLPIKGAQIKYFNRQMSEKGVEYLSVTVDLYKDRWSP 553
V+ ++G D+ + G ++K +G++ V V + K
Sbjct: 453 SGRVMAVKGVNKALVPLSSPSRVDITVGGQKLK--------RGIKLWPVGVSILKSELFQ 504
Query: 554 TLRKEWSGQGEMPRGMFNAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGGSRN 613
L +G+ G + P +Y K+LT E + +Q W + RN
Sbjct: 505 LLNILKEEEGKALPGYCHFP-EYAPEYFKQLTAE---QLVSKVVKGYTKQEWQKV-RERN 559
Query: 614 EMWDTLIYNTAIFESMVLEACEDILGLEALVWPE 647
E+ D IY A LG++ WPE
Sbjct: 560 EVLDCRIY---------ARAASIALGIDR--WPE 582
>dbj|BAA89621.1| (AB036665) similar to terminase large subunit of lambda
[bacteriophage WO]
Length = 610
Score = 201 bits (510), Expect = 3e-050
Identities = 175/629 (27%), Positives = 278/629 (43%), Gaps = 75/629 (11%)
Query: 29 LAPSEWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
L SEWA R + + PG + + PY +E++D P SP V KGAQ+G T
Sbjct: 17 LKVSEWANEYRVLAATAASEPGKWRTERTPYLKEIMDSLSPSSPAEKVIFMKGAQIGGTE 76
Query: 89 SVLENIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTK 148
+ N I Y ID M+ ++ KR P++E + +++ ++
Sbjct: 77 AG-NNWIGYIIDQTP-GPMLVVQPTVEMGKRWSKGRFAPLIESTPC---LKSKVKDPRSR 131
Query: 149 RSGATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTET 208
SG T + E+ GG V+ GA S +RS PV +L DE+ +P G +GDP+ L+
Sbjct: 132 DSGNTVQSKEFPGGIVVIT-GANSSVGLRSMPVKYLFLDEIDAYPGDSGGEGDPVLLSIA 190
Query: 209 RTNSFSATRKVLDLSTPLPAGNDTITQRFEKGDQRYWEVPCKHCGGFQRLYFRGNAKKGQ 268
RTN+F A RK+ +STP G I + FE D+RY+ VPC HC +Q L +
Sbjct: 191 RTNTF-ARRKIFLVSTPTIHGISRIEKEFEATDKRYFFVPCPHCNYYQVLKW-------- 241
Query: 269 GRLIWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGHWVPTAKPSRPDFRSYHLSA 328
++ WE + P++ Y+C CG ++ N K + G W T + + + +HLS+
Sbjct: 242 SQIKWEDKN----PNTAHYICIECGKKIENHQKTEMLDRGEWRAT-EAKEGEKKGFHLSS 296
Query: 329 MYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYELKSNRVKPREVYA 388
+Y+P W +W +D +A + L+V+ N LGE + K ++++
Sbjct: 297 LYSPV---GW----YSWTQAVEDFLHAKESEQLLKVWINTTLGETWVDKGEVPDWKQLF- 348
Query: 389 HRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAPSADHSGYVPYVID 448
+RR+ + +G VP + AG VDVQ L V + W S ++ + ID
Sbjct: 349 NRREFFPVGTVPRREVVLTAG---------VDVQEDRLEVEVVAWGKSREN-----WSID 394
Query: 449 YEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASELTDVVYEYCNDWGE 508
Y EGD T GE W KL+E+++ G Y I+ + +DA T VY +
Sbjct: 395 YRVFEGD--TGGGEVWGKLSELLNHHFIGENGLEYMISMMAVDAGYATQEVYNWVRGHQG 452
Query: 509 NVIPIRGRDLPIKGAQIKYFNR----------QMSEKGVEYLSVTVDLYKDRWSPTLRKE 558
+ GR + +KG + + Q ++G++ V V + K L
Sbjct: 453 S-----GRVMAVKGVALVPLSSPSRVDITVGGQKLKRGIKLWPVGVSILKSELFQLLNVL 507
Query: 559 WSGQGEMPRGMFNAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGGSRNEMWDT 618
+G+ G + P +Y K+LT E + +Q W + RNE+ D
Sbjct: 508 KEEEGKALPGYCHFP-EYAPEYFKQLTAE---QLVSKVVKGYTKQEWQKV-RERNEVLDC 562
Query: 619 LIYNTAIFESMVLEACEDILGLEALVWPE 647
IY A LG++ WPE
Sbjct: 563 RIY---------ARAASIALGIDR--WPE 580
>ref|NP_297798.1| (NC_002488) phage-related terminase large subunit [Xylella
fastidiosa 9a5c]
pir||D82798 phage-related terminase large subunit XF0508 [imported] - Xylella
fastidiosa (strain 9a5c)
gb|AAF83318.1|AE003899_10 (AE003899) phage-related terminase large subunit [Xylella
fastidiosa 9a5c]
Length = 699
Score = 115 bits (287), Expect = 2e-024
Identities = 116/472 (24%), Positives = 183/472 (38%), Gaps = 49/472 (10%)
Query: 32 SEWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATVSVL 91
S+WA+ R I + PG + D P RE+++C ++PV V+ K Q+GAT +
Sbjct: 35 SQWADAYRKIARGSGAEPGQWRTDRHPPLREIMNCLSDHTPVQQVSFMKSGQIGAT-EIG 93
Query: 92 ENIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTKRSG 151
N + Y ID + ++ V D + F +L+ L + + N+ +
Sbjct: 94 INWVCYVIDRGIDSMIVTQPVKDLARTWTVAKFDPGVLDMPPLLNKLTTNNTFE------ 147
Query: 152 ATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTETRTN 211
+ GG L+ A S ++R + DE+ +P+ + + G + R
Sbjct: 148 ------KQYPGGTLFVKWANSSSQLRQITACYAFLDEIDEYPRNLNNQGTADQQIAARIM 201
Query: 212 SFSATRKVLDLSTPLPAGNDTITQRFEKGDQRYWEVPCKHCGGFQRLYFRGNAKKGQGRL 271
S K+ TP AG I F GDQR++ + C HCG Q L + G
Sbjct: 202 SHGERGKIYRACTPTVAGGSAIEASFLDGDQRHYHIHCPHCGSEQILDL--EHLQPDGTF 259
Query: 272 IWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGHWVPTAKPSRPDFRSYHLSAMYA 331
G I + H ++ E G W P + PD RSY+L A YA
Sbjct: 260 ACAVNGCI--------IQEHHKNTILKERGTG--GTAFWHPHNPSAPPDHRSYYLWAAYA 309
Query: 332 PY-YARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYELKSNRVKPREVYAHR 390
P SW++IA W + A +D +L F N LG ++ + + EV
Sbjct: 310 PLGLGLSWKQIADKWAE-------AKRDPSKLPGFTNLILGLPFQGERDARNAHEVATLA 362
Query: 391 RDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAPSADHSGYVPYVIDYE 450
G VP + AG VD+ H V I A G Y++DY
Sbjct: 363 EPGVYRGLVPRGGLVLAAG---------VDLAHDRAEVHLI-----ATGRGQRRYIVDYA 408
Query: 451 HVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASELTDVVYEY 502
++ D + E + L + + G I+ V ID T+ V ++
Sbjct: 409 VIDLDPTVL--ESYADLDAYLRGTWKTASGIDMPISAVAIDGGNWTETVAQF 458
>ref|NP_298001.1| (NC_002488) phage-related terminase large subunit [Xylella
fastidiosa 9a5c]
ref|NP_299778.1| (NC_002488) phage-related terminase large subunit [Xylella
fastidiosa 9a5c]
pir||C82551 phage-related terminase large subunit XF2500 XF0711 [imported] -
Xylella fastidiosa (strain 9a5c)
gb|AAF83521.1|AE003914_2 (AE003914) phage-related terminase large subunit [Xylella
fastidiosa 9a5c]
gb|AAF85298.1|AE004057_11 (AE004057) phage-related terminase large subunit [Xylella
fastidiosa 9a5c]
Length = 580
Score = 95.9 bits (237), Expect = 1e-018
Identities = 142/561 (25%), Positives = 225/561 (39%), Gaps = 68/561 (12%)
Query: 76 VAVKKGAQVGATVSVLENIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLA 135
V +K GAQVG T+ L + Y I+ + M+ D+ T F + + GL
Sbjct: 8 VTLKWGAQVGKTMLAL-CVQGYCIEMAPRSQMMLQPSQGDLQAWLETKFSPLIAANQGLQ 66
Query: 136 HLIQANDFSKGTKRSGATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQK 195
HLI +K R G + ++ GG L + SP MR P ++ DE+ G+ +
Sbjct: 67 HLI-----AKPRGRDGVNNQRMKSYPGGFLMFAWSGSPKTMRGRSAPLIVCDEIDGYERT 121
Query: 196 VGSDGDPMKLTETRTNSFSATRKVLDLSTPLPAGNDTITQRFEKGDQRYWEVPCKHCGGF 255
+G P+ L R +F R +L++STP G I + GDQR + V C CG
Sbjct: 122 --DEGHPVSLLWQRAATFGDERFLLEISTPTIEGASYIDDAYRAGDQRRFYVRCPACGYE 179
Query: 256 QRLYFRGNAKKGQGRLIWETEGGILV-----PDSVRYVCPHCGGEMINEDKVAFMGEGHW 310
Q L + + G+ + + + P + RYVC CG + ++A + + W
Sbjct: 180 QTLEWEHVSWVGRQS---DPDADLAAIDAHQPHTARYVCQGCGVCWDDGQRIAAVRQARW 236
Query: 311 VPTAKPSRPDFRSYHLSAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDL 370
+KP SY L+ +Y+ + R I Q ++D +LQ F N L
Sbjct: 237 -QASKPFN-GHASYELNELYSTF--RRQSAIVQDYLD--------KLKHQDLQTFTNVSL 284
Query: 371 GEAYELKSNRVKPREVYAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLT 430
+ + + ++ RR L +VP GG +T +D+Q L V
Sbjct: 285 ARVWSETAEQADIDDLL--RRLETYLADVP-------MGGVF--LTAGIDMQTDRLEVEI 333
Query: 431 IGWAPSADHSGYVPYVIDYEHVEGDCKTIEGEGWKKLTEIISTR-QYESGGRLYDIARVG 489
+ W + + I + GD + G+ W L +ST Q+ESG RL I
Sbjct: 334 VAWGIDEE-----SWSIHTAVLYGD--PLLGDVWDALDRYLSTTWQHESGLRL-SIQAAC 385
Query: 490 ID---ASELTDVVYEYCND-WGENVIPIR-----GRDLPIKGAQIKYFNRQMSEKGVEYL 540
+D S T Y+Y G + I+ GR + + AQ K+ R +
Sbjct: 386 LDTGGTSGYTQAAYQYLRTRTGRRLFGIKGVGGWGRPI-VDKAQRKHSGRNAPR--INLF 442
Query: 541 SVTVDLYKDRWSPTLRKEWSGQGEMPRGMFNAPSNIESKYLKELTVEYKREERDAATNKI 600
+V VD K L G G + P++ + +LT E R +
Sbjct: 443 TVGVDEAKLIVMRRLAITQPGP-----GYSHFPADRSPDWFAQLTAEKLRTR--YLKGQP 495
Query: 601 IRQVWHRPGGSRNEMWDTLIY 621
+RQ W +P + NE D +Y
Sbjct: 496 VRQ-WTKPDKTPNEALDCRVY 515
>ref|NP_461544.1| (NC_003197) Gifsy-1 prophage: similar to terminase large chain gp2
of N15 [Salmonella typhimurium LT2]
gb|AAL21503.1| (AE008818) Gifsy-1 prophage: similar to terminase large chain gp2
of N15 [Salmonella typhimurium LT2]
Length = 643
Score = 92.8 bits (229), Expect = 1e-017
Identities = 146/628 (23%), Positives = 246/628 (38%), Gaps = 90/628 (14%)
Query: 29 LAPSEWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
L +WA+ ++P++ S PG ++ P+ +++ Y + V + K A+VG T
Sbjct: 28 LTTVQWADEYYYLPKESSYTPG--KWETLPFQVAIMNAMG-YELIRVVNLIKSARVGYTK 84
Query: 89 SVLENIIAYGIDYVKTASMIFATVD---DDVTKRRLTNFI--LPMLEHSGLAHLIQANDF 143
+L + Y I++ S++F D +D K + I +P+L L A F
Sbjct: 85 MLL-GVEGYFIEHKSRNSLLFQPTDSSAEDFMKSHVEPTIRDVPVL-------LELAPWF 136
Query: 144 SKGTKRSGATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPM 203
+ + + T K + G + G + R V + DE+S + V +G P
Sbjct: 137 GRKHRDNTLTLK--RFSSGVGFWCLGGAAAKNYREKSVDVVCYDELSSFEPDVEKEGSPT 194
Query: 204 KLTETRTNSFSATRKVLDLSTPLPAGNDTITQRF-EKGDQRYWEVPCKHCGGFQRLYFRG 262
L + R S K + STP G+ I + E + VPC HCG Q L F G
Sbjct: 195 LLGDKRIEG-SVWPKSIRGSTPKVKGSCQIEKAANESAHFMRFYVPCPHCGEEQYLKF-G 252
Query: 263 NAKKGQGRLIWETEGGILVPDSVRYVCPHCG--------------------GEMINEDKV 302
+ G L WE P++V Y+C H G G +
Sbjct: 253 DGSTPFG-LKWEKSK----PETVYYLCEHNGCVIRQSELDQKAGRWICDNTGMWTRDGLA 307
Query: 303 AFMGEGHWVPTAKPSRPDFRSYHLSAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDEL 362
F G VP P ++H+ Y+P+ +W +I W+D A KD + +
Sbjct: 308 YFSASGEEVPP-----PRSITFHIWTAYSPF--TTWIQIIYDWLD-------ALKDPNGV 353
Query: 363 QVFYNNDLGEAYELKSNRVKPREVYAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQ 422
+ F N LGE YE E A + L E ++A A + +T +D Q
Sbjct: 354 KTFINTTLGEPYE---------EAVAEKLSHELLLEKVIHYA-APVPERVVYLTAGIDSQ 403
Query: 423 HTWLAVLTIGWAPSADHSGYVPYVIDYEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRL 482
+ GWAP G ++ID + + G + + + +++ +I+ + + G
Sbjct: 404 RNRYEMYVWGWAP-----GEEAFLIDKQIIMG--RHDDEDTLQRVDAVINKKYRHADGTD 456
Query: 483 YDIARV-----GIDASELTDVVYEYCNDWG-ENVIPIRGRDLPIKGAQIKYFNRQMSEKG 536
I+R+ GIDA ++VY+ G V+P++G + G + ++ ++ G
Sbjct: 457 ISISRICWDIGGIDA----EIVYKRSKKHGIFRVLPVKGAS--VYGKPVITMPKKRNQSG 510
Query: 537 VEYLSVTVDLYKDRWSPTLRKEWSGQGEMPRGMFNAPSNIESKYLKELTVEYKREERDAA 596
V + D K+ + + E P N + E E +
Sbjct: 511 VFLCEIGTDTAKEMLYARMGAVTAPADEATPYAIRFPDNPDVFTEVEAKQLVAEELVEKL 570
Query: 597 TNKIIRQVWHRPGGSRNEMWDTLIYNTA 624
N R +W G RNE D L+Y +A
Sbjct: 571 VNGKFRLLWDAK-GRRNEALDCLVYASA 597
>ref|NP_310206.1| (NC_002695) putative terminase large subunit [Escherichia coli
O157:H7]
dbj|BAB35602.1| (AP002557) putative terminase large subunit [Escherichia coli
O157:H7]
Length = 641
Score = 92.4 bits (228), Expect = 1e-017
Identities = 142/643 (22%), Positives = 251/643 (38%), Gaps = 69/643 (10%)
Query: 33 EWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATVSVLE 92
EWA+ ++P++ + G ++ P+ R +++ + V V K A+VG + +L
Sbjct: 32 EWADASYYLPKESAYQEG--RWETLPFQRAIMNAMGS-DYIREVNVVKSARVGYSKMLL- 87
Query: 93 NIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTKRSGA 152
+ AY I++ + ++I+ D D T+ + + L L A + K + +
Sbjct: 88 GVYAYFIEHKQRNTLIWLPTDGDAENFMKTHVEPTIRDIPSLLSL--APWYGKKHRDNTL 145
Query: 153 TAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTETRTNS 212
T K + G + G ++ R V DE++ + + + +G P L + R
Sbjct: 146 TMK--RFTNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDEDIEQEGSPTFLGDKRIEG 203
Query: 213 FSATRKVLDLSTPLPAGNDTITQRFEKGDQ-RYWEVPCKHCGGFQRLYFRGNAKKGQGRL 271
S K + STP G I + + + V C HCG Q L F K+ L
Sbjct: 204 -SVWPKSIRGSTPKVRGTCQIERAASESPHFMRFHVACPHCGEEQYLKF--GDKETPFGL 260
Query: 272 IWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGH--------------WVPTA--K 315
W + P SV Y+C H +I + ++ F + W ++ +
Sbjct: 261 KWTPDD----PSSVFYLCEH-NACVIRQQELDFTDARYICEKTGIWTRDGILWFSSSGEE 315
Query: 316 PSRPDFRSYHLSAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYE 375
PD ++H+ Y+P+ +W +I + W+ D + + F N LGE +E
Sbjct: 316 IEPPDSVTFHIWTAYSPF--TTWVQIVKDWM-------KTKGDTGKRKTFVNTTLGETWE 366
Query: 376 LKSNRVKPREVYAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAP 435
K EV A R++ Y+ VP+ A AG +D Q + GW P
Sbjct: 367 AKIGERPDAEVMAERKEHYS-ASVPDRVAYLTAG---------IDSQLDRYEMRVWGWGP 416
Query: 436 SADHSGYVPYVIDYEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASEL 495
G ++ID + + G + + + ++ E I+ G ++R+ D +
Sbjct: 417 -----GEESWLIDRQIIMG--RHDDEQTLLRVDEAINKTYTRRNGAEMSVSRICWDTGGI 469
Query: 496 -TDVVYEYCNDWG-ENVIPIRGRDLPIKGAQIKYFNRQMSEKGVEYLSVTVDLYKDRWSP 553
+VYE G VIPI+G + G + R+ ++ GV + D K++
Sbjct: 470 DPTIVYERSKKHGLFRVIPIKGAS--VYGKPVASMPRKRNKNGVYLTEIGTDTAKEQIYN 527
Query: 554 TLRKEWSGQGEMPRGMF--NAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGGS 611
G +P + N P + ++LT E + E+ KI+ R
Sbjct: 528 RFTLTPEGDEPLPGAVHFPNNPDIFDLTEAQQLTAEEQVEKWVDGRKKILWDSKKR---- 583
Query: 612 RNEMWDTLIYNTAIFESMVLEACEDILGLEALVWPEFWAIAGK 654
RNE D +Y A + D+ L A + E A K
Sbjct: 584 RNEALDCFVYALAALRISISRWQLDLSALLASLQEEDGAATNK 626
>ref|NP_287368.1| (NC_002655) putative DNA packaging protein of prophage CP-933X
[Escherichia coli O157:H7 EDL933]
ref|NP_309657.1| (NC_002695) terminase large subunit [Escherichia coli O157:H7]
gb|AAG55980.1|AE005330_12 (AE005330) putative DNA packaging protein of prophage CP-933X
[Escherichia coli O157:H7 EDL933]
dbj|BAB35053.1| (AP002555) terminase large subunit [Escherichia coli O157:H7]
Length = 641
Score = 92.0 bits (227), Expect = 2e-017
Identities = 142/643 (22%), Positives = 251/643 (38%), Gaps = 69/643 (10%)
Query: 33 EWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATVSVLE 92
EWA+ ++P++ + G ++ P+ R +++ + V V K A+VG + +L
Sbjct: 32 EWADASYYLPKESAYQEG--RWETLPFQRAIMNAMGS-DYIREVNVVKSARVGYSKMLL- 87
Query: 93 NIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTKRSGA 152
+ AY I++ + ++I+ D D T+ + + L L A + K + +
Sbjct: 88 GVYAYFIEHKQRNTLIWLPTDGDAENFMKTHVEPTIRDIPSLLAL--APWYGKKHRDNTL 145
Query: 153 TAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTETRTNS 212
T K + G + G ++ R V DE++ + + + +G P L + R
Sbjct: 146 TMK--RFTNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDEDIEQEGSPTFLGDKRIEG 203
Query: 213 FSATRKVLDLSTPLPAGNDTITQRFEKGDQ-RYWEVPCKHCGGFQRLYFRGNAKKGQGRL 271
S K + STP G I + + + V C HCG Q L F K+ L
Sbjct: 204 -SVWPKSIRGSTPKVRGTCQIERAASESPHFMRFHVACPHCGEEQYLKF--GDKETPFGL 260
Query: 272 IWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGH--------------WVPTA--K 315
W + P SV Y+C H +I + ++ F + W ++ +
Sbjct: 261 KWTPDD----PSSVFYLCEH-NACVIRQQELDFTDARYICEKTGIWTRDGILWFSSSGEE 315
Query: 316 PSRPDFRSYHLSAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYE 375
PD ++H+ Y+P+ +W +I + W+ D + + F N LGE +E
Sbjct: 316 IEPPDSVTFHIWTAYSPF--TTWVQIVKDWM-------KTKGDTGKRKTFVNTTLGETWE 366
Query: 376 LKSNRVKPREVYAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAP 435
K EV A R++ Y+ VP+ A AG +D Q + GW P
Sbjct: 367 AKIGERPDAEVMAERKEHYS-ASVPDRVAYLTAG---------IDSQLDRYEMRVWGWGP 416
Query: 436 SADHSGYVPYVIDYEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASEL 495
G ++ID + + G + + + ++ E I+ G ++R+ D +
Sbjct: 417 -----GEESWLIDRQIIMG--RHDDEQTLLRVDEAINKTYTRRNGAEMSVSRICWDTGGI 469
Query: 496 -TDVVYEYCNDWG-ENVIPIRGRDLPIKGAQIKYFNRQMSEKGVEYLSVTVDLYKDRWSP 553
+VYE G VIPI+G + G + R+ ++ GV + D K++
Sbjct: 470 DPTIVYERSKKHGLFRVIPIKGAS--VYGKPVASMPRKRNKNGVYLTEIGTDTAKEQIYN 527
Query: 554 TLRKEWSGQGEMPRGMF--NAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGGS 611
G +P + N P + ++LT E + E+ KI+ R
Sbjct: 528 RFTLTPEGDEPLPGAVHFPNNPDIFDLTEAQQLTAEEQVEKWVDGRKKILWDSKKR---- 583
Query: 612 RNEMWDTLIYNTAIFESMVLEACEDILGLEALVWPEFWAIAGK 654
RNE D +Y A + D+ L A + E A K
Sbjct: 584 RNEALDCFVYALAALRISISRWQLDLSALLASLQEEDGAATNK 626
>ref|NP_040581.1| (NC_001416) DNA packaging protein [bacteriophage lambda]
sp|P03708|TERL_LAMBD TERMINASE LARGE SUBUNIT (DNA PACKAGING PROTEIN A) (GPA)
pir||JVBPAL DNA-packaging protein A - phage lambda
gb|AAA96534.1| (J02459) A (DNA packaging;641) [bacteriophage lambda]
Length = 641
Score = 91.3 bits (225), Expect = 3e-017
Identities = 143/643 (22%), Positives = 250/643 (38%), Gaps = 69/643 (10%)
Query: 33 EWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATVSVLE 92
EWA+ ++P++ + G ++ P+ R +++ + V V K A+VG + +L
Sbjct: 32 EWADANYYLPKESAYQEG--RWETLPFQRAIMNAMGS-DYIREVNVVKSARVGYSKMLL- 87
Query: 93 NIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTKRSGA 152
+ AY I++ + ++I+ D D T+ + + L L A + K + +
Sbjct: 88 GVYAYFIEHKQRNTLIWLPTDGDAENFMKTHVEPTIRDIPSLLAL--APWYGKKHRDNTL 145
Query: 153 TAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTETRTNS 212
T K + G + G ++ R V DE++ + + +G P L + R
Sbjct: 146 TMK--RFTNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDDDIEQEGSPTFLGDKRIEG 203
Query: 213 FSATRKVLDLSTPLPAGNDTITQRFEKGDQ-RYWEVPCKHCGGFQRLYFRGNAKKGQGRL 271
S K + STP G I + + + V C HCG Q L F K+ L
Sbjct: 204 -SVWPKSIRGSTPKVRGTCQIERAASESPHFMRFHVACPHCGEEQYLKF--GDKETPFGL 260
Query: 272 IWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGH--------------WVPTA--K 315
W + P SV Y+C H +I + ++ F + W ++ +
Sbjct: 261 KWTPDD----PSSVFYLCEH-NACVIRQQELDFTDARYICEKTGIWTRDGILWFSSSGEE 315
Query: 316 PSRPDFRSYHLSAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYE 375
PD ++H+ Y+P+ +W +I + W+ D + + F N LGE +E
Sbjct: 316 IEPPDSVTFHIWTAYSPF--TTWVQIVKDWM-------KTKGDTGKRKTFVNTTLGETWE 366
Query: 376 LKSNRVKPREVYAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAP 435
K EV A R++ Y+ VP+ A AG +D Q + GW P
Sbjct: 367 AKIGERPDAEVMAERKEHYS-APVPDRVAYLTAG---------IDSQLDRYEMRVWGWGP 416
Query: 436 SADHSGYVPYVIDYEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASEL 495
G ++ID + + G + + + ++ E I+ G I+R+ D +
Sbjct: 417 -----GEESWLIDRQIIMG--RHDDEQTLLRVDEAINKTYTRRNGAEMSISRICWDTGGI 469
Query: 496 -TDVVYEYCNDWG-ENVIPIRGRDLPIKGAQIKYFNRQMSEKGVEYLSVTVDLYKDRWSP 553
+VYE G VIPI+G + G + R+ ++ GV + D K++
Sbjct: 470 DPTIVYERSKKHGLFRVIPIKGAS--VYGKPVASMPRKRNKNGVYLTEIGTDTAKEQIYN 527
Query: 554 TLRKEWSGQGEMPRGMF--NAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGGS 611
G +P + N P + ++LT E + E+ KI+ R
Sbjct: 528 RFTLTPEGDEPLPGAVHFPNNPDIFDLTEAQQLTAEEQVEKWVDGRKKILWDSKKR---- 583
Query: 612 RNEMWDTLIYNTAIFESMVLEACEDILGLEALVWPEFWAIAGK 654
RNE D +Y A + D+ L A + E A K
Sbjct: 584 RNEALDCFVYALAALRISISRWQLDLSALLASLQEEDGAATNK 626
>sp|P36693|TERL_BPP21 TERMINASE LARGE SUBUNIT (GP2)
pir||B49849 terminase large chain gp2 - phage 21
gb|AAA32340.1| (M81255) terminase large subunit [Bacteriophage 21]
Length = 642
Score = 89.0 bits (219), Expect = 2e-016
Identities = 138/611 (22%), Positives = 238/611 (38%), Gaps = 62/611 (10%)
Query: 29 LAPSEWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
L +WA + ++P++ S PG ++ P+ +++C + V + K A+VG T
Sbjct: 28 LTTVQWANKHYYLPKESSYTPG--RWETLPFQVGIMNCMGN-DLIRTVNLIKSARVGYTK 84
Query: 89 SVLENIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTK 148
+L + AY I++ S++F D + + + + P + A L A F + +
Sbjct: 85 MLL-GVEAYFIEHKSRNSLLFQPTDS-AAEDFMKSHVEPTIRDVP-AMLELAPWFGRKHR 141
Query: 149 RSGATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTET 208
+ T K + G + G + R V + DE+S + V +G P L +
Sbjct: 142 DNTLTLK--RFSSGVGFWCLGGAAAKNYREKSVDVVCYDELSSFEPDVEKEGSPTLLGDK 199
Query: 209 RTNSFSATRKVLDLSTPLPAGNDTITQRF-EKGDQRYWEVPCKHCGGFQRLYFRGNAKKG 267
R S K + STP G+ I + E + VPC HCG Q L F +A
Sbjct: 200 RIEG-SVWPKSIRGSTPKIKGSCQIEKAANESAHFMRFYVPCPHCGEEQYLKFGDDASPF 258
Query: 268 QGRLIWETEGGILVPDSVRYVCPHCG------------GEMINEDKVAFMGEGHWVPTAK 315
L WE P+SV Y+C H G G I E+ + +G +A+
Sbjct: 259 G--LKWEKNK----PESVFYLCEHHGCVIHQSELDQSNGRWICENTGMWTRDGLMFFSAR 312
Query: 316 PSR---PDFRSYHLSAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGE 372
P ++H+ Y+P+ +W +I W+D A KD + L+ F N LGE
Sbjct: 313 GDEIPPPRSITFHIWTAYSPF--TTWVQIVYDWLD-------ALKDPNGLKTFVNTTLGE 363
Query: 373 AYELKSNRVKPREVYAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIG 432
+E +V + Y A + +T +D Q + G
Sbjct: 364 TWEEAVGEKLDHQVLMDKVVRYT----------AAVPARVVYLTAGIDSQRNRFEMYVWG 413
Query: 433 WAPSADHSGYVPYVIDYEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDA 492
WAP G +++D + G + E E ++ I+ + + G I+RV D
Sbjct: 414 WAP-----GEEAFLVDKIIIMG--RPDEEETLLRVDAAINKKYRHADGTEMTISRVCWDI 466
Query: 493 SELT-DVVYEYCNDWGE-NVIPIRGRDLPIKGAQIKYFNRQMSEKGVEYLSVTVDLYKDR 550
+ ++VY+ G V+P++G + G + + +++GV V D K+
Sbjct: 467 GGIDGEIVYQRSKKHGVFRVLPVKGAS--VYGKPVITMPKTRNQRGVYLCEVGTDTAKEI 524
Query: 551 WSPTLRKEWSGQGEMPRGMFNAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGG 610
++ + + E P + E E E + +R +W
Sbjct: 525 LYARMKADPTPVDEATSYAIRFPDDPEIFSQTEAQQLVAEELVEKWEKGKMRLLWDNK-K 583
Query: 611 SRNEMWDTLIY 621
RNE D L+Y
Sbjct: 584 RRNEALDCLVY 594
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.318 0.136 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 429,114,867
Number of Sequences: 919193
Number of extensions: 19621958
Number of successful extensions: 44335
Number of sequences better than 5.0e-02: 21
Number of HSP's better than 0.1 without gapping: 4
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 44260
Number of HSP's gapped (non-prelim): 28
length of query: 675
length of database: 288,098,425
effective HSP length: 127
effective length of query: 548
effective length of database: 171,360,914
effective search space: 93905780872
effective search space used: 93905780872
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 94 (40.8 bits)
Query= orf4 [960-1053]
(92 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.320 0.136 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 47,051,056
Number of Sequences: 919193
Number of extensions: 1521961
Number of successful extensions: 3464
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3464
Number of HSP's gapped (non-prelim): 0
length of query: 92
length of database: 288,098,425
effective HSP length: 68
effective length of query: 24
effective length of database: 225,593,301
effective search space: 5414239224
effective search space used: 5414239224
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf5 [1074-1620]
(545 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
sp|P36272|VG04_BPP21 PORTAL PROTEIN (HEAD PROTEIN GP4) >gi|... 77 6e-013
ref|NP_518978.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED... 74 5e-012
ref|NP_287794.1| (NC_002655) putative capsid protein of pro... 73 9e-012
ref|NP_459888.1| (NC_003197) Fels-1 prophage [Salmonella ty... 73 9e-012
ref|NP_460007.1| (NC_003197) Gifsy-2 prophage [Salmonella t... 73 9e-012
ref|NP_287587.1| (NC_002655) putative capsid assembly prote... 71 3e-011
ref|NP_461542.1| (NC_003197) Gifsy-1 prophage: similar to h... 69 1e-010
ref|NP_040583.1| (NC_001416) capsid component [bacteriophag... 67 6e-010
ref|NP_287370.1| (NC_002655) putative capsid structural pro... 65 2e-009
emb|CAC83156.1| (AJ304858) putative portal protein [Escheri... 64 3e-009
ref|NP_288458.1| (NC_002655) putative head-tail preconnecto... 64 3e-009
ref|NP_297800.1| (NC_002488) conserved hypothetical protein... 49 1e-004
>sp|P36272|VG04_BPP21 PORTAL PROTEIN (HEAD PROTEIN GP4)
pir||JN0538 head protein gp4 - phage 21
gb|AAA32342.1| (M81255) head-tail preconnector gp5 [Bacteriophage 21]
Length = 530
Score = 76.6 bits (187), Expect = 6e-013
Identities = 92/410 (22%), Positives = 169/410 (40%), Gaps = 37/410 (9%)
Query: 103 DDQADAWAENVNDLFEAWGELPEVCDFTKRYTFGQWQRVAKAEALIAGDVLVVEHHNSAT 162
D+ AW+E +F GE+ D + TF ++ R G++ V ++ T
Sbjct: 106 DEVEAAWSEYAEGMF---GEI----DVEGKRTFTEFIREGVGVHAFNGEIFVQPVWDTET 158
Query: 163 EL---PTYELIDGDMVQTPYDAMNKIGKNHRIDNGVEMTKDGEHVAFWVVQEGF------ 213
++ + V TP M N + GVE+ + G VA+ + ++ F
Sbjct: 159 TQLFRTRFKAVSPKRVDTPGHGMG----NRFLRAGVEVDRYGRAVAYHICEDDFPFSGSG 214
Query: 214 KHKRIKATGANGRRIAWMYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGA 273
+ +RI GR A ++ + RG VM+ + +D ++ + A + A
Sbjct: 215 RWERIPRELPTGRP-AMLHIFEPVEDGQTRGANQFYSVMERLKMLDSLQATQLQSAIVKA 273
Query: 274 QIVGFIQRQIGGKTPGTKPFTSGAQRRVQDQDFVGD----GTVRQTNIAETPFGMMVEGL 329
I+ ++ T + +GA + +D + + TN T G+ + L
Sbjct: 274 MYAATIESELD--TEKAFEYIAGAPQEQKDNPLINILEKFSSWYDTNNV-TLGGVKIPHL 330
Query: 330 AEGEEIKAFTNDTTDEKFGEFEEAILRAVAWALGMPFSVLAMQFDS-SYSASRGEIKEFE 388
G+++K T +D F E+A+LR +A LG+ + L+ + SYS++R E
Sbjct: 331 FPGDDLKLQTAQDSDNGFSALEQALLRYIAAGLGVSYEQLSRDYSKVSYSSARASANESW 390
Query: 389 SVIHEMRDRDADTLLRPIYRSWLRTMVLTRRIDAPGFLAAARNPKEF-PTFAAWTRSQWF 447
R A L ++ WL +L G + R +F +AW+R++W
Sbjct: 391 RYFMGRRKFIAARLATQMFSCWLEEALLR------GIIRPPRARFDFYQARSAWSRAEWI 444
Query: 448 GMVKEAVDLEKETRGRGEQIKLGALTRTKAARMLTGTSFKTNTRQLAKEN 497
G + A+D KE + +I+ G T K ++ G ++ RQ +E+
Sbjct: 445 GAGRMAIDGLKEVQESVMRIEAGLSTYEKGLALM-GEDYQDIFRQQVRES 493
>ref|NP_518978.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD14559.1| (AL646061) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 508
Score = 73.6 bits (179), Expect = 5e-012
Identities = 103/472 (21%), Positives = 176/472 (36%), Gaps = 59/472 (12%)
Query: 43 GGLGPVEILYTDYWSQRRRSNTFFKKNLYARGIIRRIVGNVVNTGQVXXXXXXXXXXXXS 102
G G V L R +S ++N +A+ I V N V TG S
Sbjct: 41 GNPGAVAALLASGEDLRIKSRDLVRRNAWAQAGIEAFVANAVGTG----------IKPQS 90
Query: 103 DDQADAWAENVNDLFEAWGELPEVCDFTKRYTFGQWQRVAKAEALIAGDVLVV----EHH 158
DA+ +V L+ W + T Y Q +A L G+ L+
Sbjct: 91 LSTDDAFKADVQALWRDWTAEADAAGQTDFYGL---QALACRAMLEGGECLIRLRPRRPE 147
Query: 159 NSATELPTYELIDGDMVQTPYDAMNKIGKNHRIDNGVEMTKDGEHVAFWVVQEGFKHKRI 218
+ T +L++ + + P + + + +G+E G VA+ + + R+
Sbjct: 148 DGLTVPLQLQLLEAEHL--PMTLNVDLPSGNAVRSGIEFDGLGRRVAYHLYRSHPDDGRL 205
Query: 219 KATGANGRRIAWMYYGTDKRHT-------EARGEPLLTLVMQNIAEIDKMRDATQRKATL 271
G ++ H + RGEP L+ + + E+D+ DA +
Sbjct: 206 APMSGQGGLDTVRVDASEIIHLYRVLRPGQIRGEPWLSRALVKLNELDQYDDAELVRKKT 265
Query: 272 GAQIVGFIQRQIGGKTPGTKPFTSGAQRRVQDQDFVGDGTVRQTNIAETPFGM-MVEGLA 330
A GF+ RQ + + +G+G + I+ ++ L
Sbjct: 266 AAMFAGFVTRQ------------------SPEDNLMGEGLPDEAGISLVGLEPGTLQILE 307
Query: 331 EGEEIKAFTNDTTDEKFGEFEEAILRAVAWALGMPFSVLAMQFDS-SYSASRGEIKEF-- 387
GE+IK +GEF RAVA ALG+ + L +YS+ R + EF
Sbjct: 308 PGEDIKFSDPADVGGSYGEFLRTQFRAVAAALGITYEQLTGDLTGVNYSSIRAGLLEFRR 367
Query: 388 --ESVIHEMRDRDADTLLRPIYRSWLRTMVLTRRIDAPGFLAAARNPKEFPTFAAWTRSQ 445
E V H + + RP++ +W++ VL+ + APGF + W
Sbjct: 368 RCEMVQHSVL---VHQMCRPVWAAWMKQAVLSGALVAPGFARGGAARRRQYLQVKWIPQG 424
Query: 446 WFGMVKEAVDLEKETRGRGEQIKLGALTRTKAARMLTGTSFKTNTRQLAKEN 497
W + VD EKE + I+ G ++R++A G + R++A +N
Sbjct: 425 W-----QWVDPEKEFKAMLLAIRAGLMSRSEAISTF-GYDAEDIDREIAADN 470
>ref|NP_287794.1| (NC_002655) putative capsid protein of prophage CP-933R
[Escherichia coli O157:H7 EDL933]
ref|NP_309134.1| (NC_002695) putative portal protein [Escherichia coli O157:H7]
ref|NP_310760.1| (NC_002695) putative portal protein [Escherichia coli O157:H7]
gb|AAG56408.1|AE005368_14 (AE005368) putative capsid protein of prophage CP-933R [Escherichia
coli O157:H7 EDL933]
dbj|BAB34530.1| (AP002554) putative portal protein [Escherichia coli O157:H7]
dbj|BAB36156.1| (AP002559) putative portal protein [Escherichia coli O157:H7]
Length = 530
Score = 72.8 bits (177), Expect = 9e-012
Identities = 90/404 (22%), Positives = 165/404 (40%), Gaps = 34/404 (8%)
Query: 113 VNDLFEAWGELPE----VCDFTKRYTFGQWQRVAKAEALIAGDVLVVEHHNSATEL---P 165
V+++ AW E E D + TF ++ R G++ V ++ T
Sbjct: 105 VDEVEAAWSEYAEGMSGEIDVEGKRTFTEFIREGVGVHAFNGEIFVQPVWDTETTQLFRT 164
Query: 166 TYELIDGDMVQTPYDAMNKIGKNHRIDNGVEMTKDGEHVAFWVVQEGF------KHKRIK 219
++ + V TP M N + GVE+ + G VA+ + ++ F + +RI
Sbjct: 165 RFKAVSPKRVDTPGHGMG----NRFLRAGVEVDRYGRAVAYHICEDDFPRSGSGRWERIP 220
Query: 220 ATGANGRRIAWMYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFI 279
GR A ++ + RG VM+ + +D ++ + A + A I
Sbjct: 221 RELPTGRP-AMLHIFEPVEDGQTRGANQFYSVMERLKMLDSLQATQLQSAIVKAMYAATI 279
Query: 280 QRQIGGKTPGTKPFTSGAQRRVQDQDFVGD----GTVRQTNIAETPFGMMVEGLAEGEEI 335
+ + T + +GA + +D + + TN T G+ + L G+++
Sbjct: 280 ESDLD--TEKAFEYIAGAPQGQKDNPLINILEKFSSWYDTNNV-TLGGVKIPHLFPGDDL 336
Query: 336 KAFTNDTTDEKFGEFEEAILRAVAWALGMPFSVLAMQFDS-SYSASRGEIKEFESVIHEM 394
K T +D F E+A+LR +A LG+ + L+ + SYS++R E
Sbjct: 337 KLQTAQDSDNGFSALEQALLRYIAAGLGVSYEQLSRDYSKVSYSSARASANESWRYFMGR 396
Query: 395 RDRDADTLLRPIYRSWLRTMVLTRRIDAPGFLAAARNPKEF-PTFAAWTRSQWFGMVKEA 453
R A L ++ WL +L G + R +F +AW+R++W G + A
Sbjct: 397 RKFIAARLATQMFSYWLEEALLR------GIIRPPRARFDFYQARSAWSRAEWIGAGRMA 450
Query: 454 VDLEKETRGRGEQIKLGALTRTKAARMLTGTSFKTNTRQLAKEN 497
+D KE + +I+ G T K ++ G ++ RQ +E+
Sbjct: 451 IDGLKEVQESVMRIEAGLSTYEKEPALM-GEDYQDIFRQQVRES 493
>ref|NP_459888.1| (NC_003197) Fels-1 prophage [Salmonella typhimurium LT2]
gb|AAL19847.1| (AE008738) Fels-1 prophage [Salmonella typhimurium LT2]
Length = 511
Score = 72.8 bits (177), Expect = 9e-012
Identities = 77/322 (23%), Positives = 125/322 (37%), Gaps = 37/322 (11%)
Query: 110 AENVNDLFEAWGELPEVCDFTKRYTFGQWQRVAKAEALIAGDVLVVEHHNSATELPTYEL 169
AE + + W PEV T +YT +R+ L G+V + L
Sbjct: 107 AEKIRRCWAEWSVSPEV---TGQYTRPVLERLMLRTWLRDGEVFTQVLTGKISGLSPVAG 163
Query: 170 IDG--DMVQTPYDAMNKIGKNHRIDNGVEMTKDGEHVAFWVVQE----GFKHKRIKATGA 223
+ + ++ Y + + N + G+ + V + V Q G +K A
Sbjct: 164 VPFWLEALEPDYIPLERTDNNSNLVQGIYFNEWRRPVKYLVCQSWPGAGAAAVAVKEVTA 223
Query: 224 NGRRIAWMYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFIQRQI 283
++ +R +ARG LL V+ + ++ + D+ + A + A + FI++Q
Sbjct: 224 ENM----LHLRFTRRLNQARGASLLAPVIIRLMDLKEYEDSERIAARIAASLGMFIKKQ- 278
Query: 284 GGKTPGTKPFTSGAQRRVQDQDFVGDGTVRQTNIAETPF--GMMVEGLAEGEEIKAFTND 341
D DG V ET GM+ +GL GE+I +D
Sbjct: 279 ---------------------DVGTDGYVAPEKRKETQIQPGMLFDGLNPGEDIGMIKSD 317
Query: 342 TTDEKFGEFEEAILRAVAWALGMPFSVLAMQFDSSYSASRGEIKEFESVIHEMRDRDADT 401
+ F LRAVA L FS +A +D +YSA R E+ E + ++D
Sbjct: 318 RPNAGLESFRMGQLRAVAAGLRGSFSSIARNYDGTYSAQRQELVEAQEGYSILQDSFIAA 377
Query: 402 LLRPIYRSWLRTMVLTRRIDAP 423
RP+YR WL V + I+ P
Sbjct: 378 FTRPLYRRWLAAAVASGAIEVP 399
>ref|NP_460007.1| (NC_003197) Gifsy-2 prophage [Salmonella typhimurium LT2]
gb|AAL19966.1| (AE008744) Gifsy-2 prophage [Salmonella typhimurium LT2]
Length = 437
Score = 72.8 bits (177), Expect = 9e-012
Identities = 93/404 (23%), Positives = 161/404 (39%), Gaps = 46/404 (11%)
Query: 107 DAWAENVNDLFEAWGELPEVCDFTKRYTFGQWQRVAKAEALIAGDVLVVEHHNSATELPT 166
+A AE + + W P+V T +YT +R+ L G+V A ++P
Sbjct: 26 NALAEQIRARWAEWSVSPDV---TGQYTRPVLERLLLRTWLRDGEVF---SQMVAGKMPG 79
Query: 167 YELIDG-----DMVQTPYDAMNKIGKNHRIDNGVEMTKDGEHVAFWVVQE---GFKHKRI 218
E + G + ++ Y M + + + G+ D + ++V + GF +
Sbjct: 80 LEPVAGVPFWLEAMEPDYVPMEQTDSTNNLIQGIYFN-DWQRPKSYIVCKSWPGFATAMV 138
Query: 219 KATGANGRRIAWMYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGF 278
+ + + + +R +ARG LL V+ + ++ + D+ + A + A F
Sbjct: 139 ATKLIDAENMLHLKF--TRRLNQARGVTLLAPVIIRLLDLKEYEDSERLAARISAAFAMF 196
Query: 279 IQRQIGGKTPGTKPFTSGAQRRVQDQDFVGDGTVRQTNIAETPFGMMVEGLAEGEEIKAF 338
I+R + VQD D D + ++ P G +++ L GE+I
Sbjct: 197 IRR---------------SDAMVQDGD-APDYADKDRDLDIEP-GTILKDLLPGEDIGTI 239
Query: 339 TNDTTDEKFGEFEEAILRAVAWALGMPFSVLAMQFDSSYSASRGEIKEFESVIHEMRDRD 398
+D + F LRAVA + FS +A +D +YSA R E+ E + ++D
Sbjct: 240 KSDRPNANLESFRMGQLRAVAAGVRGSFSSIARNYDGTYSAQRQELVEAQEGYAILQDNF 299
Query: 399 ADTLLRPIYRSWLRTMVLTRRIDAPGFLAAARNPKEFPTFAAWTRSQWFGMVKEAVDLEK 458
+ RP+YR WL T + ID P A + + G V +D K
Sbjct: 300 IAAVSRPVYRRWLATAITAGVIDVP----------TDTDMATLFNAVYSGPVMPWIDPLK 349
Query: 459 ETRGRGEQIKLGALTRTKAARMLTGTSFKTNTRQLAK--ENQLL 500
E I+ GA T + R G + R+ A+ EN+ L
Sbjct: 350 EANAWRILIRGGAATESDWVRARGGAPAEVKRRRKAEIDENRKL 393
>ref|NP_287587.1| (NC_002655) putative capsid assembly protein of prophage CP-933O
[Escherichia coli O157:H7 EDL933]
gb|AAG56199.1|AE005349_3 (AE005349) putative capsid assembly protein of prophage CP-933O
[Escherichia coli O157:H7 EDL933]
Length = 530
Score = 70.9 bits (172), Expect = 3e-011
Identities = 89/404 (22%), Positives = 164/404 (40%), Gaps = 34/404 (8%)
Query: 113 VNDLFEAWGELPE----VCDFTKRYTFGQWQRVAKAEALIAGDVLVVEHHNSATEL---P 165
V+++ AW E E D + TF ++ R G++ V ++ T
Sbjct: 105 VDEVEAAWSEYAEGMSGEIDVEGKRTFTEFIREGVGVHAFNGEIFVQPVWDTETTQLFRT 164
Query: 166 TYELIDGDMVQTPYDAMNKIGKNHRIDNGVEMTKDGEHVAFWVVQEGF------KHKRIK 219
++ + V TP M N + GV + + G VA+ + ++ F + +RI
Sbjct: 165 RFKAVSPKRVDTPGHGMG----NRFLRXGVXVDRYGRAVAYHICEDDFPRSGSGRWERIP 220
Query: 220 ATGANGRRIAWMYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFI 279
GR A ++ + RG VM+ + +D ++ + A + A I
Sbjct: 221 RELPTGRP-AMLHIFEPVEDGQTRGANQFYSVMERLKMLDSLQATQLQSAIVKAMYAATI 279
Query: 280 QRQIGGKTPGTKPFTSGAQRRVQDQDFVGD----GTVRQTNIAETPFGMMVEGLAEGEEI 335
+ + T + +GA + +D + + TN T G+ + L G+++
Sbjct: 280 ESDLD--TEKAFEYIAGAPQGQKDNPLINILEKFSSWYDTNNV-TLGGVKIPHLFPGDDL 336
Query: 336 KAFTNDTTDEKFGEFEEAILRAVAWALGMPFSVLAMQFDS-SYSASRGEIKEFESVIHEM 394
K T +D F E+A+LR +A LG+ + L+ + SYS++R E
Sbjct: 337 KLQTAQDSDNGFSALEQALLRYIAAGLGVSYEQLSRDYSKVSYSSARASANESWRYFMGR 396
Query: 395 RDRDADTLLRPIYRSWLRTMVLTRRIDAPGFLAAARNPKEF-PTFAAWTRSQWFGMVKEA 453
R A L ++ WL +L G + R +F +AW+R++W G + A
Sbjct: 397 RKFIAARLATQMFSYWLEEALLR------GIIRPPRARFDFYQARSAWSRAEWIGAGRMA 450
Query: 454 VDLEKETRGRGEQIKLGALTRTKAARMLTGTSFKTNTRQLAKEN 497
+D KE + +I+ G T K ++ G ++ RQ +E+
Sbjct: 451 IDGLKEVQESVMRIEAGLSTYEKEPALM-GEDYQDIFRQQVRES 493
>ref|NP_461542.1| (NC_003197) Gifsy-1 prophage: similar to head-tail preconnector gp4
of phage 21 [Salmonella typhimurium LT2]
gb|AAL21501.1| (AE008818) Gifsy-1 prophage: similar to head-tail preconnector gp4
of phage 21 [Salmonella typhimurium LT2]
Length = 526
Score = 68.9 bits (167), Expect = 1e-010
Identities = 85/406 (20%), Positives = 167/406 (40%), Gaps = 29/406 (7%)
Query: 103 DDQADAWAENVNDLFEAWGELPEVCDFTKRYTFGQWQRVAKAEALIAGDVLVVEHHNSAT 162
D+ AW E + +F GE+ D + TF ++ R G++ + ++ T
Sbjct: 106 DEVEAAWTEYCDGIF---GEM----DAEGKRTFTEFIREGVGVHAFNGEIFLQPVWDAET 158
Query: 163 EL---PTYELIDGDMVQTPYDAMNKIGKNHRIDNGVEMTKDGEHVAF------WVVQEGF 213
++ + V TP A N ++ GVE ++G+ +A+ W V G
Sbjct: 159 TQVFRTRFKAVSPKRVDTPGYARG----NRQLRAGVETDRNGKALAYHVCDDDWPVAGGE 214
Query: 214 KHKRIKATGANGRRIAWMYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGA 273
+ RI +GR A ++ + RG VM+ + +D ++ + A + A
Sbjct: 215 RWTRIPRFLPSGRP-AMLHIFEPVEDGQTRGANQFYSVMERLKMLDTLQATQLQSAIVKA 273
Query: 274 QIVGFIQRQIGGKTPGTKPFTSGAQRRVQDQDFVGDGTVRQTNIAETPFGMMVEGLAEGE 333
I+ ++ + + T+ + + + + + + G+ + L G+
Sbjct: 274 MYAATIESELDSEK-AFEYITAADNKDTPLVNMLANYARYYSTNSIKLGGVKIPHLYPGD 332
Query: 334 EIKAFTNDTTDEKFGEFEEAILRAVAWALGMPFSVLAMQFDS-SYSASRGEIKEFESVIH 392
E+ T +D F E+A+LR +A LG+ + L+ + SYS++R E
Sbjct: 333 ELNLQTAQDSDNGFSALEQALLRYIAAGLGVSYEQLSRDYSQVSYSSARASANESWRYFL 392
Query: 393 EMRDRDADTLLRPIYRSWLRTMVLTRRIDAPGFLAAARNPKEFPTFAAWTRSQWFGMVKE 452
R A L ++ WL ++ I AP + + ++W+RS+W G +
Sbjct: 393 GRRRFIAGRLATQMFSCWLEEALIRGVIRAPRARFSFWEAR-----SSWSRSEWIGAGRM 447
Query: 453 AVDLEKETRGRGEQIKLGALTRTKAARMLTGTSFKTNTRQLAKENQ 498
A+D KE + +I+ G T K ++ G ++ RQ +E++
Sbjct: 448 AIDGLKEVQESVMRIEAGLSTYEKELAIM-GEDYQEIFRQQVRESE 492
>ref|NP_040583.1| (NC_001416) capsid component [bacteriophage lambda]
sp|P03710|VMCB_LAMBD PORTAL PROTEIN (GPB) (MINOR CAPSID PROTEIN B)
pir||VHBPBL minor capsid protein B - phage lambda
gb|AAA96536.1| (J02459) B (capsid component;533) [bacteriophage lambda]
Length = 533
Score = 66.6 bits (161), Expect = 6e-010
Identities = 84/429 (19%), Positives = 165/429 (37%), Gaps = 23/429 (5%)
Query: 61 RSNTFFKKNLYARGIIRRIVGNVVNTGQVXXXXXXXXXXXXSDDQADAWAENVNDLFEAW 120
R++ + N YA I+ ++V + +++A A++ V ++ +
Sbjct: 59 RADDLVRNNGYAANAIQLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEF 118
Query: 121 GELPEVC-DFTKRYTFGQWQRVAKAEALIAGDVLVVEHHNSATEL---PTYELIDGDMVQ 176
E C D ++ TF R A G++ V ++++ + ++ +
Sbjct: 119 AEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDTSSSRLFRTQFRMVSPKRIS 178
Query: 177 TPYDAMNKIGKNHRIDNGVEMTKDGEHVAFWVVQEGFKHKRIKA-----TGANGRRIAWM 231
P N G + GV++ G + ++V ++G+ + G R +++
Sbjct: 179 NP----NNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFI 234
Query: 232 YYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFIQRQIGGKTPGTK 291
+ + RG + VM+ + +D +++ + A + A I+ ++ T
Sbjct: 235 HVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELD--TQSAM 292
Query: 292 PFTSGAQRRVQDQDFVGDGTVRQTNIAETPF---GMMVEGLAEGEEIKAFTNDTTDEKFG 348
F GA + Q + G A P G V L G+ + T TD +
Sbjct: 293 DFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYS 352
Query: 349 EFEEAILRAVAWALGMPFSVLAMQF-DSSYSASRGEIKEFESVIHEMRDRDADTLLRPIY 407
FE+++LR +A LG+ + L+ + SYS +R E + R A ++
Sbjct: 353 VFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMF 412
Query: 408 RSWLRTMVLTRRIDAPGFLAAARNPKEFPTFAAWTRSQWFGMVKEAVDLEKETRGRGEQI 467
WL ++ R + P + AR + +AW W G + A+D KE + I
Sbjct: 413 LCWLEEAIVRRVVTLP---SKARFSFQ-EARSAWGNCDWIGSGRMAIDGLKEVQEAVMLI 468
Query: 468 KLGALTRTK 476
+ G T K
Sbjct: 469 EAGLSTYEK 477
>ref|NP_287370.1| (NC_002655) putative capsid structural protein of prophage CP-933X
[Escherichia coli O157:H7 EDL933]
ref|NP_309659.1| (NC_002695) portal protein [Escherichia coli O157:H7]
ref|NP_310204.1| (NC_002695) putative portal protein [Escherichia coli O157:H7]
gb|AAG55982.1|AE005330_14 (AE005330) putative capsid structural protein of prophage CP-933X
[Escherichia coli O157:H7 EDL933]
dbj|BAB35055.1| (AP002555) portal protein [Escherichia coli O157:H7]
dbj|BAB35600.1| (AP002557) putative portal protein [Escherichia coli O157:H7]
Length = 533
Score = 65.1 bits (157), Expect = 2e-009
Identities = 93/489 (19%), Positives = 183/489 (37%), Gaps = 30/489 (6%)
Query: 1 MPAAVDPQGPAMTVENTSRFNGRGDLAHAGYRYTRHDGQNYEGGLGPVEILYTDYWSQRR 60
+P + P G ++ + ++G G R G++ + L P ++
Sbjct: 6 IPTLLGPDGMT-SLREYAGYHGGGSGFGGQLRAWNPPGESVDAALLP------NFTRGNA 58
Query: 61 RSNTFFKKNLYARGIIRRIVGNVVNTGQVXXXXXXXXXXXXSDDQADAWAENVNDLFEAW 120
R++ + N YA I+ ++V + +++A A++ V ++ +
Sbjct: 59 RADDLVRNNGYAANAIQLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEF 118
Query: 121 GELPEVC-DFTKRYTFGQWQRVAKAEALIAGDVLVV---EHHNSATELPTYELIDGDMVQ 176
E C D ++ TF R A G++ V + S + ++ +
Sbjct: 119 AEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDTRPSRLFRTQFRMVSPKRIS 178
Query: 177 TPYDAMNKIGKNHRIDNGVEMTKDGEHVAFWVVQEGFKHKRIKA-----TGANGRRIAWM 231
P N + GV++ G + ++V ++G+ + G R +++
Sbjct: 179 NP----NNTSDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFI 234
Query: 232 YYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFIQRQIGGKTPGTK 291
+ + RG + VM+ + +D +++ + A + A I+ ++ T
Sbjct: 235 HVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELD--TQSAM 292
Query: 292 PFTSGAQRRVQDQDFVGDGTVRQTNIAETPF---GMMVEGLAEGEEIKAFTNDTTDEKFG 348
F GA + Q + G A P G V L G+ + T TD +
Sbjct: 293 DFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYS 352
Query: 349 EFEEAILRAVAWALGMPFSVLAMQF-DSSYSASRGEIKEFESVIHEMRDRDADTLLRPIY 407
FE+++LR +A LG+ + L+ + SYS +R E + R A ++
Sbjct: 353 VFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMF 412
Query: 408 RSWLRTMVLTRRIDAPGFLAAARNPKEFPTFAAWTRSQWFGMVKEAVDLEKETRGRGEQI 467
WL ++ R + P + AR + +AW W G + A+D KE + I
Sbjct: 413 LCWLEEAIVRRVVTLP---SKARFSFQ-EARSAWGNCDWIGSGRMAIDGLKEVQEAVMLI 468
Query: 468 KLGALTRTK 476
+ G T K
Sbjct: 469 EAGLSTYEK 477
>emb|CAC83156.1| (AJ304858) putative portal protein [Escherichia coli]
Length = 500
Score = 64.3 bits (155), Expect = 3e-009
Identities = 90/382 (23%), Positives = 152/382 (39%), Gaps = 53/382 (13%)
Query: 110 AENVNDLFEAWGELPEVCDFTKRYTFGQWQRVAKAEALIAGDVLVVEHHNSATELPT--- 166
AE D+ W E D T +YT +R+ L G+V + L
Sbjct: 104 AELAADIRRLWAEWSVSPDVTGQYTRPVLERLLLRTWLRDGEVFAQMVSGAGNGLERTAG 163
Query: 167 ----YELIDGDMVQTPYDAMNKIGKNHRIDNGVEMTKDGEHVAFWVVQEGFKHKRIKATG 222
E ++ D V D ++ ++ GV + + G + V +K+ +
Sbjct: 164 VPFWLEAMEPDFVPMRTD------ESAGLNQGVFLDEWGRPKKYLV----YKNYPVSGRQ 213
Query: 223 ANGRRIA---WMYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFI 279
++ + IA ++ +R + RG +L+ V+ I+ + + DA A + A + +I
Sbjct: 214 SDTKEIAAGKMIHLKFTRRLHQTRGSSMLSGVLMRISALKEYEDAELTAARIAAALGLYI 273
Query: 280 QRQIGG--KTPGTKPFTSGAQRRVQDQDFVGDGTVRQTNIAETPFGMMVEGLAEGEEIKA 337
++ G + PG K T R+ +I TP G++ + L +GE+I
Sbjct: 274 RKGDGQDYEDPGIKE------------------TEREVHI--TP-GIIYDDLRKGEDIGM 312
Query: 338 FTNDTTDEKFGEFEEAILRAVAWALGMPFSVLAMQFDSSYSASRGEIKEFESVIHEMRDR 397
+D + F LRAVA + FS A ++ +YSA R E+ E ++D
Sbjct: 313 VKSDRPNPNLETFRNGQLRAVAAGSRLSFSSAARNYNGTYSAQRQELVESTDGYLILQDC 372
Query: 398 DADTLLRPIYRSWLRTMVLTRRIDAPGFLAAARNPKEFPTFAAWTRSQWFGMVKEAVDLE 457
+ RP+YR+WL +V + P + E T T S G V +D
Sbjct: 373 FIGAVTRPVYRTWLNMVVAAGLLKIPADV-------EMKTLYNATYS---GPVMPWIDPV 422
Query: 458 KETRGRGEQIKLGALTRTKAAR 479
KE QI+ GA T + R
Sbjct: 423 KEAEAWRIQIRGGAATESDWVR 444
>ref|NP_288458.1| (NC_002655) putative head-tail preconnector protein of prophage
CP-933U [Escherichia coli O157:H7 EDL933]
ref|NP_310988.1| (NC_002695) putative portal protein [Escherichia coli O157:H7]
dbj|BAB19579.1| (AP000400) portal protein [Escherichia coli O157:H7]
gb|AAG57012.1|AE005420_17 (AE005420) putative head-tail preconnector protein of prophage
CP-933U [Escherichia coli O157:H7 EDL933]
dbj|BAB36384.1| (AP002560) putative portal protein [Escherichia coli O157:H7]
Length = 500
Score = 64.3 bits (155), Expect = 3e-009
Identities = 90/382 (23%), Positives = 152/382 (39%), Gaps = 53/382 (13%)
Query: 110 AENVNDLFEAWGELPEVCDFTKRYTFGQWQRVAKAEALIAGDVLVVEHHNSATELPT--- 166
AE D+ W E D T +YT +R+ L G+V + L
Sbjct: 104 AELAADIRRLWAEWSVSPDVTGQYTRPVLERLLLRTWLRDGEVFAQMVSGAGNGLERTAG 163
Query: 167 ----YELIDGDMVQTPYDAMNKIGKNHRIDNGVEMTKDGEHVAFWVVQEGFKHKRIKATG 222
E ++ D V D ++ ++ GV + + G + V +K+ +
Sbjct: 164 VPFWLEAMEPDFVPMRTD------ESAGLNQGVFLDEWGRPKKYLV----YKNYPVSGRQ 213
Query: 223 ANGRRIA---WMYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFI 279
++ + IA ++ +R + RG +L+ V+ I+ + + DA A + A + +I
Sbjct: 214 SDTKEIAAGKMIHLKFTRRLHQTRGSSMLSGVLMRISALKEYEDAELTAARIAAALGLYI 273
Query: 280 QRQIGG--KTPGTKPFTSGAQRRVQDQDFVGDGTVRQTNIAETPFGMMVEGLAEGEEIKA 337
++ G + PG K T R+ +I TP G++ + L +GE+I
Sbjct: 274 RKGDGQDYEDPGIKE------------------TEREVHI--TP-GIIYDDLRKGEDIGM 312
Query: 338 FTNDTTDEKFGEFEEAILRAVAWALGMPFSVLAMQFDSSYSASRGEIKEFESVIHEMRDR 397
+D + F LRAVA + FS A ++ +YSA R E+ E ++D
Sbjct: 313 VKSDRPNPNLETFRNGQLRAVAAGSRLSFSSAARNYNGTYSAQRQELVESTDGYLILQDC 372
Query: 398 DADTLLRPIYRSWLRTMVLTRRIDAPGFLAAARNPKEFPTFAAWTRSQWFGMVKEAVDLE 457
+ RP+YR+WL +V + P + E T T S G V +D
Sbjct: 373 FIGAVTRPVYRTWLNMVVAAGLLKIPADV-------EMKTLYNATYS---GPVMPWIDPV 422
Query: 458 KETRGRGEQIKLGALTRTKAAR 479
KE QI+ GA T + R
Sbjct: 423 KEAEAWRIQIRGGAATESDWVR 444
>ref|NP_297800.1| (NC_002488) conserved hypothetical protein [Xylella fastidiosa
9a5c]
pir||F82798 conserved hypothetical protein XF0510 [imported] - Xylella
fastidiosa (strain 9a5c)
gb|AAF83320.1|AE003899_12 (AE003899) conserved hypothetical protein [Xylella fastidiosa 9a5c]
Length = 520
Score = 48.9 bits (115), Expect = 1e-004
Identities = 83/404 (20%), Positives = 146/404 (35%), Gaps = 63/404 (15%)
Query: 116 LFEAWGELPEVCDFTKRYTFGQWQRVAKAEALIAGDVLVVE--------HHNSATELPTY 167
L+ AW + PEV T + + QR+ L G+ V E H S +
Sbjct: 128 LYHAWSKRPEV---TWMHDWPSVQRLLARTWLRDGEAFVQELRGFLPTLQHGSGVPF-SI 183
Query: 168 ELIDGDMVQTPYDAMNKIGKNHRIDNGVEMTKDGEHVAFWVVQEGFKHKRI-----KATG 222
E+++ D+V YD + I G++ G +++ + + K
Sbjct: 184 EMLEPDLVPLDYD-----DPSQHILQGIQRNAWGRATGYYIYTQHPGDPEVLLPEKKLVP 238
Query: 223 ANGRRIAWMYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFIQRQ 282
++ R + T R + RG LL I ++ ++ + A + A + I
Sbjct: 239 SHLIR----HIRTIDRIGQVRGISLLASTFTRIEDLKDYEESERIAAKIAACMAAVI--- 291
Query: 283 IGGKTPGTKP--FTSGAQRRVQDQDFVGDGTVRQTNIAETPFGMMVEGLAEGEEIKAFTN 340
I G+ P + S QR ++ Q M+ +GL GE I +
Sbjct: 292 IKGEPSQYDPDKYPSTGQRMMRFQP-----------------AMIFDGLNPGERIDPIDS 334
Query: 341 DTTDEKFGEFEEAILRAVAWALGMPFSVLAMQFDSSYSASRGEIKEFESVIHEMRDRDAD 400
+ + + LRA+A + + FS LA ++ +YSA R E+ E +
Sbjct: 335 TRPNPNLESYRDGQLRAIAAPMRISFSSLAKNYNGTYSAQRQELVEQYGAYGVLAHEFIS 394
Query: 401 TLLRPIYRSWLRTMVLTRRIDAPGF--LAAARNPKEFPTFAAWTRSQWFGMVKEAVDLEK 458
++RP+Y + V + + P LA A P W +D K
Sbjct: 395 QMVRPLYERLIAIAVTSGALTLPPNVPLANALGADYLPPSMPW------------IDPLK 442
Query: 459 ETRGRGEQIKLGALTRTKAARMLTGTSFKTNTRQLAKENQLLAD 502
E Q++ G + + G + T QL+ + Q A+
Sbjct: 443 EINALTAQVQAGVRSLSSVIAERGGRMYDT-VEQLSIDKQWAAE 485
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.318 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 312,583,243
Number of Sequences: 919193
Number of extensions: 13040094
Number of successful extensions: 26286
Number of sequences better than 5.0e-02: 15
Number of HSP's better than 0.1 without gapping: 6
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 26266
Number of HSP's gapped (non-prelim): 15
length of query: 545
length of database: 288,098,425
effective HSP length: 125
effective length of query: 420
effective length of database: 173,199,300
effective search space: 72743706000
effective search space used: 72743706000
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 93 (40.4 bits)
Query= orf6 [1615-2011]
(395 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAB65791.1| (AF009900) Z2Z3 protein [Vibrio shilonii] 93 5e-018
dbj|BAA89626.1| (AB036665) similar to GPC of lambda [bacter... 64 4e-009
ref|NP_536361.1| (NC_003309) gp5 [Bacteriophage phiE125] >g... 57 3e-007
dbj|BAA89643.1| (AB036666) similar to minor capsid protein ... 56 7e-007
ref|NP_461541.1| (NC_003197) Gifsy-1 prophage: similar to h... 52 1e-005
pir||T03571 probable peptidase - Rhodobacter capsulatus >gi... 51 2e-005
gb|AAC24140.1| (AF071201) unknown [bacteriophage Felix 01] 49 1e-004
sp|P36273|VG05_BPP21 HEAD-TAIL PRECONNECTOR PROTEIN GP5 [CO... 48 2e-004
ref|NP_287793.1| (NC_002655) putative capsid assembly prote... 47 3e-004
ref|NP_287588.1| (NC_002655) putative head-tail preconnecto... 47 3e-004
ref|NP_470963.1| (NC_003212) similar to proteases [Listeria... 43 0.005
ref|NP_540710.1| (NC_003317) PUTATIVE PROTEASE IV [Brucella... 43 0.007
ref|NP_110548.1| (NC_002689) Secreted serine protease (ClpP... 42 0.015
>gb|AAB65791.1| (AF009900) Z2Z3 protein [Vibrio shilonii]
Length = 330
Score = 92.8 bits (229), Expect = 5e-018
Identities = 67/202 (33%), Positives = 101/202 (49%), Gaps = 10/202 (4%)
Query: 55 AQINMQGVFTSAPDFFAMLFGGGNTLYGDVFAA-INAAESDDSVKNIEYYIDSPGGEAEG 113
A I++ GV + + F + GG +T +V A NAA ++ SVK + +DSPGGEA G
Sbjct: 40 AIIHVNGVISRYANLFHAVCGGVST---EVLAKEFNAALNESSVKAVILNVDSPGGEASG 96
Query: 114 AIKLGDMIRGT--KKPXXXXXXXXXXXXXXXXXXXDKIVATSRVSRAGNIGAVLS-MRRP 170
+L +MI + KKP V +R G+IG V+S ++RP
Sbjct: 97 IHELSEMIHASRGKKPVRAYVGGDGCSAAYWITTACDRVTMDATARVGSIGTVVSFVKRP 156
Query: 171 SQ--SSYIDVTSTDAPNKRPDPETEEGRRVIREQALDPLHNMFAQAVADGRGTTLADVNQ 228
+ + S+ +PNKR DPE+E+G+ I+ Q LD + +F VA G T V
Sbjct: 157 DAEGAKRFEFVSSQSPNKRLDPESEQGQTAIQTQ-LDAMAEVFISRVARNMGVTSDKVKS 215
Query: 229 NFGRGGSLFAETALKMGMIDEI 250
+FG+GG + A+ GM E+
Sbjct: 216 DFGQGGVKIGQQAVDAGMAHEL 237
>dbj|BAA89626.1| (AB036665) similar to GPC of lambda [bacteriophage WO]
Length = 353
Score = 63.5 bits (153), Expect = 4e-009
Identities = 57/208 (27%), Positives = 87/208 (41%), Gaps = 17/208 (8%)
Query: 54 TAQINMQGVFTSAPDFFAMLFGGGNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEG 113
TA I + G+ T P F G T Y + I A S+ ++ I IDSPGGE G
Sbjct: 47 TAVIAINGILTKKPGAFDDFLG--MTSYEKIQEEIEEALSNKDIETILLDIDSPGGEVNG 104
Query: 114 AIKLGDMI---RGTKKPXXXXXXXXXXXXXXXXXXXDKIVATSRVSRAGNIGAVLS---M 167
L D I R K+ +K V SR S G+IG + S
Sbjct: 105 IFDLADFIYESRAKKRIIAIANDDAYSAAYAIASSAEK-VFVSRTSGVGSIGVIASHIDQ 163
Query: 168 RRPSQSSYIDVTSTDAPNKRPD-----PETEEGRRVIREQALDPLHNMFAQAVADGRGTT 222
R + I T+ A +++ D P T E ++++ + L+ MF Q +A RG +
Sbjct: 164 SRFDEKQGIKYTTIFAGSRKNDLNPHEPMTSESLESLQKE-VGRLYEMFLQLIARNRGLS 222
Query: 223 LADVNQNFGRGGSLFAETALKMGMIDEI 250
+ + G F E A+++G+ D +
Sbjct: 223 IEKIRST--EAGLYFGEKAVEIGLADGV 248
>ref|NP_536361.1| (NC_003309) gp5 [Bacteriophage phiE125]
gb|AAL40278.1|AF447491_5 (AF447491) gp5 [Bacteriophage phiE125]
Length = 301
Score = 57.0 bits (136), Expect = 3e-007
Identities = 51/184 (27%), Positives = 79/184 (42%), Gaps = 17/184 (9%)
Query: 79 TLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIRGTK--KPXXXXXXXXX 136
T Y + +A+N A +D +V++I IDS GG A GA +L D IR KP
Sbjct: 80 TSYEGLRSAVNQAVADPAVEHIVLDIDSNGGSATGAFELADDIRAASLVKPITAIVNFSA 139
Query: 137 XXXXXXXXXXDKIVATSRVSRAGNIGAV---LSMRRPSQSSYIDVTSTDAPNKRPDPETE 193
V SR S G+IG + L + + + I VTS A + + D
Sbjct: 140 FSGGYLIAAAASKVIVSRTSGVGSIGVIANHLDVSKRDEQQGIKVTSVFAGDHKNDLTPH 199
Query: 194 EGRRVIREQALDPLHNM-------FAQAVADGRGTTLADVNQNFGRGGSLFAETALKMGM 246
E + +Q+L L +M F A+A+ RG + V + G F + ++ G+
Sbjct: 200 E---PLSDQSLTFLTSMVQNSYKQFVDAIANFRGLSTQAVKDT--QAGIFFGQKGVEAGL 254
Query: 247 IDEI 250
D +
Sbjct: 255 ADSV 258
>dbj|BAA89643.1| (AB036666) similar to minor capsid protein GPC of phage lambda
[Wolbachia sp. wKue]
Length = 350
Score = 55.8 bits (133), Expect = 7e-007
Identities = 54/208 (25%), Positives = 86/208 (40%), Gaps = 17/208 (8%)
Query: 54 TAQINMQGVFTSAPDFFAMLFGGGNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEG 113
TA I + G+ T P+ F G T Y + I A D+ V+ I IDSPGGE G
Sbjct: 49 TAIIPIHGILTKKPEAFDDFLG--MTSYEKMQEEIEEALEDEKVEIILLDIDSPGGEVNG 106
Query: 114 AIKLGDMI---RGTKKPXXXXXXXXXXXXXXXXXXXDKIVATSRVSRAGNIGAVLSMRRP 170
L D I R K+ +KI T R S G+IG + S
Sbjct: 107 IFDLADFIYNARSKKRIIAIANDDAYSAAYAIASSAEKIFLT-RTSGVGSIGVIASHIDQ 165
Query: 171 S---QSSYIDVTSTDAPNKRPD-----PETEEGRRVIREQALDPLHNMFAQAVADGRGTT 222
S + I ++ A +++ D P T E ++ + ++ L+ M + +A R +
Sbjct: 166 SGFDEKQGIKYSTIFAGSRKNDLNPHEPITSESLENLKSE-VNRLYGMLVELIARNRSLS 224
Query: 223 LADVNQNFGRGGSLFAETALKMGMIDEI 250
+ + G F E A+++G+ D +
Sbjct: 225 IEAIKST--EAGLYFGEKAVEIGLADGV 250
>ref|NP_461541.1| (NC_003197) Gifsy-1 prophage: similar to head-tail preconnector gp5
of phage 21 [Salmonella typhimurium LT2]
gb|AAL21500.1| (AE008818) Gifsy-1 prophage: similar to head-tail preconnector gp5
of phage 21 [Salmonella typhimurium LT2]
Length = 498
Score = 51.6 bits (122), Expect = 1e-005
Identities = 51/183 (27%), Positives = 70/183 (37%), Gaps = 11/183 (6%)
Query: 77 GNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMI--RGTKKPXXXXXXX 134
G T Y V A + A SD VK I IDSPGG+A GA DMI +KP
Sbjct: 99 GMTGYDGVTARLQQAVSDPEVKGILLDIDSPGGQAAGAFDCADMIYRMREQKPVWALANE 158
Query: 135 XXXXXXXXXXXXDKIVATSRVSRAGNIGAVL---SMRRPSQSSYIDVT----STDAPNKR 187
++ SR G+IG V+ S + ID+T +
Sbjct: 159 TACSAAMLLAAACSHRLVTQTSRMGSIGVVMAHTSYAEKLKQEGIDITLIYSGAHKADLT 218
Query: 188 PDPETEEGRRVIREQALDPLHNMFAQAVADGRGTTLADV--NQNFGRGGSLFAETALKMG 245
P + E +Q +D MFA+ VA G ++ V + G T L G
Sbjct: 219 PSQKLPESVYADYQQRMDEARKMFAEKVARYTGLSVDAVMATEAAVYDGQAIITTGLADG 278
Query: 246 MID 248
M++
Sbjct: 279 MVN 281
>pir||T03571 probable peptidase - Rhodobacter capsulatus
gb|AAC16224.1| (AF010496) potential peptidase [Rhodobacter capsulatus]
Length = 303
Score = 50.8 bits (120), Expect = 2e-005
Identities = 54/190 (28%), Positives = 78/190 (40%), Gaps = 18/190 (9%)
Query: 77 GNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEA---EGAIKLGDMIRGTKKPXXXXXX 133
G Y + A +DD+V + D+PGG E A+ + K
Sbjct: 77 GWATYRGIAEACADLAADDAVSAVVVEFDTPGGYTLGIEDAVTALRDLAAVKPVHALAAP 136
Query: 134 XXXXXXXXXXXXXDKIVATSRVSRAGNIGAVLSMRR---PSQSSYI---DVTSTDAPNKR 187
++V T R + G+IG ++ P +S + D TS A K
Sbjct: 137 LAASAGYWLASQARELVMTPRAA-VGSIGVAVTAAANVAPGAASGVQLFDFTSRHARAKW 195
Query: 188 PDPETEEGRRVIREQALDPLHNMFAQAVADGRGTTLAD------VNQNFGRGGSLF-AET 240
PDP TE+G+ IR + LD F AVA+GRG D V+ + GG++F A
Sbjct: 196 PDPATEDGKAEIR-RGLDRTEARFHAAVAEGRGIAPEDLAARLSVSDDPEDGGAVFEAAE 254
Query: 241 ALKMGMIDEI 250
AL G+ D I
Sbjct: 255 ALARGLADRI 264
>gb|AAC24140.1| (AF071201) unknown [bacteriophage Felix 01]
Length = 442
Score = 48.5 bits (114), Expect = 1e-004
Identities = 50/222 (22%), Positives = 90/222 (40%), Gaps = 13/222 (5%)
Query: 40 PTANADRVMVRMGDTAQINMQGVFTSAPDFFAMLFGGGNTLYGDVFAAINAAESDDSVKN 99
P A ++ D A I + G T F + GG + Y + + A +D+S+K
Sbjct: 48 PKGEARSLLKVKDDIAIIPIMGGLTHRMTFIDAMCTGGLSSYEGLRRGFDEALADESIKT 107
Query: 100 IEYYIDSPGGEAEGAIKLGDMI---RGTKKPXXXXXXXXXXXXXXXXXXXDKIVATSRVS 156
I +IDS GGEA G +L I RG KK ++I+A S +
Sbjct: 108 IVLHIDSGGGEASGCFELARHIMASRGQKKIIAYVDEFACSAAYALASSAEEIIA-SPDA 166
Query: 157 RAGNIGAVL---SMRRPSQSSYIDVTSTDAPNKR----PDPETEEGRRVIREQALDPLHN 209
G+IG ++ + + + + + + A + P E + ++ ++ +
Sbjct: 167 DVGSIGVIMVHQELTKAFEKNGVTINVIKAGEFKGMGSPFQALSEESKERLQKRINDTYA 226
Query: 210 MFAQAVADGRGTTLADVNQNFGRGGSLFAETALKMGMIDEIL 251
F VA+ R + V A+ AL++G+I+ I+
Sbjct: 227 TFTGFVAESRNLSEEAVKNT--EANVYSAQEALELGLINSIM 266
>sp|P36273|VG05_BPP21 HEAD-TAIL PRECONNECTOR PROTEIN GP5 [CONTAINS: SCAFFOLD PROTEIN
(HEAD PROTEIN GP6)]
pir||JN0539 head protein gp5 - phage 21
gb|AAA32343.1| (M81255) head-tail preconnector gp5 [Bacteriophage 21]
Length = 501
Score = 48.1 bits (113), Expect = 2e-004
Identities = 46/185 (24%), Positives = 77/185 (40%), Gaps = 13/185 (7%)
Query: 77 GNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIRGTK--KPXXXXXXX 134
G T Y + A + A +D V+ + IDSPGG+A GA DMI + KP
Sbjct: 99 GMTGYDGIVACLQQAMADSQVRGVLLDIDSPGGQAAGAFDCADMIYRLRQQKPVWALCND 158
Query: 135 XXXXXXXXXXXXDKIVATSRVSRAGNIGAVL---SMRRPSQSSYIDVTSTDAPNKRPDPE 191
++ SR G+IG ++ S + +D+T + + D
Sbjct: 159 TACSAAMLLASACSRRLVTQTSRIGSIGVMMSHVSYAGHLAQAGVDITLIYSGAHKVDGN 218
Query: 192 TEEGR----RVIREQALDPLHNMFAQAVADGRGTTLADVNQNFGRGGSLF-AETALKMGM 246
E R +Q +D MFA+ VA G ++ V G ++F ++ ++ G+
Sbjct: 219 QFEALPAEVRQDMQQRIDAARRMFAEKVAMYTGLSVDAVT---GTEAAVFEGQSGIEAGL 275
Query: 247 IDEIL 251
DE++
Sbjct: 276 ADELI 280
>ref|NP_287793.1| (NC_002655) putative capsid assembly protein of prophage CP-933R
[Escherichia coli O157:H7 EDL933]
ref|NP_309135.1| (NC_002695) putative head-tail preconnector protein [Escherichia
coli O157:H7]
ref|NP_310759.1| (NC_002695) putative head-tail preconnector protein [Escherichia
coli O157:H7]
gb|AAG56407.1|AE005368_13 (AE005368) putative capsid assembly protein of prophage CP-933R
[Escherichia coli O157:H7 EDL933]
dbj|BAB34531.1| (AP002554) putative head-tail preconnector protein [Escherichia
coli O157:H7]
dbj|BAB36155.1| (AP002559) putative head-tail preconnector protein [Escherichia
coli O157:H7]
Length = 501
Score = 47.0 bits (110), Expect = 3e-004
Identities = 46/185 (24%), Positives = 76/185 (40%), Gaps = 13/185 (7%)
Query: 77 GNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIRGTK--KPXXXXXXX 134
G T Y + A + A +D V+ + IDSPGG+A GA DMI + KP
Sbjct: 99 GMTGYDGIVACLQQAMADSQVRGVLLDIDSPGGQAAGAFDCADMIYRLRQQKPVWALCND 158
Query: 135 XXXXXXXXXXXXDKIVATSRVSRAGNIGAVL---SMRRPSQSSYIDVTSTDAPNKRPDPE 191
++ SR G+IG ++ S + +D+T + + D
Sbjct: 159 TACSAAMLLASACSRRLVTQTSRIGSIGVMMSHVSYAGHLAQAGVDITLIYSGAHKVDGN 218
Query: 192 TEEGR----RVIREQALDPLHNMFAQAVADGRGTTLADVNQNFGRGGSLF-AETALKMGM 246
E R +Q +D MFA+ VA G ++ V G ++F ++ + G+
Sbjct: 219 QFEALPAEVRQNMQQRIDAARRMFAEKVAMFTGLSVDAVT---GTEAAVFEGQSGIDAGL 275
Query: 247 IDEIL 251
DE++
Sbjct: 276 ADELV 280
>ref|NP_287588.1| (NC_002655) putative head-tail preconnector protein of prophage
CP-933O [Escherichia coli O157:H7 EDL933]
gb|AAG56200.1|AE005349_4 (AE005349) putative head-tail preconnector protein of prophage
CP-933O [Escherichia coli O157:H7 EDL933]
Length = 501
Score = 47.0 bits (110), Expect = 3e-004
Identities = 46/185 (24%), Positives = 76/185 (40%), Gaps = 13/185 (7%)
Query: 77 GNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIRGTK--KPXXXXXXX 134
G T Y + A + A +D V+ + IDSPGG+A GA DMI + KP
Sbjct: 99 GMTGYDGIVACLQQAMADSQVRGVLLDIDSPGGQAAGAFDCADMIYRLRQQKPVWALCND 158
Query: 135 XXXXXXXXXXXXDKIVATSRVSRAGNIGAVL---SMRRPSQSSYIDVTSTDAPNKRPDPE 191
++ SR G+IG ++ S + +D+T + + D
Sbjct: 159 TACSAAMLLASACSRRLVTQTSRIGSIGVMMSHVSYAGHLAQAGVDITLIYSGAHKVDGN 218
Query: 192 TEEGR----RVIREQALDPLHNMFAQAVADGRGTTLADVNQNFGRGGSLF-AETALKMGM 246
E R +Q +D MFA+ VA G ++ V G ++F ++ + G+
Sbjct: 219 QFEALPAEVRQNMQQRIDAARRMFAEKVAMFTGLSVDAVT---GTEAAVFEGQSGIDAGL 275
Query: 247 IDEIL 251
DE++
Sbjct: 276 ADELV 280
>ref|NP_470963.1| (NC_003212) similar to proteases [Listeria innocua]
emb|CAC96858.1| (AL596169) similar to proteases [Listeria innocua]
Length = 337
Score = 43.1 bits (100), Expect = 0.005
Identities = 57/263 (21%), Positives = 96/263 (35%), Gaps = 33/263 (12%)
Query: 10 LRLMADAPNLTQAQIDQCMASSPLF-QGPGGPTANADRVMVRMG--DT-AQINMQGVFTS 65
L +++ T +QI SSP F + T ++ G DT A +++ G
Sbjct: 15 LLIVSALAKFTSSQIASTEESSPTFVESLFADTGELTETVIEEGGDDTIAVLSVDGTIQD 74
Query: 66 APDFFAMLFGGGNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIRGTK 125
D LFG + + +DD ++ + Y++SPGG + ++ D I +
Sbjct: 75 TGDS-GSLFGDAGYDHSFFMQQLEQVRNDDYIQGVLLYVNSPGGGVMESAQIRDKILQIQ 133
Query: 126 KP-----XXXXXXXXXXXXXXXXXXXDKIVATSRVSRAGNIGAVL--------------S 166
K DKI A S+ + G++G ++ S
Sbjct: 134 KERNIPFYVSMGSMAASGGYYISAPADKIFA-SKETLTGSLGVIMQGYDYSELMKKLGVS 192
Query: 167 MRRPSQSSYIDVTSTDAPNKRPDPETEEGRRVIREQALDPLHNMFAQAVADGRGTTLADV 226
Y D+ S P E + I + +D +N F + VA GRG + A+
Sbjct: 193 DNTIKSGEYKDIMSGTRP-------MTEDEKKIMQSMIDDSYNEFVKVVAKGRGMS-AEK 244
Query: 227 NQNFGRGGSLFAETALKMGMIDE 249
+ G A + G+IDE
Sbjct: 245 VRKIADGRIYDGRQAKENGLIDE 267
>ref|NP_540710.1| (NC_003317) PUTATIVE PROTEASE IV [Brucella melitensis]
gb|AAL52974.1| (AE009612) PUTATIVE PROTEASE IV [Brucella melitensis]
Length = 301
Score = 42.7 bits (99), Expect = 0.007
Identities = 36/152 (23%), Positives = 59/152 (38%), Gaps = 15/152 (9%)
Query: 83 DVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIR--GTKKPXXXXXXXXXXXXX 140
++ ++ DD+VK + +DSPGG G + D +R TKKP
Sbjct: 39 ELLKRLDKIAGDDAVKGVILLLDSPGGTTVGGEAIYDAVRKIATKKPVVTQVGTLAASAG 98
Query: 141 XXXXXXDKIVATSRVSRAGNIGAVLSMRRPSQSSYIDV------TSTDAPNKRP----DP 190
+ + S G+IG + + P S +D T +P K P
Sbjct: 99 YMIASASDHIVARQTSIVGSIGVL--FQYPDLSKLLDTLGVKVETIKSSPLKAEPNYFSP 156
Query: 191 ETEEGRRVIREQALDPLHNMFAQAVADGRGTT 222
+EE + +IR +D ++ F V + R T
Sbjct: 157 ASEEAKNMIRSMIMDS-YDWFVGIVQERRAFT 187
>ref|NP_110548.1| (NC_002689) Secreted serine protease (ClpP class) [Thermoplasma
volcanium]
dbj|BAB59171.1| (AP000991) proteinase IV [Thermoplasma volcanium]
Length = 230
Score = 41.6 bits (96), Expect = 0.015
Identities = 41/182 (22%), Positives = 73/182 (39%), Gaps = 14/182 (7%)
Query: 80 LYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIR--GTKKPXXXXXXXXXX 137
LY ++ A+ AAE S+ + +S GG+A + + D+++ KKP
Sbjct: 16 LYKYLYPALKAAEGKKSIAGLILVFNSGGGDAASSQLIHDLVKKIRKKKPVYSLALGICA 75
Query: 138 XXXXXXXXXDKIVATSRVSRAGNIGAVLSMRRPSQSSYIDVTSTDA---------PNKRP 188
+ S G+IG V+S+ RP+ ++ D P
Sbjct: 76 SGAYWIASASTKIYAIDTSLIGSIG-VISI-RPNVKKLMEKIGVDVMVYKSGKYKDMTSP 133
Query: 189 DPETEEGRRVIREQALDPLHNMFAQAVADGRGTTLADVNQNFGRGGSLFAETALKMGMID 248
E E + + ++ LD + F ++VA+ RG +++ G A+ A G+ID
Sbjct: 134 FSEPNEEEKSVYQRLLDDIFEKFKRSVAEDRGIPSEKIDE-IANGMVYSAKMAADNGLID 192
Query: 249 EI 250
I
Sbjct: 193 RI 194
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.315 0.131 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 183,463,006
Number of Sequences: 919193
Number of extensions: 6556375
Number of successful extensions: 16076
Number of sequences better than 5.0e-02: 17
Number of HSP's better than 0.1 without gapping: 2
Number of HSP's successfully gapped in prelim test: 15
Number of HSP's that attempted gapping in prelim test: 16068
Number of HSP's gapped (non-prelim): 17
length of query: 395
length of database: 288,098,425
effective HSP length: 122
effective length of query: 273
effective length of database: 175,956,879
effective search space: 48036227967
effective search space used: 48036227967
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 92 (40.0 bits)
Query= orf7 [2015-2150]
(134 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_518980.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED... 37 0.026
>ref|NP_518980.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD14561.1| (AL646061) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 125
Score = 37.4 bits (85), Expect = 0.026
Identities = 18/41 (43%), Positives = 27/41 (64%)
Query: 33 AAETWPAGAVLGRVTATGRYVRYNASASDGSQVPSAILTEA 73
A ++ G VLG VTATG+ + + SA+DGSQ + +L +A
Sbjct: 31 AGQSLVLGTVLGMVTATGKVKQLDPSATDGSQYAAGVLMQA 71
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.319 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 62,111,016
Number of Sequences: 919193
Number of extensions: 1994709
Number of successful extensions: 4326
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4325
Number of HSP's gapped (non-prelim): 1
length of query: 134
length of database: 288,098,425
effective HSP length: 110
effective length of query: 24
effective length of database: 186,987,195
effective search space: 4487692680
effective search space used: 4487692680
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 83 (36.6 bits)
Query= orf8 [2155-2520]
(364 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_056680.1| (NC_002214) gp348 [Streptococcus thermophi... 46 5e-004
pir||T13320 hypothetical protein 31 - Streptococcus phage p... 44 0.002
>ref|NP_056680.1| (NC_002214) gp348 [Streptococcus thermophilus bacteriophage Sfi11]
pir||T13625 hypothetical protein gp348 - Streptococcus phage phi-Sfi11
gb|AAC34402.1| (AF158600) gp348 [Streptococcus thermophilus bacteriophage Sfi11]
Length = 348
Score = 46.2 bits (108), Expect = 5e-004
Identities = 42/167 (25%), Positives = 68/167 (40%), Gaps = 16/167 (9%)
Query: 37 GKTVEIDVKRFAENVAVVVTKLSGPNFNDASLISTKEFEPPEYGEAFATDVDDLLQRLLG 96
G++V + F NV V D + + P + EA +D Q L
Sbjct: 51 GQSVALKAAAFDTNVTV----------RDRVSAEMHDEQMPFFKEAMLVKENDRQQLNL- 99
Query: 97 VNPYDDANIAYSSKLVGRLMDYFMEANDMIMRGIELQASQILQTGRLSLIDRAGEVAYEI 156
D N A + +V + + + + +E Q+L TG+++ V +I
Sbjct: 100 --VKDSGNAALVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDG--VNKDI 155
Query: 157 DYSPKATHFPTTATAWSDDGADPLADLESLFGVIRADGKVNPDMIIM 203
DY K H + +W++ GA PLADLE R G +NP+ +M
Sbjct: 156 DYGVKPDHKKQVSKSWAEPGATPLADLEDAIETARELG-LNPERAVM 201
>pir||T13320 hypothetical protein 31 - Streptococcus phage phi-O1205
gb|AAC79546.1| (U88974) ORF31 [Streptococcus thermophilus temperate bacteriophage
O1205]
Length = 348
Score = 44.3 bits (103), Expect = 0.002
Identities = 25/74 (33%), Positives = 38/74 (50%), Gaps = 3/74 (4%)
Query: 130 IELQASQILQTGRLSLIDRAGEVAYEIDYSPKATHFPTTATAWSDDGADPLADLESLFGV 189
+E Q+L TG+++ V +IDY K H + +W++ GA PLADLE
Sbjct: 131 LEAMRMQVLATGKIAFTSDG--VNKDIDYGVKPDHKKQVSKSWAEPGATPLADLEDAIET 188
Query: 190 IRADGKVNPDMIIM 203
R G +NP+ +M
Sbjct: 189 ARELG-LNPERAVM 201
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.318 0.137 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 213,197,707
Number of Sequences: 919193
Number of extensions: 9048143
Number of successful extensions: 25213
Number of sequences better than 5.0e-02: 2
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 25212
Number of HSP's gapped (non-prelim): 3
length of query: 364
length of database: 288,098,425
effective HSP length: 122
effective length of query: 242
effective length of database: 175,956,879
effective search space: 42581564718
effective search space used: 42581564718
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 91 (39.7 bits)
Query= orf9 [2536-2666]
(129 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.309 0.137 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 66,225,875
Number of Sequences: 919193
Number of extensions: 2139604
Number of successful extensions: 2464
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2464
Number of HSP's gapped (non-prelim): 0
length of query: 129
length of database: 288,098,425
effective HSP length: 105
effective length of query: 24
effective length of database: 191,583,160
effective search space: 4597995840
effective search space used: 4597995840
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 83 (36.6 bits)
Query= orf10 [2669-2798]
(128 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.314 0.133 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 70,501,668
Number of Sequences: 919193
Number of extensions: 2712836
Number of successful extensions: 4936
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4936
Number of HSP's gapped (non-prelim): 0
length of query: 128
length of database: 288,098,425
effective HSP length: 104
effective length of query: 24
effective length of database: 192,502,353
effective search space: 4620056472
effective search space used: 4620056472
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 83 (36.6 bits)
Query= orf11 [2867-3021]
(153 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.318 0.131 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 76,484,636
Number of Sequences: 919193
Number of extensions: 2666006
Number of successful extensions: 4797
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4797
Number of HSP's gapped (non-prelim): 0
length of query: 153
length of database: 288,098,425
effective HSP length: 110
effective length of query: 43
effective length of database: 186,987,195
effective search space: 8040449385
effective search space used: 8040449385
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 85 (37.4 bits)
Query= orf12 [3025-3526]
(500 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_050643.1| (NC_000929) major tail subunit [Enterobact... 74 3e-012
ref|NP_404841.1| (NC_003143) putative bacteriophage tail sh... 70 5e-011
ref|NP_599043.1| (NC_003444) tail sheath protein [Shigella ... 59 9e-008
ref|NP_543096.1| (NC_003356) putative sheath protein [Bacte... 59 1e-007
ref|NP_284552.1| (NC_003116) hypothetical protein [Neisseri... 52 1e-005
ref|NP_439661.1| (NC_000907) sheath protein gpL (muL) [Haem... 50 6e-005
ref|NP_274135.1| (NC_003112) phage sheath protein [Neisseri... 43 0.007
>ref|NP_050643.1| (NC_000929) major tail subunit [Enterobacteria phage Mu]
sp|P79678|VPL_BPMU Tail sheath protein (gpL)
dbj|BAA19195.1| (AB000833) sheath protein gpL [Enterobacteria phage Mu]
gb|AAF01117.1|AF083977_36 (AF083977) L [Enterobacteria phage Mu]
Length = 495
Score = 74.3 bits (181), Expect = 3e-012
Identities = 116/483 (24%), Positives = 183/483 (37%), Gaps = 29/483 (6%)
Query: 5 NAIDNSVRARVLGIKTEFRNFNTGRTFFLP-QHVALLGQ-GNTAATYALTPFRATSAAQV 62
NAI + VR + I EF N N P Q V + GQ G+ A+ P R S +Q
Sbjct: 7 NAIPSDVRVPLTYI--EFDNSNAVSGTPAPRQRVLMFGQSGSKASAAPNVPVRIRSGSQA 64
Query: 63 GQRFGFGSPLHLAALQLLPDNNDGLRSIPLTIFPLNDNXXXXXXXXXXXXXXXXXXTATI 122
FG GS L L A L N R L P N ++
Sbjct: 65 SAAFGQGSMLALMADAFLNAN----RVAELWCIP-QGNGTGNAAVGEISLSGTAGENGSL 119
Query: 123 RVRIGSQRSSLITIPTGTTASQAAALIVAGIQGNTFMPVTSEVDATNDNE-----VNVAA 177
I QR + +++ G T + A L+VA I+G +PVT+EV A + ++ V ++A
Sbjct: 120 VTYIAGQRLA-VSVAAGATGAALADLLVARIKGQPDLPVTAEVRADSGDDDTHADVVLSA 178
Query: 178 KWKG-LSGNDL---VVSIEGAVAGLTVAITQXXXXXXXXXXXXXXXXFGENVWYTQIVSC 233
K+ G LS D+ + E G+ A G+ + Y IV
Sbjct: 179 KFTGALSAVDVRWNYYAGETTPYGIITAFKAASGKNGNPDISASIAGMGD-LQYKYIVMP 237
Query: 234 FNTANTDALNAFETFGEGRWNPILRRPLAVFTGTNETNANTLAAIGDARRAQRTNLITPV 293
+ + LN T + RW P+ + T + T + ++ G +R + + +
Sbjct: 238 YT--DEPNLNLLRTELQERWGPVNQADGFAVTVLSGTYGD-ISTFGVSRNDHLISCMG-I 293
Query: 294 PGAQNLPCEIXXXXXXXXXXXXNNDPASDFARLTLPGLTPGTDAQQWSHTQRDLLVKAGI 353
GA + DPA LTLPG P +++ ++R+ L+ GI
Sbjct: 294 AGAPEPSYLYAATLCAVASQALSIDPARPLQTLTLPGRMPPAVGDRFTWSERNALLFDGI 353
Query: 354 STSIVRD-GVAEISDTVTTYHPT--GEVNPGYLFYKDTVKASNVLYNLDLIFNTREWDGA 410
ST V D G +I +T Y G+ +P YL S + Y+L T+++
Sbjct: 354 STFNVNDGGEMQIERMITMYRTNKYGDSDPSYLNVNTIATLSYLRYSLRTRI-TQKFPNY 412
Query: 411 PLIPD-DQPTTNARAKRPKXXXXXXXXXXXXXXXXXIISDVPFTLENIRASINDQNPNRL 469
L D + T P ++ D E + + N + +RL
Sbjct: 413 KLASDGTRFATGQAVVTPSVIKTELLALFEEWENAGLVEDFDTFKEELYVARNKDDKDRL 472
Query: 470 DII 472
D++
Sbjct: 473 DVL 475
>ref|NP_404841.1| (NC_003143) putative bacteriophage tail sheath protein [Yersinia
pestis]
emb|CAC90076.1| (AJ414147) putative bacteriophage tail sheath protein [Yersinia
pestis]
Length = 502
Score = 70.1 bits (170), Expect = 5e-011
Identities = 91/371 (24%), Positives = 141/371 (37%), Gaps = 33/371 (8%)
Query: 21 EFRNFNTGRTFFLPQHVALLGQGNTAATY-ALTPFRATSAAQVGQRFGFGSPLHLAALQL 79
EF N + T Q ++GQ +T A P +S A V G GS LH
Sbjct: 20 EFDN-SQANTASTTQRTLIIGQTRAGSTLPANIPLLVSSTATVAGLSGAGSMLHGQMAAY 78
Query: 80 LPDNNDGLRSIPLTIFPLNDNXXXXXXXXXXXXXXXXXXTATIRVRIGSQRSSLITIPTG 139
L ++ G + + PL+D T I + IG R + T
Sbjct: 79 LANDTAG----EIYLLPLSDADSMVAAIGKITINTAASETGVISLYIGGIRVQTTVVATD 134
Query: 140 TTASQAAALIVAGIQGNTFMPVT--SEVDATNDNEVNVA-AKWKGLSGN--DLVVSIEGA 194
+ ++ AAAL A I+ +PVT D ++ V V AK KG GN DL ++ G
Sbjct: 135 SVSTIAAAL-AAAIENQPELPVTVVHTGDMVSEGAVVVLEAKNKGAHGNNIDLRLNYLGN 193
Query: 195 VAG------LTVAITQXXXXXXXXXXXXXXXXFGENVWYTQIVSCFNTANTDALNAFETF 248
G L + IT + + I +TA+ DA+ F +
Sbjct: 194 AGGEITPESLVLMITPMAGGAGAPELADGLANLQDRTFDFIINPYTDTASLDAIKDFLSD 253
Query: 249 GEGRWN---PILRRPLAVFTGTNETNANTLAAIGDARRAQRTNLITPVPGAQNLPCEI-- 303
GRW+ + A +GT L A G+ R Q +L+ G N P
Sbjct: 254 STGRWSYRQQLYGHSFAAQSGTY----GQLTAAGELRNDQHASLL----GIHNSPTPAHI 305
Query: 304 -XXXXXXXXXXXXNNDPASDFARLTLPGLTPGTDAQQWSHTQRDLLVKAGIST-SIVRDG 361
NDP L + G+ + +++ T+R+ L+ +GIST ++ D
Sbjct: 306 WSAAYVGAIAQSLRNDPGRPLQTLAIRGVLAPPLSSRFTLTERNNLLHSGISTVTVADDS 365
Query: 362 VAEISDTVTTY 372
++ + +TTY
Sbjct: 366 TVQVENIITTY 376
>ref|NP_599043.1| (NC_003444) tail sheath protein [Shigella flexneri bacteriophage V]
gb|AAL89420.1| (U82619) tail sheath protein [Shigella flexneri bacteriophage V]
Length = 498
Score = 59.3 bits (142), Expect = 9e-008
Identities = 65/297 (21%), Positives = 111/297 (36%), Gaps = 15/297 (5%)
Query: 119 TATIRVRIGSQRSSLITIPTGTTASQAAALIVAGIQGNTFMPVTSEVDATNDNEVNVAAK 178
+ T+ V +G R + G + A+ I I +P T+ A V + A+
Sbjct: 113 SGTVNVYVGRTRVQA-PVTNGDNVTTIASSIQDAINAVPTLPFTASSSA---GVVTLTAR 168
Query: 179 WKGLSGNDLVVSI--------EGAVAGLTVAITQXXXXXXXXXXXXXXXXFGENVWYTQI 230
KGL GN++ VS+ E AG+ +A+ + +
Sbjct: 169 HKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIG 228
Query: 231 VSCFNTANTDALNAFETFGEGRWNPILRRPLAVFTGTNETNANTLAAIGDARRAQRTNLI 290
+ +TA+ + L GRW+ + V+T T + + A GD Q L
Sbjct: 229 LPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNA-GDQFNQQHITLA 287
Query: 291 TPVPGAQNLPCEIXXXXXXXXXXXXNNDPASDFARLTLPGLTPGTDAQQWSHTQRDLLVK 350
Q E+ NDPA L G+ P ++++ T++ L+
Sbjct: 288 GYEKETQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQTLLS 347
Query: 351 AGISTSIVRDGVAEISDTVTTYHPT--GEVNPGYLFYKDTVKASNVLYNLDLIFNTR 405
G++T+ V GV I VTTY G + YL + ++ VL L + ++
Sbjct: 348 HGVATAYVESGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSK 404
>ref|NP_543096.1| (NC_003356) putative sheath protein [Bacteriophage P27]
emb|CAC83562.1| (AJ298298) putative sheath protein [Bacteriophage P27]
Length = 498
Score = 58.9 bits (141), Expect = 1e-007
Identities = 79/373 (21%), Positives = 134/373 (35%), Gaps = 29/373 (7%)
Query: 119 TATIRVRIGSQRSSL-ITIPTGTTASQAAALIVAGIQGNTFMPVTSEVDATNDNEVNVAA 177
+ T+ + +G RSS+ + + G A+ A I + G +P + DA V + A
Sbjct: 113 SGTLSLYVG--RSSVQVPVVNGDDATAVATAIKEAVNGVITLPFAASSDA---GVVTLTA 167
Query: 178 KWKGLSGNDLVVSI--------EGAVAGLTVAITQXXXXXXXXXXXXXXXXFGENVWYTQ 229
+ KGL GN+L V + E AGL V G+ +
Sbjct: 168 RHKGLYGNELPVCLNYYGSGGGEILPAGLQVVTEAGTAGSGAPDLTAAVAAMGDEAFDFI 227
Query: 230 IVSCFNTANTDALNAFETFGEGRWNPILRRPLAVFTGTNETNANTLAAIGDARRAQRTNL 289
+ + A+ + + GRW+ + V+T T + + A GD Q L
Sbjct: 228 GLPFNDAASINMMMTEMNDSSGRWSYARQLYGHVYTAKLGTLSELVNA-GDMHNQQHITL 286
Query: 290 ITPVPGAQNLPCEIXXXXXXXXXXXXNNDPASDFARLTLPGLTPGTDAQQWSHTQRDLLV 349
Q+ E+ NDPA L G+ P +++ T++ L+
Sbjct: 287 AGYEKETQSPVDELVASRLAREAVFIRNDPARPTQTGELVGMLPAPKGKRFIMTEQQTLL 346
Query: 350 KAGISTSIVRDGVAEISDTVTTY--HPTGEVNPGYLFYKDTVKASNVLYNLDLIFNTREW 407
G++T+ V G I +VTTY + G + YL + ++ VL L + ++
Sbjct: 347 SHGVATAYVEGGTLRIQRSVTTYKKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYG 406
Query: 408 ------DGAPLIPDDQPTTNARAKRPKXXXXXXXXXXXXXXXXXIISDVPFTLENIRASI 461
DG P T P I+ + + +
Sbjct: 407 RHKLANDGTRFGPGQAIVT------PAVIKGELLATYRQMERAGIVENYDLFKQYLIVER 460
Query: 462 NDQNPNRLDIIFP 474
+ NPNRL+ +FP
Sbjct: 461 DADNPNRLNTLFP 473
>ref|NP_284552.1| (NC_003116) hypothetical protein [Neisseria meningitidis Z2491]
pir||E81810 hypothetical protein NMA1841 [imported] - Neisseria meningitidis
(group A strain Z2491)
emb|CAB71993.1| (AJ391256) hypothetical protein [Neisseria meningitidis]
emb|CAB85065.1| (AL162757) hypothetical protein [Neisseria meningitidis Z2491]
Length = 475
Score = 52.4 bits (124), Expect = 1e-005
Identities = 48/186 (25%), Positives = 71/186 (37%), Gaps = 13/186 (6%)
Query: 19 KTEFRNFNTGRTFFLPQHVALLGQGNTAATYALTPFRAT-SAAQVGQRFGFGSPLHLAAL 77
K RN T + Q V ++ Q N A LT SAA ++G GS HL
Sbjct: 27 KLAVRNLPTNK-----QRVLIVAQHNNPALGELTELENVFSAADAAAKYGAGSMAHLMVT 81
Query: 78 QLLPDNNDGLRSIPLTIFPLNDNXXXXXXXXXXXXXXXXXXTATIRVRIGSQRSSLITIP 137
+ L++ + DN +RV I + + I I
Sbjct: 82 AAIK----AYAYADLSLITVADNKAGVAAGGKITLSGTANTQGVLRVSIANADTLTIGIG 137
Query: 138 TGTTASQAAALIVAGIQGNTFMPVTSEVDATNDNEVNVAAKWKGLSGNDLVVSIEGAVAG 197
TA+ AA + A I +PVT+ V + V + AK KG +GN + + G
Sbjct: 138 AEDTAATVAAAVKAAIDAVPDLPVTATV---AEAVVTLTAKNKGTAGNAIRIKTSNTAEG 194
Query: 198 LTVAIT 203
+T A+T
Sbjct: 195 ITAAVT 200
>ref|NP_439661.1| (NC_000907) sheath protein gpL (muL) [Haemophilus influenzae Rd]
sp|P44233|VPL_HAEIN Mu-like prophage FluMu tail sheath protein
pir||I64033 hypothetical protein HI1511 - Haemophilus influenzae (strain Rd
KW20)
gb|AAC23158.1| (U32827) sheath protein gpL (muL) [Haemophilus influenzae Rd]
Length = 487
Score = 50.1 bits (118), Expect = 6e-005
Identities = 74/364 (20%), Positives = 130/364 (35%), Gaps = 30/364 (8%)
Query: 45 TAATYALTPFRATSAAQVGQRFGFGSPLHLAALQLLPDNNDGLRSIPLTIFPLNDNXXXX 104
T + A R ++ AQ FG GS L N+ ++ L + PL++
Sbjct: 46 TGSAKAGEAVRVSAYAQAKTLFGRGSQLAEMVKTFKAHNS----TLDLWVLPLDEAPSGA 101
Query: 105 XXXXXXXXXXXXXXTATIRVRI-GSQRSSLITIPTGTTASQAAALIVAGIQGNTFMPVTS 163
T + I G+ + +T +G TA A + I + +PV +
Sbjct: 102 KATGSVQITGTATQAGTFSLMIAGNNYKTAVT--SGDTADVVAGKLQKLIAADQDVPVVA 159
Query: 164 EVDATNDNEVNVAAKWKGLSGNDLVV-----SIEGAVAGLTVAITQXXXXXXXXXXXXXX 218
V N + + ++KG +GN++ + S E GL IT
Sbjct: 160 TVAG---NTITLTCRFKGETGNEIDLRCNYYSGEAFPEGLKATITDMKNGAVNPDMSVAI 216
Query: 219 XXFGENVWYTQIVSCFNTANTDALNAFETFGEGRWNPILRRPLAVFTGTNETNANTLAAI 278
G W+ I++ F +T++LN RW P+ + F T+A
Sbjct: 217 TGLGAE-WWNYIINPFT--DTESLNLLRADLVKRWGPLKQIDGICFMAKRGTHAEVTTFA 273
Query: 279 GDARRAQRTNLITPVPGAQNLPCEIXXXXXXXXXXXXNN---DPASDFARLTLPGLTPGT 335
QR + + + P + DPA L + L P +
Sbjct: 274 -----EQRNDYLFSLLATHKAPQPAYLWASAYAAVVAGSLAIDPARPVQTLVMD-LLPPS 327
Query: 336 DAQQWSHTQRDLLVKAGISTSIVRDG-VAEISDTVTTYHPT--GEVNPGYLFYKDTVKAS 392
+ +W +R+ L+ +G+ST V G ++ +T Y G+ + YL+ + S
Sbjct: 328 MSDRWDLPERNTLLYSGVSTYTVNAGSQPQVEAAITMYRKNAFGDNDESYLYVETIATLS 387
Query: 393 NVLY 396
+ Y
Sbjct: 388 YLRY 391
>ref|NP_274135.1| (NC_003112) phage sheath protein [Neisseria meningitidis MC58]
pir||E81121 phage sheath protein NMB1104 [imported] - Neisseria meningitidis
(group B strain MD58)
gb|AAF41495.1| (AE002460) phage sheath protein [Neisseria meningitidis MC58]
Length = 469
Score = 43.1 bits (100), Expect = 0.007
Identities = 42/167 (25%), Positives = 63/167 (37%), Gaps = 10/167 (5%)
Query: 34 PQHVALLGQGNTAATY-ALTPFRATSAAQVGQRFGFGSPLHLAALQLLPDNNDGLRSIPL 92
PQ V ++ TA AL P + S A+ FG GS HL Q +N + L
Sbjct: 35 PQKVLMVAPMLTAGIQPALEPVQLFSDAEAADLFGQGSLAHLMVRQAFANNP----YLDL 90
Query: 93 TIFPLNDNXXXXXXXXXXXXXXXXXXTATIRVRIGSQRSSLITIPTGTTASQAAALIVAG 152
T+ + D+ + + IG ++ S + TG TA+ A +
Sbjct: 91 TVIGIADHSAGVQATATVTLSGTATAPGVVEITIGGKQVST-AVNTGETAATVADRLKTA 149
Query: 153 IQGNTFMPVTSEVDATNDNEVNVAAKWKGLSGNDLVVSIEGAVAGLT 199
I S A V + AK KG GN+ +++ GLT
Sbjct: 150 ITAADVTVTASGSGAA----VTLTAKHKGEIGNESGLTVSTGNTGLT 192
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.318 0.134 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 248,250,562
Number of Sequences: 919193
Number of extensions: 9236449
Number of successful extensions: 18731
Number of sequences better than 5.0e-02: 7
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 18723
Number of HSP's gapped (non-prelim): 7
length of query: 500
length of database: 288,098,425
effective HSP length: 124
effective length of query: 376
effective length of database: 174,118,493
effective search space: 65468553368
effective search space used: 65468553368
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 93 (40.4 bits)
Query= orf13 [3530-3658]
(127 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.310 0.128 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 67,676,599
Number of Sequences: 919193
Number of extensions: 2593983
Number of successful extensions: 7316
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 7316
Number of HSP's gapped (non-prelim): 0
length of query: 127
length of database: 288,098,425
effective HSP length: 103
effective length of query: 24
effective length of database: 193,421,546
effective search space: 4642117104
effective search space used: 4642117104
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 83 (36.6 bits)
Query= orf14 [3684-3818]
(133 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.318 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 73,286,568
Number of Sequences: 919193
Number of extensions: 2912039
Number of successful extensions: 7739
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 7739
Number of HSP's gapped (non-prelim): 0
length of query: 133
length of database: 288,098,425
effective HSP length: 109
effective length of query: 24
effective length of database: 187,906,388
effective search space: 4509753312
effective search space used: 4509753312
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 83 (36.6 bits)
Query= orf15 [3850-3885]
(34 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.326 0.144 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 16,641,315
Number of Sequences: 919193
Number of extensions: 281290
Number of successful extensions: 570
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 570
Number of HSP's gapped (non-prelim): 0
length of query: 34
length of database: 288,098,425
effective HSP length: 10
effective length of query: 24
effective length of database: 278,906,495
effective search space: 6693755880
effective search space used: 6693755880
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 84 (37.0 bits)
Query= orf16 [3923-4505]
(581 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_058453.1| (NC_002321) orf 15 [Staphylococcus aureus ... 172 9e-042
ref|NP_061640.1| (NC_002486) phi PVL ORF 15 and 16 homologu... 172 9e-042
gb|AAL82371.1| (AF424783) tail length tape measure protein ... 171 2e-041
ref|NP_310668.1| (NC_002695) putative tail protein [Escheri... 163 4e-039
ref|NP_288365.1| (NC_002655) putative tail fiber component ... 161 2e-038
pir||T13525 hypothetical protein 36 - Bacillus phage phi-10... 153 4e-036
ref|NP_108725.1| (NC_002703) unknown [Bacteriophage Tuc2009... 153 6e-036
ref|NP_112708.1| (NC_002747) TMP [Lactococcus lactis bacter... 152 1e-035
ref|NP_075512.1| (NC_002661) orf1374 [Staphylococcus aureus... 145 2e-033
ref|NP_490625.1| (NC_003278) orf25; similar to T gene of P2... 132 1e-029
ref|NP_046782.1| (NC_001895) gpT [Enterobacteria phage P2] ... 120 4e-026
ref|NP_052273.1| (NC_001317) G protein [Enterobacteria phag... 112 8e-024
ref|NP_471714.1| (NC_003212) similar to hypothetical protei... 111 2e-023
gb|AAK85310.1| (AY043263) phage-related tail protein [Wolba... 48 3e-004
>ref|NP_058453.1| (NC_002321) orf 15 [Staphylococcus aureus bacteriophage PVL]
pir||T00148 hypothetical protein 15 - Staphylococcus aureus phage phi PVL
dbj|BAA31888.1| (AB009866) orf 15 [Staphylococcus aureus bacteriophage PVL]
Length = 694
Score = 172 bits (436), Expect = 9e-042
Identities = 115/384 (29%), Positives = 191/384 (48%), Gaps = 25/384 (6%)
Query: 37 SRAGIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMA 96
++ GIEF+ + + G + ++ L A+++G + ++A+A
Sbjct: 179 AKKGIEFDDSMRKVKATSGAT----GEEFEALKKKAREMGATTKFSASDSAEALNYMALA 234
Query: 97 GFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 156
G+D++Q + L ++ LA A+G +L +DI TD L AFGL KDS LA DVL
Sbjct: 235 GWDSKQMMEGLSGVMDLAAASGEELGAVSDIVTDGLTAFGLKAKDSGHLA-------DVL 287
Query: 157 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFF 216
A+TS+ ANT + + E + AP A A G +IE S IG+M+N GIK E AGTA++ F
Sbjct: 288 AQTSSKANTDVRGLGEAFKYVAPVAGALGYTIEDTSIAIGLMSNAGIKGEKAGTALRTMF 347
Query: 217 LRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGA 276
L++P + RLGI + DS G M ++ L + + Q+ + +FG
Sbjct: 348 TNLSSPTRAMGNEMERLGISITDSNGKMIPMRKLLDQLREKFKHLSKDQQASSAATIFGK 407
Query: 277 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 336
E ++G L +INA + + ++ S+ GA+ RMA + + G LRTLRS +E +A+
Sbjct: 408 EAMSGALAIINASDEDYQKLTKSIDSSTGASKRMADTMESGLGGKLRTLRSQLEELALTI 467
Query: 337 FELSGGSMRDVVDQATAWLRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATL-W 395
++ +++ +V + V ++ + +I V G L V L +
Sbjct: 468 YDRIEPALKIIVSAFS------------KVVTWVTKLPTSIQLAVVGFGLFVAVLGPLVF 515
Query: 396 AFNTIVSTITGAITLLN-LVIAAN 418
F +S + A+T+L L+I N
Sbjct: 516 MFGLFISVMGNAMTVLGPLLINVN 539
>ref|NP_061640.1| (NC_002486) phi PVL ORF 15 and 16 homologue [Staphylococcus aureus
prophage phiPV83]
dbj|BAA97857.1| (AB044554) phi PVL ORF 15 and 16 homologue [Staphylococcus aureus
prophage phiPV83]
Length = 1313
Score = 172 bits (436), Expect = 9e-042
Identities = 115/384 (29%), Positives = 191/384 (48%), Gaps = 25/384 (6%)
Query: 37 SRAGIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMA 96
++ GIEF+ + + G + ++ L A+++G + ++A+A
Sbjct: 179 AKKGIEFDDSMRKVKATSGAT----GEEFEALKKKAREMGATTKFSASDSAEALNYMALA 234
Query: 97 GFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 156
G+D++Q + L ++ LA A+G +L +DI TD L AFGL KDS A DVL
Sbjct: 235 GWDSKQMMEGLSGVMDLAAASGEELGAVSDIVTDGLTAFGLKAKDSGHFA-------DVL 287
Query: 157 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFF 216
A+TS+ ANT + + E + AP A A G +IE S IG+M+N GIK E AGTA++ F
Sbjct: 288 AQTSSKANTDVRGLGEAFKYVAPVAGALGYTIEDTSIAIGLMSNAGIKGEKAGTALRTMF 347
Query: 217 LRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGA 276
L++P + RLGI + DS G M ++ L + + Q+ + +FG
Sbjct: 348 TNLSSPTRAMGNEMERLGISITDSNGKMIPMRKLLDQLREKFKHLSKDQQASSAATIFGK 407
Query: 277 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 336
E ++G L +INA + + ++ S+ GA+ RMA + + G LRTLRS +E +A+
Sbjct: 408 EAMSGALAIINASDEDYQKLTKSIDSSTGASKRMADTMESGLGGKLRTLRSQLEELALTI 467
Query: 337 FELSGGSMRDVVDQATAWLRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATL-W 395
++ +++ +V + V ++ + +I V G L A V L +
Sbjct: 468 YDRIEPALKIIVSAFS------------KVVTWVTKLPTSIQLAVIGFGLFAAVLGPLVF 515
Query: 396 AFNTIVSTITGAITLLN-LVIAAN 418
F +S + A+T+L L+I N
Sbjct: 516 MFGLFISVMGNAMTVLGPLLINVN 539
>gb|AAL82371.1| (AF424783) tail length tape measure protein [Staphylococcus aureus
phage phi 13]
Length = 1550
Score = 171 bits (434), Expect = 2e-041
Identities = 114/384 (29%), Positives = 191/384 (49%), Gaps = 25/384 (6%)
Query: 37 SRAGIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMA 96
++ GIEF+ + + G + ++ L A+++G + ++A+A
Sbjct: 179 AKKGIEFDDSMRKVKATSGAT----GEEFEALKKKAREMGATTKFSASDSAEALNYMALA 234
Query: 97 GFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 156
G+D++Q + L ++ LA A+G +L +DI TD L AFGL KDS A D+L
Sbjct: 235 GWDSKQMMEGLSGVMDLAAASGEELEAVSDIVTDGLTAFGLKAKDSGHFA-------DIL 287
Query: 157 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFF 216
A+TS+ ANT + + E + AP A A G +IE S IG+M+N GIK E AGTA++ F
Sbjct: 288 AQTSSKANTDVRGLGEAFKYVAPVAGALGYTIEDTSIAIGLMSNAGIKGEKAGTALRTMF 347
Query: 217 LRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGA 276
L++P + RLGI + DS G M ++ L + + Q+ + +FG
Sbjct: 348 TNLSSPTRAMGNEMERLGISITDSNGKMIPMRKLLDQLREKFKHLSKDQQASSAATIFGK 407
Query: 277 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 336
E ++G L +INA + + ++ S+ GA+ RMA + + G LRTLRS +E +A+
Sbjct: 408 EAMSGALAIINASDEDYQKLTKSIDSSTGASKRMADTMESGLGGKLRTLRSQLEELALTI 467
Query: 337 FELSGGSMRDVVDQATAWLRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATL-W 395
++ +++ +V + V ++ + +I V G L A V L +
Sbjct: 468 YDRIEPALKIIVSAFS------------KVVTWVTKLPTSIQLAVVGFGLFAAVLGPLVF 515
Query: 396 AFNTIVSTITGAITLLN-LVIAAN 418
F +S + A+T+L L+I N
Sbjct: 516 MFGLFISVMGNAMTVLGPLLINVN 539
>ref|NP_310668.1| (NC_002695) putative tail protein [Escherichia coli O157:H7]
dbj|BAB36064.1| (AP002559) putative tail protein [Escherichia coli O157:H7]
Length = 696
Score = 163 bits (413), Expect = 4e-039
Identities = 122/399 (30%), Positives = 182/399 (45%), Gaps = 47/399 (11%)
Query: 89 GIEFLAMAGFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALN 148
G FLAMAGF Q AALP ++++A A G++L DI ++ L F L A
Sbjct: 247 GQSFLAMAGFTPQAIQAALPGVLNMALAGGVELGETADIGSNILTQFNLT-------ADQ 299
Query: 149 LARVNDVLAKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIA 208
+ RV D L T NT + + ET++ P A G S+E +AM G++ANNG++ A
Sbjct: 300 MDRVGDTLTAAFTRTNTDLRALGETMKYTGPVAAKLGISLEEAAAMAGMLANNGLRGSDA 359
Query: 209 GTAVQNFFLRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLA 268
GTA++ RLA+P A L+ LG+ VAD+ G MR DV+ DL A + G+ +++
Sbjct: 360 GTAMRASLSRLASPPKAAADALKELGVSVADARGKMRPMEDVLLDLYKATQKYGQVDQVS 419
Query: 269 VIQKVFGAEGLAG-NLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRS 327
+ + G E G V AG L + L A G ADR+AK + D++ G L+ L S
Sbjct: 420 FFKDIAGEEAFVGLQTLVAAAGSGELQKLTRELQGARGEADRVAKVMADNLDGDLKNLDS 479
Query: 328 TVESVAIRFFELSGGSMRDVVDQATAWLR-----------ANRELIAQNVA--------- 367
E + IR +L G +R V T L R+L+ A
Sbjct: 480 AWEGLRIRISDLVDGPLRSVTQWLTRVLEKITSLAQAHPVLTRQLLIAGGALLAMTATIG 539
Query: 368 ----------GFLNTIVENIDSIVRGIKLIATVFATLWAFNTIVSTITGAITLLNLVIAA 417
G L T+ D + R + ++ V LW +TG+++LL VI A
Sbjct: 540 SLSLVIGVLYGKLATLRLGFDILTRSMNVV-RVLPALWGM------VTGSVSLLGGVIGA 592
Query: 418 NPXXXXXXXXXXXXXXXXXXXYMHWEPIKSFFVNLWDGI 456
+ +W+PI++FF ++ GI
Sbjct: 593 --LFSPVGLIVAALAGAAVLIWKYWDPIRAFFAGVFSGI 629
>ref|NP_288365.1| (NC_002655) putative tail fiber component of prophage CP-933T
[Escherichia coli O157:H7 EDL933]
gb|AAG56919.1|AE005413_13 (AE005413) putative tail fiber component of prophage CP-933T
[Escherichia coli O157:H7 EDL933]
Length = 696
Score = 161 bits (407), Expect = 2e-038
Identities = 121/399 (30%), Positives = 181/399 (45%), Gaps = 47/399 (11%)
Query: 89 GIEFLAMAGFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALN 148
G FLAMAGF Q AALP ++++A A G++L DI ++ L F L A
Sbjct: 247 GQSFLAMAGFTPQAIQAALPGVLNMALAGGVELGETADIGSNILTQFNLT-------ADQ 299
Query: 149 LARVNDVLAKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIA 208
+ RV D L T NT + + ET++ P A G S+E +AM G++ANNG++ A
Sbjct: 300 MDRVGDTLTAAFTRTNTDLRALGETMKYTGPVAAKLGISLEEAAAMAGMLANNGLRGSDA 359
Query: 209 GTAVQNFFLRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLA 268
GTA++ RLA+P A L+ LG+ VAD+ G MR DV+ DL A + G+ +++
Sbjct: 360 GTAMRASLSRLASPPKAAADALKELGVSVADARGKMRPMEDVLLDLYKATQKYGQVDQVS 419
Query: 269 VIQKVFGAEGLAG-NLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRS 327
+ + G G V AG L + L A G ADR+AK + D++ G L+ L S
Sbjct: 420 FFKDIAGEXAFVGLQTLVAAAGSGELQKLTRELQGARGEADRVAKVMADNLDGDLKNLDS 479
Query: 328 TVESVAIRFFELSGGSMRDVVDQATAWLR-----------ANRELIAQNVA--------- 367
E + IR +L G +R V T L R+L+ A
Sbjct: 480 AWEGLRIRISDLVDGPLRSVTQWLTRVLEKITSLAQAHPVLTRQLLIAGGALLAMTATIG 539
Query: 368 ----------GFLNTIVENIDSIVRGIKLIATVFATLWAFNTIVSTITGAITLLNLVIAA 417
G L T+ D + R + ++ V LW +TG+++LL VI A
Sbjct: 540 SLSLVIGVLYGKLATLRLGFDILTRSMNVV-RVLPALWGM------VTGSVSLLGGVIGA 592
Query: 418 NPXXXXXXXXXXXXXXXXXXXYMHWEPIKSFFVNLWDGI 456
+ +W+PI++FF ++ GI
Sbjct: 593 --LFSPVGLIVAALAGAAVLIWKYWDPIRAFFAGVFSGI 629
>pir||T13525 hypothetical protein 36 - Bacillus phage phi-105
dbj|BAA36642.1| (AB016282) ORF36 [bacteriophage phi-105]
Length = 1018
Score = 153 bits (387), Expect = 4e-036
Identities = 105/373 (28%), Positives = 179/373 (47%), Gaps = 16/373 (4%)
Query: 38 RAGIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMAG 97
+ GI+FE +++ G + + K L + AK++G FLAMAG
Sbjct: 173 QVGIDFEKQMSKVQAISGGSAAEIAK----LREQAKELGATTVFTASQAADAQGFLAMAG 228
Query: 98 FDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLA 157
F A+P ++SLA A ++L A DI ++ + AF L K+S +DV+A
Sbjct: 229 FKVNDIYDAMPGMLSLAAAGQLELGMAADITSNIMSAFALKAKES-------GHASDVIA 281
Query: 158 KTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFFL 217
+ +ANT ++ M E ++ AP A + G +E A I + G++ IAG A +
Sbjct: 282 YAAANANTNVEQMGEAMKFLAPNANSLGWGMEESPAAIMAFGDAGLQGSIAGQAFGTSLI 341
Query: 218 RLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGAE 277
RLA+P G+A K++++LG D+AGNM+ +VV ++ L M + Q+ A ++ + GAE
Sbjct: 342 RLASPTGKASKLVKKLGFNFFDAAGNMKSMPEVVEEMEKGLKGMTKEQQAAALKTIVGAE 401
Query: 278 GLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRFF 337
++ G AL + L ++GAA +MA + D+ GS+ +S +E I+
Sbjct: 402 AYKHWAVLLQKGSKALGDNTKALEKSDGAAKKMADTMLDNAHGSIVAFQSALEGAKIKLT 461
Query: 338 ELSGGSMRDVVDQATAWLRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATLWAF 397
E ++ D+ ++ + + L + V T V + G+ + T ATL A
Sbjct: 462 ESLLPALGDLANKGSDLIMMFNNLDSGTVQTIAKTAV-----LATGVLGVTTAVATLTAG 516
Query: 398 NTIVSTITGAITL 410
+ TG + L
Sbjct: 517 IGALLAFTGPVGL 529
>ref|NP_108725.1| (NC_002703) unknown [Bacteriophage Tuc2009]
gb|AAK19873.1|AF109874_46 (AF109874) unknown [Bacteriophage Tuc2009]
Length = 1025
Score = 153 bits (386), Expect = 6e-036
Identities = 114/441 (25%), Positives = 203/441 (45%), Gaps = 23/441 (5%)
Query: 37 SRAGIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMA 96
++ G +FE ++R G + ++++L A +G G+E LA A
Sbjct: 69 AKIGGDFESQMSRVKAISGATGS----SFEELRQQAIDLGAKTAFSAKESASGMENLASA 124
Query: 97 GFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 156
GF+A++ + A+P ++ LA +G D+ A++ A +L F L S +A +V
Sbjct: 125 GFNAKEIMEAMPGLLDLAAVSGGDVGLASENAATALRGFNLDASQSGHVA-------NVF 177
Query: 157 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFF 216
AK + + N + M E ++ AP A + G SIE VSA IG+M++ GIK AGT+++
Sbjct: 178 AKAAANTNAEVGDMGEAMKYIAPVANSMGLSIEEVSAAIGIMSDAGIKGSQAGTSLRGAL 237
Query: 217 LRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGA 276
RLA P + + LG+ DS G M+ D +G L A + Q+ + ++G
Sbjct: 238 SRLAKPTDAMQAKMDELGLSFYDSEGKMKPLKDQIGMLKDAFKGLTPEQQQNALVTLYGQ 297
Query: 277 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 336
E L+G + +I+ G D L + +L +++GAAD+MAK + D+M SL + +ES AI
Sbjct: 298 ESLSGMIALIDKGPDKLGKLTESLKNSDGAADKMAKTMQDNMNSSLEQMMGALESAAIVV 357
Query: 337 FELSGGSMRDVVDQATAWLR--ANRELIAQNVAGFLNTIVENIDSIV----RGIKLIATV 390
++ ++R V D + + + Q + + IV I ++ + + ++ V
Sbjct: 358 QKILAPAVRKVADSISGLVDKFVSAPEPVQKMIVTIGLIVAAIGPLLVIFGQAVLVLQRV 417
Query: 391 FATLWAFNTIVSTITGAITLLNLVIAANPXXXXXXXXXXXXXXXXXXXYMHWEPIKSFFV 450
A + ++ I G+ T ++L + Y +W+ I F
Sbjct: 418 KVGFLALRSGLALIGGSFTAISLPVLG------IIAAIAAVIAIGILVYKNWDKISKFGK 471
Query: 451 NLWDGIGIAFDTFAMNLANKW 471
+W + A + KW
Sbjct: 472 EVWANVKKFASDAAEVIKEKW 492
>ref|NP_112708.1| (NC_002747) TMP [Lactococcus lactis bacteriophage TP901-1]
gb|AAG32164.1|AF252967_1 (AF252967) tape measure protein [Lactococcus lactis bacteriophage
TP901-1]
gb|AAK38062.1| (AF304433) TMP [Lactococcus lactis bacteriophage TP901-1]
Length = 937
Score = 152 bits (384), Expect = 1e-035
Identities = 115/441 (26%), Positives = 201/441 (45%), Gaps = 23/441 (5%)
Query: 37 SRAGIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMA 96
++ G +FE ++R G + ++++L A +G G+E LA A
Sbjct: 69 AKIGGDFESQMSRVKAISGATGS----SFEELRQQAIDLGAKTAFSAKESASGMENLASA 124
Query: 97 GFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 156
GF+A++ + A+P ++ LA +G D+A A++ A +L F L S +A +V
Sbjct: 125 GFNAKEIMEAMPGLLDLAAVSGGDVALASENAATALRGFNLDASQSGHVA-------NVF 177
Query: 157 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFF 216
AK + N + M E ++ AP A + G SIE VSA IG+M++ GIK AGT+++
Sbjct: 178 AKAAADTNAEVGDMGEAMKYIAPVANSMGLSIEEVSAAIGIMSDAGIKGSQAGTSLRGAL 237
Query: 217 LRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGA 276
RLA P + + LG+ DS G M+ D +G L A + Q+ + ++G
Sbjct: 238 SRLADPTDAMQAKMDELGLSFYDSEGKMKPLKDQIGMLKDAFKGLTPEQQQNALVTLYGQ 297
Query: 277 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 336
E L+G + +I+ G D L + +L +++GAAD+MAK + D+M SL + ES AI
Sbjct: 298 ESLSGMMALIDKGPDKLGKLTESLKNSDGAADKMAKTMQDNMNSSLEQMMGAFESAAIVV 357
Query: 337 FELSGGSMRDVVDQATAWLR--ANRELIAQNVAGFLNTIVENIDSIV----RGIKLIATV 390
++ ++R V D + + + Q + + IV I ++ + + + V
Sbjct: 358 QKILAPAVRKVADSISGLVDKFVSAPEPVQKMIVTIGLIVAAIGPLLVIFGQAVVTLQRV 417
Query: 391 FATLWAFNTIVSTITGAITLLNLVIAANPXXXXXXXXXXXXXXXXXXXYMHWEPIKSFFV 450
A + ++ I G+ T ++L + Y +W+ I F
Sbjct: 418 KVGFLALRSGLALIGGSFTAISLPVLG------IIAAIAAVIAIGILVYKNWDKISKFGK 471
Query: 451 NLWDGIGIAFDTFAMNLANKW 471
+W + A + KW
Sbjct: 472 EVWANVKKFASDAAEVIKEKW 492
>ref|NP_075512.1| (NC_002661) orf1374 [Staphylococcus aureus temperate phage phiSLT]
dbj|BAB21743.1| (AB045978) orf1374 [Staphylococcus aureus temperate phage phiSLT]
Length = 1374
Score = 145 bits (365), Expect = 2e-033
Identities = 89/317 (28%), Positives = 160/317 (50%), Gaps = 18/317 (5%)
Query: 53 KMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMAGFDAQQSVAALPKIVS 112
++G +K + + + A +G G+E LA GF+A+Q++ A+P ++S
Sbjct: 345 RVGAIAQASSKDLKSMSNQAVDLGAKTSKSANEVAKGMEELAALGFNAKQTMEAMPGVIS 404
Query: 113 LATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLAKTSTSANTTIDMMFE 172
A A+G ++A + ++ +FGL D+ +A D+LA+++ + I M +
Sbjct: 405 AAEASGAEMATTATVMASAINSFGLKASDANHVA-------DLLARSANDSAADIQYMGD 457
Query: 173 TIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFFLRLAAPAGEARKILRR 232
++ A A A G SIE SA I V++N+G++ AGTA++ F+RLA P+ K +++
Sbjct: 458 ALKYAGTPAKALGVSIEDTSAAIEVLSNSGLEGSQAGTALRASFIRLANPSKNTAKEMKK 517
Query: 233 LGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGAEGLAGNLGVINAGKDA 292
LGI ++D+ G +++ + M Q+LA + + G E +G L +I AG D
Sbjct: 518 LGIHLSDAKGQFVGMGELIRQFQDNMKGMTREQKLATVATIVGTEAASGFLALIEAGPDK 577
Query: 293 LVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRFFELSGGSMRDVVDQAT 352
+ Y +L ++ G + + A + D++ G+L L ES+AI +V T
Sbjct: 578 INSYSKSLKNSNGESKKAADLMKDNLKGALEQLGGAFESLAI-----------EVGKDLT 626
Query: 353 AWLRANRELIAQNVAGF 369
+RA E + + V GF
Sbjct: 627 PMIRAGAEGLTKLVDGF 643
>ref|NP_490625.1| (NC_003278) orf25; similar to T gene of P2 [bacteriophage phi CTX]
dbj|BAA36253.1| (AB008550) orf25; similar to T gene of P2 [bacteriophage phi CTX]
Length = 904
Score = 132 bits (332), Expect = 1e-029
Identities = 107/457 (23%), Positives = 205/457 (44%), Gaps = 54/457 (11%)
Query: 40 GIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMAGFD 99
G EF+ ++ AT + K Q L A+ + G FL G++
Sbjct: 212 GYEFDATMS-ATQAVTRIERKDDPQMQALRQQARTLPLSSKFTDKEVAQGQYFLGRTGYN 270
Query: 100 AQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLAKT 159
A+Q + A+P +++LA A +DL DIA++ A G+ + Q+A DVL
Sbjct: 271 AKQILGAMPGMLNLAAAGDMDLGDTADIASNIQTAMGIPAEKMDQVA-------DVLTAA 323
Query: 160 STSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFFLRL 219
T N I M+ ++++ +A GQS+ETV+ ++ N G++ +AGT++++ RL
Sbjct: 324 FTRNNVDIRMLGDSLKYSAGVGREYGQSLETVTTATALLGNAGVQGSMAGTSMRSVLTRL 383
Query: 220 AAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGAEGL 279
K + +LG++ D+ GNMRD D++ D+N + MG QR A+ + + G +
Sbjct: 384 G-----TSKAVAKLGVQTKDANGNMRDMLDILKDINKKTAGMGNVQRGAIFKDIAGQYAV 438
Query: 280 AGNLGVINAGKDALVE-YRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRFFE 338
++ A + + R +L ++EG A R+A D++ G + L + +E++++ F+
Sbjct: 439 TSFGTLMRAVEGGQFQTMRESLNNSEGEAARVAATQLDNLKGDMTMLHAALENISVELFD 498
Query: 339 LSGGSMRDVVDQAT-------AWLRANREL---IAQNVAGFLNTIVENIDSIVRGIK-LI 387
+ +R++ + +L+AN ++ I VA F + ++ + S+ + ++
Sbjct: 499 KNSPWLRELAADLSHLLHNIGEFLKANPQVSKGIVITVAAF-SALMATVGSLAITLAGIL 557
Query: 388 ATVFATLWAFNTIVSTITGAITLLNLV----------------------------IAANP 419
+ A + TI + G I LL L+ +AANP
Sbjct: 558 GPMIAVRFMLGTIGIRLPGLIGLLKLLFAPIRMLAGLLIGPLVTALRVVSIALWGLAANP 617
Query: 420 XXXXXXXXXXXXXXXXXXXYMHWEPIKSFFVNLWDGI 456
Y +W+ +K++ + +W+ I
Sbjct: 618 VVLAIAAVVAVLAGAAYLIYRNWDAVKAYLLGMWEEI 654
>ref|NP_046782.1| (NC_001895) gpT [Enterobacteria phage P2]
gb|AAD03293.1| (AF063097) gpT [Enterobacteria phage P2]
Length = 815
Score = 120 bits (301), Expect = 4e-026
Identities = 108/472 (22%), Positives = 186/472 (38%), Gaps = 46/472 (9%)
Query: 40 GIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMAGFD 99
G EF + +G V K + L A+Q+G +A AG D
Sbjct: 212 GYEFAQKNSELQAVIG--VAKDSAEMAALRKQARQLGDNTAASADDAAGAQIIIAKAGGD 269
Query: 100 AQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLAKT 159
AA P +++A A + + AF L +A + DVL+ T
Sbjct: 270 VDAIQAATPVTLNMALANRRTMEENAALLMGMKSAFQLSND-------KVAHIGDVLSMT 322
Query: 160 STSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFFLRL 219
D M + + AAP A AG SIE +AM+G + + I +AGT + RL
Sbjct: 323 MNKTAADFDGMSDALTYAAPVAKNAGVSIEETAAMVGALHDAKITGSMAGTGSRAVLSRL 382
Query: 220 AAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVAL--SQMGERQRLAVIQKVFGAE 277
AP G+A L+ LG++ +DS GN R F ++ ++ + +++G Q+ ++ +FG E
Sbjct: 383 QAPTGKAWDALKELGVKTSDSKGNTRPIFTILKEMQASFEKNRLGTAQQAEYMKTIFGEE 442
Query: 278 GLAGNLGVINAGKDALVEYRNTLLSA-EGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 336
+ ++ A ++ A +G + + D++ G + +S E+V
Sbjct: 443 ASSAAAVLMTAASTGKLDKLTAAFKASDGKTAELVNIMQDNLGGDFKAFQSAYEAVGTDL 502
Query: 337 FELSGGSMRDVVDQAT-------AWLRANRELI---------AQNVAGFLNTIVENIDSI 380
F+ G++R + AT W++ N+ L A + G + I +
Sbjct: 503 FDQQEGALRKLTQTATKYVLKLDGWIQKNKSLASTIGIIAGGALALTGIIGAIGLVAWPV 562
Query: 381 VRGIKLIATVFATLWA-FNTIVSTITGAITLLNLVIAANPXXXXXXXXXXXXXXXXXXXY 439
+ GI I + A F T+ S + AI ++ + A
Sbjct: 563 ITGINAIIAAAGAMGAVFTTVGSAVMTAIGAISWPVVA---------VVAAIVAGALLIR 613
Query: 440 MHWEPIKSFFVNLWDGIGIAFDTFAMNLANKWEALIAPIRSSIQWLLEQADA 491
+WEP+ +FF + +G+ AF L P++ WL E+ A
Sbjct: 614 KYWEPVSAFFGGVVEGLKAAFAPVG--------ELFTPLKPVFDWLGEKLQA 657
>ref|NP_052273.1| (NC_001317) G protein [Enterobacteria phage 186]
gb|AAC34170.1| (U32222) G protein [Enterobacteria phage 186]
Length = 812
Score = 112 bits (281), Expect = 8e-024
Identities = 105/449 (23%), Positives = 182/449 (40%), Gaps = 37/449 (8%)
Query: 58 VTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMAGFDAQQSVAALPKIVSLATAA 117
V K + Q L A+ VG +A +G DA A +++A +
Sbjct: 227 VDKQSPEMQALRKQARHVGDNTAASADDAASAQIIIAKSGGDAAAIQARRRVTLNMALSN 286
Query: 118 GIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLAKTSTSANTTIDMMFETIRKA 177
+ + T AF L +A + DVL+ T D + + + A
Sbjct: 287 RRTMEENAALLTGMKSAFQLSND-------KIAHIGDVLSMTMNKTAADFDGLSDALTYA 339
Query: 178 APTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFFLRLAAPAGEARKILRRLGIEV 237
AP A AG SIE +AM+G + + I +AGT + RL P +A + ++ LG++
Sbjct: 340 APVAKNAGVSIEQTAAMVGALHDAKITGSMAGTGSRAILSRLQLPPEKAFEAIKELGVKT 399
Query: 238 ADSAGNMRDAFDVVGDLNVAL--SQMGERQRLAVIQKVFGAEGLAGNLGVINAGKDALVE 295
+DS GN R F ++ ++ + + +G QR ++ +FG E + ++ A ++
Sbjct: 400 SDSKGNTRPIFSILKEMQRSFEKNNLGTSQRGEYMKTIFGEEASSAAAVLMEAASSGKLD 459
Query: 296 YRNTLLSA-EGAADRMAKRIGDDMLGSLRTLRSTVESVAIRFFELSGGSMRDVVDQAT-- 352
A +G + + K + D++ G + +S E+V F+ GS+R + AT
Sbjct: 460 RLTAAFKASDGKTEELVKVMQDNLGGDFKEFQSAYEAVGTDLFDQQEGSLRKLTQTATQY 519
Query: 353 -----AWLRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATLWAFNTIVS----- 402
W++ N+ L G + + I+ GI L+A + + N I++
Sbjct: 520 VLKLDGWIQKNKGL--ATTIGIIAGGALALIGIIGGIGLVA--WPVVMGINAIIAAAGVL 575
Query: 403 ----TITGAITLLNLVIAANPXXXXXXXXXXXXXXXXXXXYMHWEPIKSFFVNLWDGIGI 458
T+TG+ +V A +WEPI +FF + +GI
Sbjct: 576 GTVFTVTGSA----IVTALGAITWPIVAVGAAIVAGALLIRKYWEPISAFFSGVIEGIMS 631
Query: 459 AFDTFA---MNLANKWEALIAPIRSSIQW 484
AF A LA ++ L +R QW
Sbjct: 632 AFAPVAEMFAPLAPIFDGLGEKLRGVWQW 660
>ref|NP_471714.1| (NC_003212) similar to hypothetical protein [Lactobacillus casei
bacteriophage A2] [Listeria innocua]
emb|CAC97610.1| (AL596172) similar to hypothetical protein [Lactobacillus casei
bacteriophage A2] [Listeria innocua]
Length = 1601
Score = 111 bits (278), Expect = 2e-023
Identities = 74/340 (21%), Positives = 154/340 (44%), Gaps = 25/340 (7%)
Query: 37 SRAGIEFEHVI---NRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFL 93
++A ++F++ + N + G++ + Y + D A++ G++ +
Sbjct: 100 TKAYLDFDNEVTEVNSLLRESGESAKEFGDRYTQVFDYAQKASVKYGVASEQTMLGMKEM 159
Query: 94 AMAGFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVN 153
G+D Q++A++P I + A A+G D + T +L FG+++KD+ + +V
Sbjct: 160 VKKGYDINQTMASMPAIFNAARASGDDFETVMSVTTSTLEQFGMISKDTNKQMEYTNKVA 219
Query: 154 DVLAKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQ 213
DVL + M + P + + G S+ +A +G+++N GI+ + AGT ++
Sbjct: 220 DVLTYVADKTAAGFSDMGTAMNYVGPISHSLGYSLTDTAAAVGLLSNRGIEGQKAGTGLR 279
Query: 214 NFFLRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKV 273
L P+ A + + +G+ + D+ GNM+ ++ D+N +M + Q+ + + +
Sbjct: 280 GMLTSLLKPSKSAAEAMSAVGLTIEDNNGNMKTLPTLLDDINDKTKKMTKTQKNSFLTMI 339
Query: 274 FGAEGLAGNLGVINAGKDALVEYRNTLLSAEG----AADRMAK----------------- 312
FG E L+ ++ AG D+L +Y A G AD M K
Sbjct: 340 FGREPLSAVNTLLEAGGDSLRKYSKGADEANGYTKQVADNMRKAGKFGVDQFKASLEVLE 399
Query: 313 -RIGDDMLGSLRTLRSTVESVAIRFFELSGGSMRDVVDQA 351
+G ++ +L + + +F +LSG ++++ A
Sbjct: 400 QNVGQKLMPALTPIIEWANKMIDKFNDLSGEQQQNIIKWA 439
>gb|AAK85310.1| (AY043263) phage-related tail protein [Wolbachia endosymbiont of
Drosophila melanogaster]
Length = 680
Score = 48.1 bits (113), Expect = 3e-004
Identities = 83/476 (17%), Positives = 170/476 (35%), Gaps = 62/476 (13%)
Query: 38 RAGIEFEHVINRAT--VKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAM 95
+ I+FE + T V + TK + L ++++ + G
Sbjct: 110 KVAIDFESAMADVTKVVDFKKGTDEATKFAKKLKEMSRTIPLSAAELAQIAASG----GQ 165
Query: 96 AGFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDV 155
G + + ++TA + +A D +G+ + V +V
Sbjct: 166 LGIKKEDLFMFTETVAKMSTAFDMSAEQAGDSIAKLSNVYGIDVS-------KMEYVGNV 218
Query: 156 LAKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNF 215
+ S + M E + TA G I+ S+++ + G + A TA+
Sbjct: 219 INHLSDNTAAKAKDMVEALAIVGGTAKQFGLDIKETSSLVNAFVSLGKQPAKAATAINAL 278
Query: 216 FLRLAAP---AGEARKILRRLGIEVADSAGNMRD-AFDVVGDLNVALSQMGERQRLAVIQ 271
+L G+ + L ++GI + + + + + AL +M ++R ++
Sbjct: 279 LSKLQTAEEQGGDFKAALEQMGITAEEIVQRISENGEEALLYFFQALKKMDNQERSTILM 338
Query: 272 KVFGAE---GLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRST 328
K+FG E +A G N +DA+ +T R+ D LR LR+
Sbjct: 339 KLFGQEYQDDIALLAGSFNKYEDAIRLLSDTEEYKSSLQKEFQNRV-DTTASKLRLLRNA 397
Query: 329 VESVAIRFFELSGGSMRDVVDQATAWLRANRELIAQNVAGFLNTIVENIDSIVRGIKLIA 388
+ V + + +++ + + R+ L A+ I+ + +++ +K++A
Sbjct: 398 IAEVGMNLGSVMLPTLKSIAEFLQEKTRSIA-LFAEKYPTLTKAIMGTVAALI-SLKVVA 455
Query: 389 ------------TVFAT----LWAFNTIVSTITGAI-TLLNLVIAANPXXXXXXXXXXXX 431
T+F+ L F+ + +T+ A+ T L V ANP
Sbjct: 456 VGLGYGFTLLGSTIFSLKANLLGVFSFLSATVFPAVVTGLRAVTLANPIGLLITGLVTGA 515
Query: 432 XXXXXXXYMHWEPIKSFFVNLWDGIGIAFDTFAMNLANKWEALIAPIRSSIQWLLE 487
+W+ +K FF ++ W+++I PI + W+ E
Sbjct: 516 ALVIT----NWQKVKDFF------------------SSFWKSIIKPIEEAFSWIGE 549
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.319 0.133 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 232,838,276
Number of Sequences: 919193
Number of extensions: 7656501
Number of successful extensions: 21616
Number of sequences better than 5.0e-02: 29
Number of HSP's better than 0.1 without gapping: 19
Number of HSP's successfully gapped in prelim test: 10
Number of HSP's that attempted gapping in prelim test: 21555
Number of HSP's gapped (non-prelim): 46
length of query: 581
length of database: 288,098,425
effective HSP length: 126
effective length of query: 455
effective length of database: 172,280,107
effective search space: 78387448685
effective search space used: 78387448685
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 94 (40.8 bits)
Query= orf17 [4504-4922]
(417 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_274140.1| (NC_003112) phage virion protein, putative... 40 0.035
>ref|NP_274140.1| (NC_003112) phage virion protein, putative [Neisseria meningitidis
MC58]
pir||B81122 phage virion protein, probable NMB1109 [imported] - Neisseria
meningitidis (group B strain MD58)
gb|AAF41500.1| (AE002460) phage virion protein, putative [Neisseria meningitidis
MC58]
Length = 443
Score = 40.4 bits (93), Expect = 0.035
Identities = 32/129 (24%), Positives = 59/129 (44%), Gaps = 10/129 (7%)
Query: 3 WQTRLREAAYTPPSGNRLTFIYTSVSEEFDQKGGPFDFAGADGTFVQFLGVTGRRYPMTI 62
W T L++A+Y G + + S + + PF G ++ +G+TGR+ +
Sbjct: 4 WHTLLQDASYKGV-GFDIEVVDESNGKALAEHARPF----VQGIDLEDMGMTGRQVQINA 58
Query: 63 IVSGDDYDLDAAAWMQALAEQGEAILEHPAYGRL--TVAPVGTVKRSENFVNGAGQATIE 120
+ G Y + AL + G +L HP +GR+ +A + + ++V+ AG I+
Sbjct: 59 VFWGKGYAGRLKKLLDALEQPGGGVLVHPVWGRMHNMIAASWSYRHEADYVDYAG---ID 115
Query: 121 VTLFETTGA 129
+T E A
Sbjct: 116 ITFREAAEA 124
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.317 0.134 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 182,905,421
Number of Sequences: 919193
Number of extensions: 6274856
Number of successful extensions: 14984
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 14984
Number of HSP's gapped (non-prelim): 1
length of query: 417
length of database: 288,098,425
effective HSP length: 123
effective length of query: 294
effective length of database: 175,037,686
effective search space: 51461079684
effective search space used: 51461079684
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 92 (40.0 bits)
Query= orf18 [4920-5338]
(417 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_313011.1| (NC_002695) putative tail protein [Escheri... 67 3e-010
ref|NP_439666.1| (NC_000907) 43 kD tail protein [Haemophilu... 66 6e-010
ref|NP_050648.1| (NC_000929) putative tail protein [Enterob... 65 2e-009
ref|NP_284542.1| (NC_003116) hypothetical protein [Neisseri... 62 9e-009
ref|NP_599048.1| (NC_003444) tail protein [Shigella flexner... 58 2e-007
>ref|NP_313011.1| (NC_002695) putative tail protein [Escherichia coli O157:H7]
dbj|BAB38407.1| (AP002567) putative tail protein [Escherichia coli O157:H7]
Length = 374
Score = 67.0 bits (162), Expect = 3e-010
Identities = 75/290 (25%), Positives = 114/290 (38%), Gaps = 20/290 (6%)
Query: 124 QPVSVDVGGERLFSGTLVNPQPRTAANERTVRASCYSTPGVLDDCTP--PASAFPLQWDE 181
Q + + + G+ + +G L + R +A + S G L DC+ P S QW
Sbjct: 52 QSLELRIDGQPVITGWLDQVRQRISATRFQITLSGRDKTGDLVDCSAIHPGS----QWRN 107
Query: 182 ATLQTIAADLCRPFGIQV--LAPDGT-GQTFERIAVEPAEKVMAVIARLAAQRNLVVRSD 238
TL+ IA+DLC PFG+ V D T + F +E +E V + R A R ++V S+
Sbjct: 108 RTLEHIASDLCAPFGVTVRWQVNDATAARPFSTFTLENSETVADALTRAARHRGVLVTSN 167
Query: 239 EQGRLVLLRPDT-RGEPVAEFIEGQQPPISVTPTFGNQDYY-------SHVTGITPTIVG 290
G LV + + RG+ + +V + +Y G T
Sbjct: 168 AAGELVFTQAGSQRGDTLTLGENLLDLDHNVDHRLRHSEYRVRGHGRGGGHAGDALTAGT 227
Query: 291 LEGPQATVRNPHLEGVLRPYVYNADDMAEAD-LVQAVQSKAGRMFAQSATYDVPVPTWRN 349
L P TV + + RP + AD +AD Q + R A+S V W
Sbjct: 228 LAAPVGTVTDSAIHR-YRPKIVLADHAVDADGARQRAVREMRRAVARSVRLTATVRHWFR 286
Query: 350 ANGDLWRVGDFVILEAPGAQVYRRTLMQIKTVRFSATTTERSAVLELIIP 399
NG LW + + AP V R L+ + V FS ++ P
Sbjct: 287 ENGQLWDINLLTAVTAPRTGVEERDLL-VCQVEFSLDANHGETTRLILAP 335
>ref|NP_439666.1| (NC_000907) 43 kD tail protein [Haemophilus influenzae Rd]
Length = 420
Score = 66.2 bits (160), Expect = 6e-010
Identities = 48/188 (25%), Positives = 92/188 (48%), Gaps = 10/188 (5%)
Query: 68 TQDPDEVTLRINGLNFRFWRAVTITQHLDAVS-TVSLHAPFDPNDEQSRDAFRPYSYQPV 126
T+ +++ L +NG W+++ + + L+++S L P D+ S A + P+
Sbjct: 7 TETNNKIELYLNGKILSGWKSLNLQRSLESMSGRFDLGIAVRPEDDISVLA----AGSPL 62
Query: 127 SVDVGGERLFSGTLVNPQPRTAANERTVRASCYSTPGVLDDCTPPASAFPLQWDEATLQT 186
+ +GG+ + +G L + R + N++T+ S L DC +++ Q+ T +
Sbjct: 63 VLKMGGQTVITGYLDEIKQRVSGNDKTISVSGRDKTCDLVDCAIIHNSY--QFKNQTAKQ 120
Query: 187 IAADLCRPFGIQVLAPDGTGQTFERI---AVEPAEKVMAVIARLAAQRNLVVRSDEQGRL 243
IA +C+PFGI V+ + ERI VEP E ++++A + ++V SD G L
Sbjct: 121 IAEAICKPFGISVVWQVQAPEANERIPVWQVEPGETAFDNLSKIARHKGVLVTSDVDGNL 180
Query: 244 VLLRPDTR 251
+ P +
Sbjct: 181 LFTEPSNK 188
>ref|NP_050648.1| (NC_000929) putative tail protein [Enterobacteria phage Mu]
sp|P08558|VPP_BPMU 43 kDa tail protein (gpP)
pir||ZPBPMU gene P protein - phage Mu
emb|CAA29956.1| (X06796) 43 kDa tail protein [Enterobacteria phage Mu]
gb|AAF01122.1|AF083977_41 (AF083977) P [Enterobacteria phage Mu]
Length = 379
Score = 64.7 bits (156), Expect = 2e-009
Identities = 72/310 (23%), Positives = 129/310 (41%), Gaps = 30/310 (9%)
Query: 74 VTLRINGLNFRFWRAVTITQHLDAVSTV-SLHAPFDPNDEQSRDAFRPYSYQPVSVDVGG 132
VTLR +G F W +V++T+ +++V+ L P + S A + ++++GG
Sbjct: 5 VTLRADGRLFTGWTSVSVTRSIESVAGYFELGVNVPPGTDLSGLA----PGKKFTLEIGG 60
Query: 133 ERLFSGTLVNPQPRTAANERTVRASCYSTPGVLDDCTPPASAFPLQWDEATLQTIAADLC 192
+ + +G + + + + A+ + + L DC S QW TL+ IA DLC
Sbjct: 61 QIVCTGYIDSRRRQMTADSMKITVAGRDKTADLIDCAAVYSGG--QWKNRTLEQIARDLC 118
Query: 193 RPFGIQV---LAPDGTGQTFERIAVEPAEKVMAVIARLAAQRNLVVRSDEQGRLVLLRPD 249
P+G+ V L+ + F ++ +E V + R + R +++ S+ G LV R
Sbjct: 119 APYGVTVRWELSDKESSAAFPGFTLDHSETVYEALVRASRARGVLMTSNAAGELVFSR-- 176
Query: 250 TRGEPVAEFIEGQQPPISVTPTFGNQDYYSHVTGIT----PTIVGLEGPQATVRN-PHLE 304
E + G+ +T F +D+ + T G EG ++ +
Sbjct: 177 AASTATDELVLGEN---LLTLDF-EEDFRDRFSEYTVKGYARANGAEGDDIDAKSIVSRK 232
Query: 305 GV--------LRPYVYNADDMAEADLVQA-VQSKAGRMFAQSATYDVPVPTWRNANGDLW 355
G RP + AD A QA + R A+S T++ + W +G LW
Sbjct: 233 GTATDSDVTRYRPMIIIADSKITAKDAQARALREQRRRLAKSITFEAEIDGWTRKDGQLW 292
Query: 356 RVGDFVILEA 365
V ++A
Sbjct: 293 MPNLLVTIDA 302
>ref|NP_284542.1| (NC_003116) hypothetical protein [Neisseria meningitidis Z2491]
pir||C81809 hypothetical protein NMA1830 [imported] - Neisseria meningitidis
(group A strain Z2491)
emb|CAB72001.1| (AJ391256) hypothetical protein [Neisseria meningitidis]
emb|CAB85055.1| (AL162757) hypothetical protein [Neisseria meningitidis Z2491]
Length = 379
Score = 62.4 bits (150), Expect = 9e-009
Identities = 82/313 (26%), Positives = 122/313 (38%), Gaps = 34/313 (10%)
Query: 71 PDE-VTLRINGLNFRFWRAVTITQHLDAVS---TVSLHAPFDPNDEQSRDAFRPYSYQPV 126
PD VTL ING W I L + +V L P D + R + V
Sbjct: 4 PDNTVTLLINGKTHGQWTNYDIVSDLLTPADDFSVMLGRPVDAKPDAVRAGDK------V 57
Query: 127 SVDVGGERLFSGTLVNPQPRTAANERTVRASCYSTPGVLDDCTPPASAFPLQWDEATLQT 186
V VGG+ + SG + Q T +T+ GVL DC+ P + L
Sbjct: 58 EVRVGGDTVLSGRIDRVQTVTEKGGKTLTIQGRDDAGVLLDCSAPL----FNAQDMDLNQ 113
Query: 187 IAADLCRPFGIQVLAPDG--TGQTFERIAVEPAEKVMAVIARLAAQRNLVVRSDEQGRLV 244
I + +P G+ + D T +T ++ +EP + + A L + G LV
Sbjct: 114 IIEKIVKPLGLAKIRIDAAKTDKT-HKVQIEPGSRAWDALLEYAEANGLWPWLEPDGTLV 172
Query: 245 LLRPDTRGEPVAEFI---EGQQPPISVTPTFGNQDY---YSHVTGITPTIVGLEGPQATV 298
+ PD PVAE + GQ I N+D YS VT + + G +AT
Sbjct: 173 VGGPDYTAAPVAELVLRTNGQNNNIKRLEV--NRDMAARYSEVTVLAQSHSGKNNIKATA 230
Query: 299 RNPHLEGVLRPYVYNADDMAEADLVQAVQSKAGRMFAQS----ATYDVPVPTWRNANGDL 354
++ ++ + RP + D+ D Q KA + A S T V R +G L
Sbjct: 231 KDESVK-LHRPLIVTEPDI---DSQAQAQRKAKKRLADSRLEGLTITATVQGHRTDDGTL 286
Query: 355 WRVGDFV-ILEAP 366
W+ G + +L P
Sbjct: 287 WQPGQRINVLSEP 299
>ref|NP_599048.1| (NC_003444) tail protein [Shigella flexneri bacteriophage V]
gb|AAL89425.1| (U82619) tail protein [Shigella flexneri bacteriophage V]
Length = 359
Score = 57.8 bits (138), Expect = 2e-007
Identities = 80/354 (22%), Positives = 134/354 (37%), Gaps = 33/354 (9%)
Query: 72 DEVTLRINGLNFRFWRAVTITQHLDAVS---TVSLHAPFDPNDEQSRDAFRPYSYQPVSV 128
D VTLR+NG + W +V I ++ ++ +V + + ++ + R + V V
Sbjct: 3 DNVTLRVNGREWNGWTSVRIGAGIERLARDFSVDITRQWPGDEGITTLQPRIKNGSKVEV 62
Query: 129 DVGGERLFSGTLVNPQPRTAANERTVRASCYSTPGVLDDCTPPASAFPLQWDEATLQTIA 188
+G E + +G + R A + + S L DC +A P Q++ +L IA
Sbjct: 63 LIGDELVITGWVEATPVRYDARSVSTGIAGRSLTADLIDC----AAEPTQFNGRSLVQIA 118
Query: 189 ADLCRPFGIQVLAPDGTGQTFERIAVEPAEKVMAVIARLAAQRNLVVRSDEQGRLVLLR- 247
L PFGI+V+ + + E V+ VI ++ Q+ + D GRLV+
Sbjct: 119 QALAAPFGIEVVNNGAPSGVIPDVQPDHGETVIEVINKILGQQQALAYDDPHGRLVIGGI 178
Query: 248 PDTRGEP---VAEFIEGQQPPISVTPTF----------GNQDYYSHVTGITPTIVGLEGP 294
TR + E I S+ F GN D + T
Sbjct: 179 GSTRAHTALVLGENILSCDTEKSIRERFSVYQVAGQRAGNDDDFGEAT--------TTAL 230
Query: 295 QATVRNPHLEGVLRPYVYNADDMAEADLVQAVQSKAGRMFAQSATYDVPVPTWRNANGDL 354
+A + + Y+ A + +A + A++ V WR NG L
Sbjct: 231 RARTEDAFIARYRPMYIRQTGQATGAGCIARADFEARQRAARTDETTYVVQGWRQGNGTL 290
Query: 355 WRVGDFVILEAPGAQVYRRTLMQIKTVRFSATTTERSAVLELIIPGSLSGQLPE 408
W+ VI+ P + T + + V F T +++ L I G LPE
Sbjct: 291 WQPNQRVIVFDPVCG-FDNTELLVSEVTF---TQDQNGTLTEIRVGPPDAYLPE 340
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.317 0.134 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 243,169,145
Number of Sequences: 919193
Number of extensions: 10197789
Number of successful extensions: 21519
Number of sequences better than 5.0e-02: 6
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 21510
Number of HSP's gapped (non-prelim): 9
length of query: 417
length of database: 288,098,425
effective HSP length: 123
effective length of query: 294
effective length of database: 175,037,686
effective search space: 51461079684
effective search space used: 51461079684
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 92 (40.0 bits)
Query= orf19 [5330-5545]
(214 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_267614.1| (NC_002662) UNKNOWN PROTEIN [Lactococcus l... 41 0.007
emb|CAC02002.2| (AL390114) possible l6202.3 [Leishmania major] 40 0.016
gb|AAD39531.2|AF149108_1 (AF149108) outer membrane protein ... 39 0.027
ref|NP_523104.1| (NC_003296) PUTATIVE HEMAGGLUTININ-RELATED... 39 0.027
ref|NP_519008.1| (NC_003295) PROBABLE HEMAGGLUTININ-RELATED... 39 0.046
ref|NP_522101.1| (NC_003296) PROBABLE HEMAGGLUTININ-RELATED... 39 0.046
>ref|NP_267614.1| (NC_002662) UNKNOWN PROTEIN [Lactococcus lactis subsp. lactis]
gb|AAK05556.1|AE006376_2 (AE006376) UNKNOWN PROTEIN [Lactococcus lactis subsp. lactis]
Length = 1441
Score = 41.2 bits (95), Expect = 0.007
Identities = 63/224 (28%), Positives = 81/224 (36%), Gaps = 41/224 (18%)
Query: 23 NPGGGATVDAYHAQPAGDDCHPLPTDTAVL--------------VETPRSNNYAAVG-VV 67
NPGG TV + DD L ++ V+ TP N G VV
Sbjct: 1064 NPGGSVTVPGPDGKTGTDDDTTLNPNSPVVPGDNGSVTLPGGGTASTPNGNITLPGGTVV 1123
Query: 68 DPNNRQTAGPGERRVYSRNADGE-----QVAEVFLHDDGQVR-----VSNDEGFVDLAAD 117
DP+ G+ N DG Q + DDG+++ + D G V L
Sbjct: 1124 DPDGTIHLPGGD----IVNPDGTITLPGQDGKTGTGDDGKIKPNGPIIPGDNGSVTLPGG 1179
Query: 118 GTIQASNGQASATLQGTSITLDDGA----GGSISINGGVITLTG----TAIRLIGPVDAN 169
GT+ G + + G S+ DG GG I G ITL G T G V N
Sbjct: 1180 GTVTTPGG--TINVPGGSVVDPDGTVHLPGGDIVNPDGTITLPGQDGKTGTGDDGKVKPN 1237
Query: 170 GATISEA-GQITDADGLSVHGHNHTQANDSDNDVQQPTSTAQIP 212
G +IS G IT G +V T N V P T +P
Sbjct: 1238 GPSISNPDGSITLPGGGTVTTPGGT-INVPGGSVVDPDGTVHLP 1280
>emb|CAC02002.2| (AL390114) possible l6202.3 [Leishmania major]
Length = 1376
Score = 40.0 bits (92), Expect = 0.016
Identities = 48/194 (24%), Positives = 72/194 (36%), Gaps = 22/194 (11%)
Query: 23 NPGGGATVDAYHAQPAGDDCHPLPTDTAVLVETPRSNNYAAVGVVDPNNRQTAGPGERRV 82
+P G + HA AG+ P P A P N + + + +RQ A G R
Sbjct: 832 HPSGSGVRTSSHALAAGEHSQP-PAQRAAAAAAPPLANTSGASLQELQDRQRASSGSRSS 890
Query: 83 YSRNADGEQVAEVFLHDDGQVRVSN-------------DEGFVDLAADGTIQA--SNGQA 127
S+ E+ A + L VR SN D D T++A + +A
Sbjct: 891 SSKEGSEERPA-LHLSGVSSVRPSNGQEERLHSLVEAGDASTASADGDNTVEAFTARPEA 949
Query: 128 SATLQGTSITLDDGAGGSISINGGVITLTGTAIRLIGPVDANGATISEAGQITDADGLSV 187
A L TS DG GS S+ T T T +L P + S G ++ +
Sbjct: 950 VARLPRTS----DGTNGSASVPLFATTRTETVEQLSAPAH-DSTQASAVGNVSHCTAIRE 1004
Query: 188 HGHNHTQANDSDND 201
+ ++ SD+D
Sbjct: 1005 FTTITSSSSLSDSD 1018
>gb|AAD39531.2|AF149108_1 (AF149108) outer membrane protein A [Rickettsia australis]
Length = 2106
Score = 39.3 bits (90), Expect = 0.027
Identities = 30/82 (36%), Positives = 40/82 (48%), Gaps = 7/82 (8%)
Query: 110 GFVDLAADGTIQASNGQASA--TLQGTSITLDDGAG---GSISINGGVI--TLTGTAIRL 162
G V+ AADG + A+NG A A T + TL GAG G+I +NGG I +
Sbjct: 265 GAVNFAADGALTANNGIAGAVTTANNNTGTLTVGAGDVTGAIGVNGGNILKQVLFNGASN 324
Query: 163 IGPVDANGATISEAGQITDADG 184
+ +DA TI+ A A G
Sbjct: 325 VATIDATNVTINNAAANVTAAG 346
>ref|NP_523104.1| (NC_003296) PUTATIVE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD18696.1| (AL646085) PUTATIVE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 2691
Score = 39.3 bits (90), Expect = 0.027
Identities = 35/114 (30%), Positives = 49/114 (42%), Gaps = 18/114 (15%)
Query: 114 LAADGTIQA-----SNGQASATLQGT-----SITLDDGAG-----GSISINGGVITLTGT 158
LA G + A +NGQA TL GT +T AG GS+ + G + LTGT
Sbjct: 611 LATSGAVLAGGNVTANGQAGTTLSGTVYAAKGVTAQSSAGATSATGSVVAHSGKVALTGT 670
Query: 159 AIRLIGPVDANGATISEAGQ-ITDADGLSVHGHNHTQANDSDNDVQQPTSTAQI 211
++ + G + G T A Q DG + T + D+ P STA +
Sbjct: 671 SVAVSGSTQSGGDTALRATQGDATIDGQAAALGKLTVT--ASQDITGPGSTASV 722
>ref|NP_519008.1| (NC_003295) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD14589.1| (AL646061) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 3501
Score = 38.5 bits (88), Expect = 0.046
Identities = 28/77 (36%), Positives = 37/77 (47%), Gaps = 3/77 (3%)
Query: 110 GFVDLAADGTIQASNGQASATLQGTS--ITLDDGAGGSISINGGVITLTGTAIRLIGPVD 167
G +D+ A G + G+ SA Q T +LD+ AGG + G IT G G V
Sbjct: 742 GALDVQA-GAVSNQGGKLSAQSQATLNVASLDNSAGGYVGAQGVAITDQGALNNAGGTVA 800
Query: 168 ANGATISEAGQITDADG 184
A+GA AG I +A G
Sbjct: 801 ASGALTVSAGAIANAGG 817
>ref|NP_522101.1| (NC_003296) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD17691.1| (AL646079) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 3552
Score = 38.5 bits (88), Expect = 0.046
Identities = 28/77 (36%), Positives = 37/77 (47%), Gaps = 3/77 (3%)
Query: 110 GFVDLAADGTIQASNGQASATLQGTS--ITLDDGAGGSISINGGVITLTGTAIRLIGPVD 167
G +D+ A G + G+ SA Q T +LD+ AGG + G IT G G V
Sbjct: 741 GALDVQA-GAVSNQGGKLSAQSQATLNVASLDNSAGGYVGAQGVAITDQGALNNAGGTVA 799
Query: 168 ANGATISEAGQITDADG 184
A+GA AG I +A G
Sbjct: 800 ASGALTVSAGAIANAGG 816
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.312 0.131 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 129,490,373
Number of Sequences: 919193
Number of extensions: 5599259
Number of successful extensions: 12889
Number of sequences better than 5.0e-02: 6
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 7
Number of HSP's that attempted gapping in prelim test: 12856
Number of HSP's gapped (non-prelim): 36
length of query: 214
length of database: 288,098,425
effective HSP length: 115
effective length of query: 99
effective length of database: 182,391,230
effective search space: 18056731770
effective search space used: 18056731770
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 88 (38.5 bits)
Query= orf20 [5549-5685]
(135 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_518999.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstoni... 67 4e-011
gb|AAB05794.1| (U62615) ORF 3 [Synechococcus sp. PCC 7942] 37 0.033
>ref|NP_518999.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
emb|CAD14580.1| (AL646061) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
Length = 132
Score = 66.6 bits (161), Expect = 4e-011
Identities = 38/125 (30%), Positives = 59/125 (46%), Gaps = 3/125 (2%)
Query: 3 LAFSQNIRNQRLSVVSTAADAGDGAALIRIYDGTRPPTGGQVTN--LLAQLEMSDPAFNA 60
+ S +RN RL V+ A DAG L+R+Y RP G + LLA++ + P +
Sbjct: 2 ITLSVPVRNSRLVVIGQALDAGTAGGLLRLYSAPRPDVGQALAEQVLLAEVRLPQPCMGS 61
Query: 61 PANGSMTARAITPEGSTPIGGTATWFRIVDSEGNFVLDGDVGLNGSNSEIELSDVNILSN 120
G + I + G W R+ DS+G +V D D GL S +E+EL + + +
Sbjct: 62 LEGGRLVFAPIG-QALCRRSGIVAWARLCDSDGRWVADLDAGLPDSGAEVELPKLQVFAG 120
Query: 121 QEVRI 125
V +
Sbjct: 121 GAVNV 125
>gb|AAB05794.1| (U62615) ORF 3 [Synechococcus sp. PCC 7942]
Length = 136
Score = 37.0 bits (84), Expect = 0.033
Identities = 36/135 (26%), Positives = 61/135 (44%), Gaps = 14/135 (10%)
Query: 1 MTLAFSQNIRNQRLSVVSTAADAGDGAALIRIYDGTRPPTGGQVTNLLAQLEMSDPAFNA 60
MT+A +RN ++ A G ALI +Y G P + Q+T + + ++ F
Sbjct: 1 MTVALGLELRNSLCELIRDRASNG---ALI-LYAGYDPHSD-QLTEEIVRFPLAAQPFRP 55
Query: 61 PANGSMTARAITPEGSTPIGGTATWFRIVDS------EGNFVLDGDVG--LNGSNSEIEL 112
+G+M A I P + GG ++ I S +L G +G N ++++ L
Sbjct: 56 VESGTMAAAPIAPARCSR-GGALRYWEIRSSPLHDTHPDQMLLRGSIGDITNPGDADLVL 114
Query: 113 SDVNILSNQEVRIAT 127
V +L NQ + IA+
Sbjct: 115 DTVAVLQNQAIAIAS 129
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.312 0.131 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 77,781,354
Number of Sequences: 919193
Number of extensions: 3273800
Number of successful extensions: 5667
Number of sequences better than 5.0e-02: 2
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 5665
Number of HSP's gapped (non-prelim): 2
length of query: 135
length of database: 288,098,425
effective HSP length: 111
effective length of query: 24
effective length of database: 186,068,002
effective search space: 4465632048
effective search space used: 4465632048
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 83 (36.6 bits)
Query= orf21 [5686-5908]
(221 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_404848.1| (NC_003143) putative bacteriophage protein... 40 0.017
>ref|NP_404848.1| (NC_003143) putative bacteriophage protein GP46 [Yersinia pestis]
emb|CAC90083.1| (AJ414147) putative bacteriophage protein GP46 [Yersinia pestis]
Length = 151
Score = 40.0 bits (92), Expect = 0.017
Identities = 28/85 (32%), Positives = 39/85 (44%), Gaps = 7/85 (8%)
Query: 108 NGDLYRTDGLETAVYLSLYG---GNPEDNGQDANRLGWWGNANQEEPARQMVSRFQHLVE 164
NG L D L+TA+ LSL+ +D +R GWWG++ A + L+
Sbjct: 22 NGGLLEGDDLQTAILLSLFTDRLARADDAIDSPDRRGWWGDSG----AASAIGSRLWLLR 77
Query: 165 GIPLTSGNLQRLEDAATADLEWLSE 189
LT+ + ED AT L WL E
Sbjct: 78 REKLTTQVAIKAEDYATEALAWLIE 102
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.315 0.134 5.49e-269
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 125,646,374
Number of Sequences: 919193
Number of extensions: 5426027
Number of successful extensions: 9348
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 9348
Number of HSP's gapped (non-prelim): 1
length of query: 221
length of database: 288,098,425
effective HSP length: 116
effective length of query: 105
effective length of database: 181,472,037
effective search space: 19054563885
effective search space used: 19054563885
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 88 (38.5 bits)
Query= orf22 [5911-6307]
(395 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_599051.1| (NC_003444) tail protein [Shigella flexner... 60 4e-008
ref|NP_415670.1| (NC_000913) orf, hypothetical protein [Esc... 59 7e-008
ref|NP_313014.1| (NC_002695) hypothetical protein [Escheric... 59 7e-008
dbj|BAA35978.1| (D90749) Hypothetical protein HI1520 [Esche... 59 7e-008
ref|NP_284072.1| (NC_003116) hypothetical protein NMA1323 [... 58 2e-007
ref|NP_439669.1| (NC_000907) conserved hypothetical protein... 52 8e-006
ref|NP_543104.1| (NC_003356) hypothetical protein [Bacterio... 50 4e-005
ref|NP_050651.1| (NC_000929) Hypothetical protein [Enteroba... 46 6e-004
>ref|NP_599051.1| (NC_003444) tail protein [Shigella flexneri bacteriophage V]
gb|AAL89428.1| (U82619) tail protein [Shigella flexneri bacteriophage V]
Length = 352
Score = 60.1 bits (144), Expect = 4e-008
Identities = 51/180 (28%), Positives = 75/180 (41%), Gaps = 9/180 (5%)
Query: 92 PQRAAVQAQLDLSVNVVTQGGVLASGEQLQGPNGIIYITTGDTLLNSQSVTVRXXXXXXX 151
P++ A A + V+ G + +G +Q + + Y T D + + + V
Sbjct: 82 PRKGATAASGYMRWEGVSDGLKVTAGSVIQRDDLVQYTATADAISSGGVLRVPIACSSAG 141
Query: 152 XXXXXXXXXXNLANGAVLNFVQPLAALQPMGTVSATVRTGIDAETTEAYRARIDERTQRR 211
N +G L V P+ L P V+ T+ G D E E +RAR+ ER
Sbjct: 142 AVG-------NADDGTSLILVTPVNGL-PSSGVADTLTGGFDTEELETWRARVIERYYWT 193
Query: 212 PLGGAPVDYPLWAETSSAVLNAYPYT-GNTPGTVAVYIESSTEVDGIPTNDQLLEARTAI 270
P GGA DY +WA+ + A+ Y GTV V I SS ++ IP AR I
Sbjct: 194 PQGGADGDYVVWAKEVPGITRAWTYRHWMGTGTVGVMIASSDLINPIPEESTETAARQHI 253
>ref|NP_415670.1| (NC_000913) orf, hypothetical protein [Escherichia coli K12]
sp|P75981|YMFP_ECOLI Hypothetical protein ymfP in lambdoid prophage e14 region
pir||E64860 hypothetical protein b1152 - Escherichia coli (strain K-12)
gb|AAC74236.1| (AE000214) orf, hypothetical protein [Escherichia coli K12]
Length = 263
Score = 59.3 bits (142), Expect = 7e-008
Identities = 49/164 (29%), Positives = 69/164 (41%), Gaps = 9/164 (5%)
Query: 108 VTQGGVLASGEQLQGPNGIIYITTGDTLLNSQSVTVRXXXXXXXXXXXXXXXXXNLANGA 167
V+ G + +G +Q + + Y TT D + + V N +G
Sbjct: 9 VSDGLKVTAGSVIQRDDLVQYTTTDDATSSGGVLRVPIACSSAGAVG-------NADDGT 61
Query: 168 VLNFVQPLAALQPMGTVSATVRTGIDAETTEAYRARIDERTQRRPLGGAPVDYPLWAETS 227
L V P+ L P V+ T+ G D E E +RAR+ ER P GGA DY +WA+
Sbjct: 62 ALILVTPVNGL-PSSGVADTLTGGFDTEELETWRARVIERYYWTPQGGADGDYVVWAKEV 120
Query: 228 SAVLNAYPYTG-NTPGTVAVYIESSTEVDGIPTNDQLLEARTAI 270
+ A+ Y GTV V I SS ++ IP AR I
Sbjct: 121 PGITRAWTYRHLMGTGTVGVMIASSDLINPIPEESTETAARQHI 164
>ref|NP_313014.1| (NC_002695) hypothetical protein [Escherichia coli O157:H7]
dbj|BAB38410.1| (AP002567) hypothetical protein [Escherichia coli O157:H7]
Length = 361
Score = 59.3 bits (142), Expect = 7e-008
Identities = 68/306 (22%), Positives = 115/306 (37%), Gaps = 26/306 (8%)
Query: 20 ISSQLGQGAPIFAKAFTRVLARALSGVVVTLYRYGGFMFLQMFIRTASNQPVTINGQTLT 79
IS +L P + +A AL+G+ + + ++ Q+ A + L
Sbjct: 20 ISQRLPGSQPGVNETTLNAIAYALAGLSAQEHEHLAWISRQIIPTEADE------AELLK 73
Query: 80 PLAEWGRILGAGPQRAAVQAQLDLSVNVVTQGGVLASGEQLQGPNGIIYITTGDTLLNSQ 139
A WG I ++ A +A + + + T G+ G LQ +G++Y TG
Sbjct: 74 HCAFWGVI-----RKPASRADGPVQLMLTTDAGI-TEGVLLQRSDGVVYRITG------- 120
Query: 140 SVTVRXXXXXXXXXXXXXXXXXNLANGAVLNFVQPLAALQPMGTVSATVRTG-IDAETTE 198
S T + N G L+F+ P A + TV+ T TG D ET
Sbjct: 121 SATGKAGTLNVNVEAESAGRAGNTPTGTRLSFITPQAGINQTATVTGTGLTGGADVETVP 180
Query: 199 AYRARIDERTQRRPLGGAPVDYPLWAETSSAVLNAY-PYTGNTPGTVAVYIESSTEVDGI 257
+R+ R Q P GG D+ WA V A+ G+V V D
Sbjct: 181 ELLSRLVFRVQNPPSGGTQYDFERWAREVPGVTRAWCKPEWPEAGSVGVTFVQDNNPDIF 240
Query: 258 PTNDQLLEARTAI---NQSPNGRRNRAPVGTLVNTFPITRE--TFNVSVSGLNVDNPADV 312
P + I + G+ P+G ++ F +T + F + + +N A V
Sbjct: 241 PGEGDVKRVADYIRSHDDPATGQPVGQPLGPTISVFKLTNKPVAFEIRIVPKTPENQAAV 300
Query: 313 RRDISN 318
++ +++
Sbjct: 301 KQALTD 306
>dbj|BAA35978.1| (D90749) Hypothetical protein HI1520 [Escherichia coli]
dbj|BAA35990.1| (D90750) Hypothetical protein HI1520 [Escherichia coli]
Length = 276
Score = 59.3 bits (142), Expect = 7e-008
Identities = 49/164 (29%), Positives = 69/164 (41%), Gaps = 9/164 (5%)
Query: 108 VTQGGVLASGEQLQGPNGIIYITTGDTLLNSQSVTVRXXXXXXXXXXXXXXXXXNLANGA 167
V+ G + +G +Q + + Y TT D + + V N +G
Sbjct: 22 VSDGLKVTAGSVIQRDDLVQYTTTDDATSSGGVLRVPIACSSAGAVG-------NADDGT 74
Query: 168 VLNFVQPLAALQPMGTVSATVRTGIDAETTEAYRARIDERTQRRPLGGAPVDYPLWAETS 227
L V P+ L P V+ T+ G D E E +RAR+ ER P GGA DY +WA+
Sbjct: 75 ALILVTPVNGL-PSSGVADTLTGGFDTEELETWRARVIERYYWTPQGGADGDYVVWAKEV 133
Query: 228 SAVLNAYPYTG-NTPGTVAVYIESSTEVDGIPTNDQLLEARTAI 270
+ A+ Y GTV V I SS ++ IP AR I
Sbjct: 134 PGITRAWTYRHLMGTGTVGVMIASSDLINPIPEESTETAARQHI 177
>ref|NP_284072.1| (NC_003116) hypothetical protein NMA1323 [Neisseria meningitidis
Z2491]
pir||C81901 hypothetical protein NMA1323 [imported] - Neisseria meningitidis
(group A strain Z2491)
emb|CAB84574.1| (AL162755) hypothetical protein NMA1323 [Neisseria meningitidis
Z2491]
Length = 351
Score = 57.8 bits (138), Expect = 2e-007
Identities = 51/174 (29%), Positives = 82/174 (46%), Gaps = 14/174 (8%)
Query: 187 TVRTGIDAETTEAYRARIDERTQRRPLGGAPVDYPLWAETSSAVLNAYPY-TGNTPGTVA 245
TV+ G D E+ + AR+ E +R P GG DY WA + V +AY Y GTV
Sbjct: 162 TVQGGTDRESDASLLARLLEIIRRPPAGGNRYDYKNWALSVDGVTSAYVYPLRRGLGTVD 221
Query: 246 VYIESSTEVDGIPTNDQLLEARTAINQ-SPNGRRNRAPVGTLVNTFPITRETFNVSVSGL 304
+ I T DG+P+ + + + I++ P +N + V P+ T V + G+
Sbjct: 222 IAI---TSADGVPSEETVRRVQAYIDEMRPVTAKNALVLKPTVTAVPV---TVQVKLDGI 275
Query: 305 NVDNPADVRRDISNAVAEYFMQREPFITGVTTGVRRDQITQIAVGGVVEGIVTA 358
++D + +R I A+ EYF + I G V + + V GV++ +TA
Sbjct: 276 DLD---EAKRRIRTALKEYF---DTLIPGDGLTVSQIEAAISNVDGVIDLRLTA 323
>ref|NP_439669.1| (NC_000907) conserved hypothetical protein [Haemophilus influenzae
Rd]
sp|P44240|VG47_HAEIN Mu-like prophage FluMu protein gp47
pir||G64034 hypothetical protein HI1520 - Haemophilus influenzae (strain Rd
KW20)
gb|AAC23162.1| (U32827) conserved hypothetical protein [Haemophilus influenzae Rd]
Length = 355
Score = 52.4 bits (124), Expect = 8e-006
Identities = 52/209 (24%), Positives = 75/209 (35%), Gaps = 13/209 (6%)
Query: 123 PNGIIY--ITTGDTLLNSQSVTVRXXXXXXXXXXXXXXXXXNLANGAVLNFVQPLAALQP 180
P G ++ TG T + +Q V+ NL G L+ + L P
Sbjct: 99 PAGTVFEDTNTGLTFITTQETVVKAGTADIAVKCETTGVEGNLKAGTSLSLTSAILGLLP 158
Query: 181 MGTVSATVRTGIDAETTEAYRARIDERTQRRPLGGAPVDYPLWAETSSAVLNAY---PYT 237
TV + G D E+ AR+ R Q P GGA DY WA V A+ Y
Sbjct: 159 TATVKV-MSGGADIESLSRLLARLIYRVQYPPAGGASHDYIRWATEVPGVTRAWCFERYY 217
Query: 238 GNTPGTVAVYIESSTEVDGIPTNDQLLEARTAINQSPN---GRRNRAPVGTLVNTFPITR 294
G GTV V D +PT + + R I N G+ P + F
Sbjct: 218 GG--GTVGVAFACDEREDILPTPEDIARVRAYIEGHKNEVTGQFEGMPANVELYVFAPQF 275
Query: 295 ETFNVSVSGLNVDNPADVRRDISNAVAEY 323
+ N + N +R+ + ++A Y
Sbjct: 276 QAVNFKIR--LAPNTPTLRQAVRKSLAAY 302
>ref|NP_543104.1| (NC_003356) hypothetical protein [Bacteriophage P27]
emb|CAC83570.1| (AJ298298) hypothetical protein [Bacteriophage P27]
Length = 360
Score = 50.1 bits (118), Expect = 4e-005
Identities = 42/161 (26%), Positives = 68/161 (42%), Gaps = 9/161 (5%)
Query: 92 PQRAAVQAQLDLSVNVVTQGGVLASGEQLQGPNGIIYITTGDTLLNSQSVTVRXXXXXXX 151
P++ AV A+ + + V VL +G Q+Q + + + TT +VT
Sbjct: 82 PRKDAVAARGFVRWDGVEGTPVLPAGTQIQRDDQVTFTTTA-------TVTAADGLLRVP 134
Query: 152 XXXXXXXXXXNLANGAVLNFVQPLAALQPMGTVSATVRTGIDAETTEAYRARIDERTQRR 211
N +G + P++ L G T+ G D E E +RAR+ ER
Sbjct: 135 VVADEPGSAGNTDDGIAMQLGTPVSGLPSTGYAD-TITDGEDIENLEIWRARVMERYYYI 193
Query: 212 PLGGAPVDYPLWAETSSAVLNAYPYT-GNTPGTVAVYIESS 251
P GGA DY +WA+ + A+ + GTV V + ++
Sbjct: 194 PQGGADPDYVIWAKEVQGINRAWTFRHWKGIGTVGVMVATN 234
>ref|NP_050651.1| (NC_000929) Hypothetical protein [Enterobacteria phage Mu]
sp|Q9T1V2|VG47_BPMU Protein gp47
gb|AAF01125.1|AF083977_44 (AF083977) gp47 [Enterobacteria phage Mu]
Length = 360
Score = 46.2 bits (108), Expect = 6e-004
Identities = 54/247 (21%), Positives = 90/247 (35%), Gaps = 21/247 (8%)
Query: 2 VAPTTPTTQSISDNIVAQISSQLGQGAPIFAKAFTRVLARALSGVVVTLYRYGGFMFLQM 61
+A + PT S+ I +L P + +A A +G+ + + ++ Q+
Sbjct: 1 MAYSPPTLSSLIARTEQNIEQRLPGSWPQAREKTLSAIAYAQAGLAAGCHEHISWVGRQI 60
Query: 62 FIRTASNQPVTINGQTLTPLAEWGRILGAGPQRA-AVQAQLDLSVNVVTQGGVLASGEQL 120
TA L E R G ++A A L ++ + T + +G +
Sbjct: 61 IPSTADEDE----------LLEHCRFWGVRRKQATAASGPLTVTTSAAT---TIPAGTRW 107
Query: 121 QGPNGIIYITTGDTLLNSQSVTVRXXXXXXXXXXXXXXXXXNLANGAVLNFVQPLAALQP 180
Q +G++Y + DT+ V R N +L + P+A +
Sbjct: 108 QRADGVVY-SLADTI-----VIDRAGTTEITVTALAAGEAGNTGENTLLTLITPVACVVS 161
Query: 181 MGTVSATVRTGIDAETTEAYRARIDERTQRRPLGGAPVDYPLWAETSSAVLNAYPY-TGN 239
G D E+ +R++ R Q P GG DY WA S V A+ + T
Sbjct: 162 DAITVKGFSGGADIESAAELLSRLEYRVQYPPFGGNQFDYVRWAREVSGVTRAWCFPTWK 221
Query: 240 TPGTVAV 246
GTV V
Sbjct: 222 GGGTVGV 228
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.316 0.132 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 201,145,509
Number of Sequences: 919193
Number of extensions: 7866724
Number of successful extensions: 26260
Number of sequences better than 5.0e-02: 9
Number of HSP's better than 0.1 without gapping: 4
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 26255
Number of HSP's gapped (non-prelim): 9
length of query: 395
length of database: 288,098,425
effective HSP length: 122
effective length of query: 273
effective length of database: 175,956,879
effective search space: 48036227967
effective search space used: 48036227967
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 92 (40.0 bits)
Query= orf23 [6304-6545]
(240 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
dbj|BAB13413.1| (AB046807) KIAA1587 protein [Homo sapiens] 40 0.015
ref|XP_034014.1| (XM_034014) KIAA1587 protein [Homo sapiens... 40 0.015
>dbj|BAB13413.1| (AB046807) KIAA1587 protein [Homo sapiens]
Length = 991
Score = 40.4 bits (93), Expect = 0.015
Identities = 22/65 (33%), Positives = 34/65 (51%), Gaps = 2/65 (3%)
Query: 145 EPLAQCGEPTAQAGNSLQTPPTPQGRLLVNIIQTDTGQVEYAIPTDPAEFP--HIIYVGG 202
E L+ +PTA G+S PPTP G L ++ T T ++ ++P P E P ++ + G
Sbjct: 336 EGLSTSVQPTAGEGSSTSVPPTPGGGLSTSVPPTATEELSTSVPPTPGEGPSTSVLPIPG 395
Query: 203 EEFGT 207
E T
Sbjct: 396 EGLST 400
>ref|XP_034014.1| (XM_034014) KIAA1587 protein [Homo sapiens]
gb|AAM08354.1| (AF490507) hepatocellular carcinoma-associated protein HCA1 [Homo
sapiens]
Length = 957
Score = 40.4 bits (93), Expect = 0.015
Identities = 22/65 (33%), Positives = 34/65 (51%), Gaps = 2/65 (3%)
Query: 145 EPLAQCGEPTAQAGNSLQTPPTPQGRLLVNIIQTDTGQVEYAIPTDPAEFP--HIIYVGG 202
E L+ +PTA G+S PPTP G L ++ T T ++ ++P P E P ++ + G
Sbjct: 302 EGLSTSVQPTAGEGSSTSVPPTPGGGLSTSVPPTATEELSTSVPPTPGEGPSTSVLPIPG 361
Query: 203 EEFGT 207
E T
Sbjct: 362 EGLST 366
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.320 0.140 6.04e-154
Gapped
Lambda K H
0.267 0.0410 5.40e-067
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 155,712,268
Number of Sequences: 919193
Number of extensions: 7318204
Number of successful extensions: 14969
Number of sequences better than 5.0e-02: 2
Number of HSP's better than 0.1 without gapping: 2
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 14955
Number of HSP's gapped (non-prelim): 14
length of query: 240
length of database: 288,098,425
effective HSP length: 117
effective length of query: 123
effective length of database: 180,552,844
effective search space: 22207999812
effective search space used: 22207999812
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 89 (38.9 bits)
Query= orf24 [6549-6994]
(444 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_519813.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstoni... 67 3e-010
ref|NP_520042.1| (NC_003295) PUTATIVE TAIL FIBER-RELATED PR... 65 2e-009
ref|NP_518238.1| (NC_003295) CONSERVED HYPOTHETICAL PROTEIN... 50 6e-005
ref|NP_298995.1| (NC_002488) phage-related tail fiber prote... 47 4e-004
ref|NP_488746.1| (NC_003272) unknown protein [Nostoc sp. PC... 43 0.006
gb|AAK77227.1| (AY040839) endoglucanase C [Aspergillus niger] 42 0.013
ref|NP_105312.1| (NC_002678) microcystin dependent protein ... 41 0.022
>ref|NP_519813.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
emb|CAD15394.1| (AL646065) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
Length = 225
Score = 67.4 bits (163), Expect = 3e-010
Identities = 33/46 (71%), Positives = 35/46 (75%)
Query: 104 WLLANGQAVSRTTYASLFALLGTTFGAGDGSTTFNLPADGAGRPVG 149
WL NG AVSRTTYA LF ++GTTFGAGDGSTTFNLP A P G
Sbjct: 80 WLKGNGAAVSRTTYARLFGVIGTTFGAGDGSTTFNLPELRAEFPRG 125
>ref|NP_520042.1| (NC_003295) PUTATIVE TAIL FIBER-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD15623.1| (AL646067) PUTATIVE TAIL FIBER-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 554
Score = 64.7 bits (156), Expect = 2e-009
Identities = 29/37 (78%), Positives = 33/37 (88%)
Query: 104 WLLANGQAVSRTTYASLFALLGTTFGAGDGSTTFNLP 140
WL ANG AVSRTTYA+L+A +GTTFGAGDG+ TFNLP
Sbjct: 415 WLKANGAAVSRTTYAALYAEIGTTFGAGDGAATFNLP 451
>ref|NP_518238.1| (NC_003295) CONSERVED HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
emb|CAD13645.1| (AL646057) CONSERVED HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
Length = 177
Score = 49.7 bits (117), Expect = 6e-005
Identities = 32/64 (50%), Positives = 37/64 (57%), Gaps = 7/64 (10%)
Query: 101 APQ-WLLANGQAVSRTTYASLFALLGTTFGAGDGSTTFNLPADGAGRPVGQQFVNLGTAT 159
APQ WLL GQ + ++ L+ALLGTT+G GDG TTF LP PVGQ G
Sbjct: 16 APQGWLLCQGQLIDISSNEILYALLGTTYG-GDGRTTFALPNLQGALPVGQ-----GQGP 69
Query: 160 GLPA 163
GL A
Sbjct: 70 GLTA 73
>ref|NP_298995.1| (NC_002488) phage-related tail fiber protein [Xylella fastidiosa
9a5c]
pir||C82649 phage-related tail fiber protein XF1706 [imported] - Xylella
fastidiosa (strain 9a5c)
gb|AAF84515.1|AE003994_14 (AE003994) phage-related tail fiber protein [Xylella fastidiosa
9a5c]
Length = 387
Score = 47.0 bits (110), Expect = 4e-004
Identities = 21/39 (53%), Positives = 28/39 (70%)
Query: 102 PQWLLANGQAVSRTTYASLFALLGTTFGAGDGSTTFNLP 140
P LL +G+AVSR Y LF + T++GAGDG +TFN+P
Sbjct: 151 PGTLLCDGRAVSRAMYPRLFEEINTSYGAGDGVSTFNIP 189
>ref|NP_488746.1| (NC_003272) unknown protein [Nostoc sp. PCC 7120]
dbj|BAB76405.1| (AP003597) ORF_ID:alr4706~unknown protein [Nostoc sp. PCC 7120]
Length = 101
Score = 43.1 bits (100), Expect = 0.006
Identities = 20/37 (54%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 104 WLLANGQAVSRTTYASLFALLGTTFGAGDGSTTFNLP 140
W L NGQ +S Y +++++LGTTFG GDG TF LP
Sbjct: 38 WALCNGQLLSIQEYTAVYSILGTTFG-GDGVQTFALP 73
>gb|AAK77227.1| (AY040839) endoglucanase C [Aspergillus niger]
Length = 857
Score = 42.0 bits (97), Expect = 0.013
Identities = 61/260 (23%), Positives = 95/260 (36%), Gaps = 60/260 (23%)
Query: 161 LPAYNSEARFARAPNGN--MYFVATGPSRMYRSTDNGLTWTDTNIGSGLPTTFDNPAIVV 218
+P R A PN N +YF A + +++STD G TW++ T F +
Sbjct: 141 MPGRGVGERLAVDPNDNSILYFGARSGNGLWKSTDYGETWSNVTAFKWTGTYFQD----- 195
Query: 219 GVNNYVYFIDRMNGDLRRSXXXXXXXXXXXXXXXLPSGIGAPFLASAPTTSGS-NPEILY 277
++ Y D P GI S +SGS P I
Sbjct: 196 --SSSTYTSD-------------------------PVGIAWVTFDSTSGSSGSPTPRIFV 228
Query: 278 FIDSALDTLYVSTNQGVNWTSLG---VGDGLP----------------ANGLGEMIGAPD 318
+ ++++VS + G WT + + LP +NG G G +
Sbjct: 229 GVVDTGESVFVSEDAGETWTWVSGEPMYGFLPHKGILSPSEHTLYISYSNGAGPYDGT-N 287
Query: 319 GYFYGIDIVGDQLYRLTPPSTTWTSTGLGEGLPSTISAPSMTVTPDGTLYFIDGTDRDRL 378
G + +I ++P S T T G G GL + P + ++ D + +
Sbjct: 288 GTVHKYNITSGVWTDISPTSMTDTYYGYG-GLAVDLQVPGTVMVAALNCWWPD----ELI 342
Query: 379 YRSTDRGVTWSSQFLGEGLP 398
+RSTD G TWS + G P
Sbjct: 343 WRSTDSGGTWSPIWAWNGYP 362
>ref|NP_105312.1| (NC_002678) microcystin dependent protein MdpB [Mesorhizobium loti]
dbj|BAB51098.1| (AP003004) microcystin dependent protein; MdpB [Mesorhizobium loti]
Length = 169
Score = 41.2 bits (95), Expect = 0.022
Identities = 23/48 (47%), Positives = 31/48 (63%), Gaps = 3/48 (6%)
Query: 104 WLLANGQAVSRTTYASLFALLGTTFGAGDGSTTFNLPADGAGR-PVGQ 150
W+ GQ + + Y +LF L+GTT+G GDG +TF LP D GR P+ Q
Sbjct: 21 WMFCEGQLLPISEYETLFQLIGTTYG-GDGQSTFALP-DLRGRVPIHQ 66
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.315 0.136 5.40e-067
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 280,063,815
Number of Sequences: 919193
Number of extensions: 13533936
Number of successful extensions: 20452
Number of sequences better than 5.0e-02: 7
Number of HSP's better than 0.1 without gapping: 3
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 20445
Number of HSP's gapped (non-prelim): 11
length of query: 444
length of database: 288,098,425
effective HSP length: 123
effective length of query: 321
effective length of database: 175,037,686
effective search space: 56187097206
effective search space used: 56187097206
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 92 (40.0 bits)
Query= orf25 [6995-7440]
(444 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_519813.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstoni... 67 5e-010
ref|NP_251770.1| (NC_002516) hypothetical protein [Pseudomo... 60 4e-008
ref|NP_520042.1| (NC_003295) PUTATIVE TAIL FIBER-RELATED PR... 60 5e-008
ref|NP_298995.1| (NC_002488) phage-related tail fiber prote... 55 2e-006
ref|NP_347555.1| (NC_003030) Probably secreted sialidase; s... 53 6e-006
emb|CAC13065.1| (AL445503) conserved hypothetical protein [... 50 5e-005
gb|AAK06388.1| (AF078042) glycosyl hydrolase 5 [Caldicellul... 49 1e-004
pir||T35237 probable secreted cellulase - Streptomyces coel... 47 5e-004
ref|NP_105312.1| (NC_002678) microcystin dependent protein ... 47 5e-004
ref|NP_487884.1| (NC_003272) hypothetical protein [Nostoc s... 45 0.002
ref|NP_197703.1| (NM_122218) photosystem II stability/assem... 42 0.010
gb|AAK77227.1| (AY040839) endoglucanase C [Aspergillus niger] 42 0.017
pir||UMMS period clock protein - mouse (fragment) >gi|13341... 40 0.049
ref|NP_032862.1| (NM_008836) per-hexamer repeat gene 5 [Mus... 40 0.049
ref|NP_189520.1| (NM_113799) histone-H4-like protein [Arabi... 40 0.049
>ref|NP_519813.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
emb|CAD15394.1| (AL646065) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
Length = 225
Score = 66.6 bits (161), Expect = 5e-010
Identities = 42/96 (43%), Positives = 53/96 (54%), Gaps = 9/96 (9%)
Query: 104 YLRTDGSAVSRTTYADLFAVIGTTYGAGDGSTTFNLPQEGGQVPT-------PTPGETFV 156
+L+ +G+AVSRTTYA LF VIGTT+GAGDGSTTFNLP+ + P G F
Sbjct: 80 WLKGNGAAVSRTTYARLFGVIGTTFGAGDGSTTFNLPELRAEFPRGWDDGRGVDAGRVFG 139
Query: 157 EKQSDGYPRDVRSTAIAGLPDGTILAAFGRFSGTTI 192
Q+ TAI DG+ L +G + T I
Sbjct: 140 SVQAQALSSHQHKTAIGF--DGSNLFGWGDGNATPI 173
>ref|NP_251770.1| (NC_002516) hypothetical protein [Pseudomonas aeruginosa]
pir||D83260 hypothetical protein PA3080 [imported] - Pseudomonas aeruginosa
(strain PAO1)
gb|AAG06468.1|AE004732_2 (AE004732) hypothetical protein [Pseudomonas aeruginosa]
Length = 365
Score = 60.5 bits (145), Expect = 4e-008
Identities = 60/236 (25%), Positives = 97/236 (40%), Gaps = 17/236 (7%)
Query: 195 SGDGGDTWLSTGLGNGLSSYNIPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWSDTG- 253
S D G TW G + + I A++ + + L ++DGG++W D
Sbjct: 138 SDDAGSTWARQFEDLGREAPLLDIWFADEQHGLAV----GAYGALLETRDGGQHWEDVSE 193
Query: 254 -LGNGLTINRGDIAAAPNGDFYYVNNGGAIFRSTDNGANWSQVSGQLPDNTFASIAVDST 312
L N + IAA + V G++FRS D GA W ++ G + F +I
Sbjct: 194 RLDNEDQFHLNAIAAVKDNGLLVVGEAGSLFRSKDWGATWEKLEGPYEGSLFGAIGTADA 253
Query: 313 GAVYVFAEFVTGTRVYRSANNGANWAIAAEEGAESGLPDTLNPVAFGAGRNGDLYIISPD 372
G V V+ ++RSA+ G +W + A L L+ A A +G + ++
Sbjct: 254 GGVLVYG---LRGHLFRSADFGDSWEEIPLKAASGDLEFGLSDGALLA--DGRIVVVG-H 307
Query: 373 NNQGYRSVDNGASWSPVSGLIADFSFAYVTSTPNGSIYAVAGTFSAGGIFASEGSG 428
S D G S+S V S A V++T +G + V GG+ + +G
Sbjct: 308 GGSVLESTDGGRSFS-VFNRPDRLSLAGVSATGDGHLILV----GQGGVHLAAANG 358
>ref|NP_520042.1| (NC_003295) PUTATIVE TAIL FIBER-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD15623.1| (AL646067) PUTATIVE TAIL FIBER-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 554
Score = 60.1 bits (144), Expect = 5e-008
Identities = 27/42 (64%), Positives = 35/42 (83%)
Query: 104 YLRTDGSAVSRTTYADLFAVIGTTYGAGDGSTTFNLPQEGGQ 145
+L+ +G+AVSRTTYA L+A IGTT+GAGDG+ TFNLP G+
Sbjct: 415 WLKANGAAVSRTTYAALYAEIGTTFGAGDGAATFNLPDLRGE 456
>ref|NP_298995.1| (NC_002488) phage-related tail fiber protein [Xylella fastidiosa
9a5c]
pir||C82649 phage-related tail fiber protein XF1706 [imported] - Xylella
fastidiosa (strain 9a5c)
gb|AAF84515.1|AE003994_14 (AE003994) phage-related tail fiber protein [Xylella fastidiosa
9a5c]
Length = 387
Score = 54.7 bits (130), Expect = 2e-006
Identities = 29/59 (49%), Positives = 36/59 (60%)
Query: 82 TAIAAPRVGERLRTYGTAAEPQYLRTDGSAVSRTTYADLFAVIGTTYGAGDGSTTFNLP 140
TA++ G+ + T G A P L DG AVSR Y LF I T+YGAGDG +TFN+P
Sbjct: 131 TALSRYEPGQIVYTAGKRALPGTLLCDGRAVSRAMYPRLFEEINTSYGAGDGVSTFNIP 189
>ref|NP_347555.1| (NC_003030) Probably secreted sialidase; several ASP-boxes and
dockerin domain [Clostridium acetobutylicum]
gb|AAK78895.1|AE007608_1 (AE007608) Probably secreted sialidase; several ASP-boxes and
dockerin domain [Clostridium acetobutylicum]
Length = 839
Score = 53.1 bits (126), Expect = 6e-006
Identities = 59/204 (28%), Positives = 88/204 (42%), Gaps = 18/204 (8%)
Query: 232 RNNNSRPLFRSQDGGENWSDTGLGNGLTINRGDIAAAPNGD--FYYVNNGGAIFRSTDNG 289
+N + S DGG+NW G G +AA + + G STDNG
Sbjct: 526 KNQQDKDCGISYDGGKNWFSAGSNISGVYKAGTVAAGADAKTIVWSPEEGANAAYSTDNG 585
Query: 290 ANWSQVSGQLPDNTFASIAVDSTGAVYVFAEFVTGTRVYRSANNGANWAIAAEEGAESGL 349
W+ SG LP A + D F F+ G + Y S + GA + + +++GL
Sbjct: 586 NKWTPCSG-LPQG--AKVRSDRVNP-KKFYGFLNG-KFYISTDAGATFT----QSSQTGL 636
Query: 350 PDTLNPVAFGA--GRNGDLYIISPDNNQGYRSVDNGASWSPVSGLIADFSFAYVTS-TPN 406
P T F G GD++I + + S D+GA+++ VSG+ A + S T +
Sbjct: 637 P-TKGKGIFKTVIGHEGDIWIAGGKDGL-WHSTDSGATFTKVSGVDASDTVGLGKSKTDD 694
Query: 407 G--SIYAVAGTFSAGGIFASEGSG 428
G +IY A GIF S+ G
Sbjct: 695 GYPAIYMDATIDGTAGIFRSDDEG 718
Score = 44.3 bits (103), Expect = 0.003
Identities = 53/221 (23%), Positives = 78/221 (34%), Gaps = 53/221 (23%)
Query: 195 SGDGGDTWLSTGLGNGLSSYNIPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWSD-TG 253
S DGG W S G N Y +AA ++ + + S D G W+ +G
Sbjct: 536 SYDGGKNWFSAG-SNISGVYKAGTVAAGADAKTIVWSPEEGANAAY-STDNGNKWTPCSG 593
Query: 254 LGNGLTINRGDIAAAP-----NGDFYYVNNGGAIF------------------------- 283
L G + + NG FY + GA F
Sbjct: 594 LPQGAKVRSDRVNPKKFYGFLNGKFYISTDAGATFTQSSQTGLPTKGKGIFKTVIGHEGD 653
Query: 284 -----------RSTDNGANWSQVSGQLPDNTFA---SIAVDSTGAVYVFAEFVTGTRVYR 329
STD+GA +++VSG +T S D A+Y+ A ++R
Sbjct: 654 IWIAGGKDGLWHSTDSGATFTKVSGVDASDTVGLGKSKTDDGYPAIYMDATIDGTAGIFR 713
Query: 330 SANNGANWAIAAEEGAESGLPD---TLNPVAFG---AGRNG 364
S + GA W ++ + G PD T +P +G G NG
Sbjct: 714 SDDEGATWVRINDDAHQYGSPDYCITGDPNKYGRVFVGTNG 754
>emb|CAC13065.1| (AL445503) conserved hypothetical protein [Streptomyces coelicolor]
Length = 384
Score = 50.1 bits (118), Expect = 5e-005
Identities = 60/223 (26%), Positives = 94/223 (41%), Gaps = 42/223 (18%)
Query: 202 WLSTGLGNGLSSYNIPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWSDTGLGN----- 256
W+ G G GL + + A D G++ + ++ +FR+ DGG +W+ + G
Sbjct: 165 WVPGGGGEGLHTV---LTDARDPGSVTVAV---STAGVFRTADGGASWAPSNSGVSAVFL 218
Query: 257 -------GLTINR-GDIAAAPNGDFYYVNNGGAIFRSTDNGANWSQVSGQLPDN-TFASI 307
G +++ AA P D Y+ N ++RS D GA+W+ + LP FA
Sbjct: 219 PDPDPEFGQCVHKVARDAATP--DRLYLQNHWGVYRSDDAGAHWTDIGEGLPSTFGFAVA 276
Query: 308 AVDSTG-AVYVF-----AEFVTG---TRVYRSANNGANWAIAAEEGAESGLP--DTLNPV 356
A G YVF A+ V RV+R+A+ G +W E +GLP D V
Sbjct: 277 AHPHRGDTAYVFPINADADRVPADHRCRVFRTADAGKSW-----EPLSAGLPREDHYGTV 331
Query: 357 AFGAGRNGDL----YIISPDNNQGYRSVDNGASWSPVSGLIAD 395
A D N + + S D+G SW ++ + D
Sbjct: 332 LRDALSTDDAEQAGVYFGNRNGEVFASADDGDSWRQLASHLPD 374
Score = 46.2 bits (108), Expect = 7e-004
Identities = 33/127 (25%), Positives = 56/127 (43%), Gaps = 19/127 (14%)
Query: 192 IRKSGDGGDTWLSTGLGNGL-SSYNIPIMA---ANDSGTILLIDRNNNSRP------LFR 241
+ +S D G W T +G GL S++ + A D+ + I+ + + P +FR
Sbjct: 250 VYRSDDAGAHW--TDIGEGLPSTFGFAVAAHPHRGDTAYVFPINADADRVPADHRCRVFR 307
Query: 242 SQDGGENWSDTGLGNGLTINRGDIAA-------APNGDFYYVNNGGAIFRSTDNGANWSQ 294
+ D G++W G + G + A Y+ N G +F S D+G +W Q
Sbjct: 308 TADAGKSWEPLSAGLPREDHYGTVLRDALSTDDAEQAGVYFGNRNGEVFASADDGDSWRQ 367
Query: 295 VSGQLPD 301
++ LPD
Sbjct: 368 LASHLPD 374
>gb|AAK06388.1| (AF078042) glycosyl hydrolase 5 [Caldicellulosiruptor sp. Tok7B.1]
Length = 996
Score = 48.9 bits (115), Expect = 1e-004
Identities = 53/197 (26%), Positives = 81/197 (40%), Gaps = 16/197 (8%)
Query: 233 NNNSRPLFRSQDGGENWSDTGLGNGLTINRGDIAAAPNGD-FYYVNNGGAIFRSTDNGAN 291
N N+ + S DGG++W T G +AAA +G + G + STDNG
Sbjct: 529 NPNTNRIGFSYDGGKSWFQGNTEPQGTSEGGTVAAAADGSAVVWAPKGAKVCYSTDNGNK 588
Query: 292 WSQVSGQLPDNTFASIAVDSTGAVYVFAEFVTGTRVYRSANNGANWAIAAEEGAESGLPD 351
W + + + S V+ F F G + Y SA+ G + E +GLP
Sbjct: 589 WVECANVPSEAIVYSDRVNPN----KFYAFKNG-KFYISADKGKTFI----ESPAAGLPI 639
Query: 352 TLNPVAFGAGRNGDLYIISPDNNQGYRSVDNGASWSPVSGLIADFSFAY---VTSTPNGS 408
+ N G GD++++ NN + S D G S+ +SG+ S + +
Sbjct: 640 SGN-FKTVPGIEGDIWLVG--NNGMWHSTDGGYSFVKISGVEDAASIGFGKPAEGETYPA 696
Query: 409 IYAVAGTFSAGGIFASE 425
IY A GIF S+
Sbjct: 697 IYTYAKINGVRGIFRSD 713
>pir||T35237 probable secreted cellulase - Streptomyces coelicolor
emb|CAA20642.1| (AL031515) putative secreted cellulase [Streptomyces coelicolor
A3(2)]
Length = 890
Score = 46.6 bits (109), Expect = 5e-004
Identities = 56/223 (25%), Positives = 84/223 (37%), Gaps = 35/223 (15%)
Query: 198 GGDTWLSTGLGNGLSSYNIPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWSDTGL--- 254
G D W TG+ S P GT D + + + RS D G +W L
Sbjct: 94 GWDDWGHTGVVALASDAVDPDRVYAAVGTYTN-DWDPTNGAVLRSADRGASWEKADLPFK 152
Query: 255 --GNGLTINRGD-IAAAPN-GDFYYVN--NGGAIFRSTDNGANWSQVSGQLPDNTFASIA 308
GN G+ +A P+ D Y+ +G ++RSTD G WS+V+ +A
Sbjct: 153 LGGNMPGRGMGERLAVDPHDNDVLYLGAPSGHGLWRSTDAGVTWSEVTAFPNPGNYAQDP 212
Query: 309 VDSTGAVYVFAEFVTGTRVYRSANNGANWAIAAEEGAESGLPDTLNPVAFGAGRNGDLYI 368
D++G Y S N G W + +E G + G
Sbjct: 213 NDTSG--------------YASDNQGITW-VTFDESTGGGAGTATRTLYVG--------- 248
Query: 369 ISPDNNQGYRSVDNGASWSPVSGLIADF-SFAYVTSTPNGSIY 410
++ N YRS D GA+W ++G + + V NG +Y
Sbjct: 249 VADKENAVYRSTDAGATWERLAGQPTGYLAHKGVLDAENGYLY 291
Score = 40.4 bits (93), Expect = 0.038
Identities = 39/167 (23%), Positives = 67/167 (39%), Gaps = 34/167 (20%)
Query: 195 SGDGGDTWL----STGLGNGLSSYNIPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWS 250
S + G TW+ STG G G ++ + + A+ + +RS D G W
Sbjct: 220 SDNQGITWVTFDESTGGGAGTATRTLYVGVADKENAV------------YRSTDAGATWE 267
Query: 251 D-TGLGNGLTINRGDIAAAPNGDFY---------YVNNGGAIFRSTDNGANWSQVSGQLP 300
G G ++G + A NG Y Y G ++R W+ +S
Sbjct: 268 RLAGQPTGYLAHKG-VLDAENGYLYLAYSDTGGPYDGGKGRLYRYATATGTWTDISPAAE 326
Query: 301 DNT---FASIAVD----STGAVYVFAEFVTGTRVYRSANNGANWAIA 340
+T F+ + VD T ++ + T+++RS ++GA W+ A
Sbjct: 327 ADTYYGFSGLTVDRQRPGTVMATAYSSWWPDTQIFRSTDSGATWSQA 373
>ref|NP_105312.1| (NC_002678) microcystin dependent protein MdpB [Mesorhizobium loti]
dbj|BAB51098.1| (AP003004) microcystin dependent protein; MdpB [Mesorhizobium loti]
Length = 169
Score = 46.6 bits (109), Expect = 5e-004
Identities = 26/64 (40%), Positives = 36/64 (55%), Gaps = 1/64 (1%)
Query: 84 IAAPRVGERLRTYGTAAEPQYLRTDGSAVSRTTYADLFAVIGTTYGAGDGSTTFNLPQEG 143
+A P +GE G A ++ +G + + Y LF +IGTTYG GDG +TF LP
Sbjct: 1 MAQPYIGEIRMFAGNFAPAGWMFCEGQLLPISEYETLFQLIGTTYG-GDGQSTFALPDLR 59
Query: 144 GQVP 147
G+VP
Sbjct: 60 GRVP 63
>ref|NP_487884.1| (NC_003272) hypothetical protein [Nostoc sp. PCC 7120]
dbj|BAB75543.1| (AP003594) ORF_ID:alr3844~hypothetical protein [Nostoc sp. PCC
7120]
Length = 339
Score = 44.7 bits (104), Expect = 0.002
Identities = 40/161 (24%), Positives = 62/161 (37%), Gaps = 18/161 (11%)
Query: 235 NSRPLFRSQDGGENWSDTGLGNGLTINRGDIAAAPNGDFYYVNNGGAIFRSTDNGANWSQ 294
+S L + DGG NW L R D + + + V + +TD G +WS+
Sbjct: 62 SSATLLETNDGGNNWQPLNLALDDDRYRFDSVSFAGKEGWIVGEPSLLLHTTDEGRSWSR 121
Query: 295 V--SGQLPDNTFASIAVDSTGAVYVFAEFVTGT-RVYRSANNGANW--AIAAEEGAESGL 349
+ S +LP N IA+ + G AE T +Y++ + G NW + A G L
Sbjct: 122 IPLSEKLPGN---PIAIQALGT--DIAEMATDVGAIYKTTDGGKNWKAQVEAAVGVVRNL 176
Query: 350 PDTLNPVAFGAGRNGDLYII--------SPDNNQGYRSVDN 382
+++ G Y P N R V+N
Sbjct: 177 ERSVDGKYVAVSAKGSFYSTWEAGQNAWVPHNRNSSRRVEN 217
>ref|NP_197703.1| (NM_122218) photosystem II stability/assembly factor HCF136
(sp|O82660) [Arabidopsis thaliana]
sp|O82660|H136_ARATH Photosystem II stability/assembly factor HCF136, chloroplast
precursor
pir||T51828 probable photosystem II stability protein HCF136 [imported] -
Arabidopsis thaliana
emb|CAA75723.1| (Y15628) HCF136 protein [Arabidopsis thaliana]
dbj|BAB09829.1| (AB006708) photosystem II stability/assembly factor HCF136
[Arabidopsis thaliana]
gb|AAK74049.1| (AY045691) AT5g23120/MYJ24_11 [Arabidopsis thaliana]
Length = 403
Score = 42.4 bits (98), Expect = 0.010
Identities = 52/224 (23%), Positives = 83/224 (36%), Gaps = 43/224 (19%)
Query: 239 LFRSQDGGENWSDTGLGNGLTINRGDIAAAPNGDFYY--------------VNNGGAIFR 284
L ++DGG W N I +A DF Y + +
Sbjct: 120 LLETKDGGSTW-----------NPRSIPSAEEEDFNYRFNSISFKGKEGWIIGKPAILLY 168
Query: 285 STDNGANWSQV--SGQLP-DNTFASIAVDSTGAVYVFAEFVTGT-RVYRSANNGANWAIA 340
+ D G NW ++ S QLP D F D + AE VT +Y ++N G NW A
Sbjct: 169 TADAGENWDRIPLSSQLPGDMVFIKATEDKS------AEMVTDEGAIYVTSNRGYNWKAA 222
Query: 341 AEEGAESGLPDTLNPVAFGAG-RNGDLYIISPDNNQGYRSVDNGA----SWSPVSGLIAD 395
+E + L T++ GA G ++ + Y +V + +W P
Sbjct: 223 IQETVSATLNRTVSSGISGASYYTGTFSAVNRSPDGRYVAVSSRGNFFLTWEPGQPYWQP 282
Query: 396 FSFAYVTSTPNGSIYAVAGTF---SAGGIFASEGSGVMPDPTDV 436
+ A N A G + GG++ S+G+G+ + +V
Sbjct: 283 HNRAVARRIQNMGWRADGGLWLLVRGGGLYLSKGTGITEEFEEV 326
>gb|AAK77227.1| (AY040839) endoglucanase C [Aspergillus niger]
Length = 857
Score = 41.6 bits (96), Expect = 0.017
Identities = 42/170 (24%), Positives = 74/170 (42%), Gaps = 22/170 (12%)
Query: 144 GQVPTPTPGETFVEKQSDGYPRDVRS---TAIAGLPDGTILAA-------FGRFSGTTIR 193
GQV +T + SDG R S +A++ LP G ++A+ F SG++
Sbjct: 536 GQVALSADADTILLMNSDGAYRSANSATLSAVSSLPSGAVIASDKANNTYFYGASGSSFY 595
Query: 194 KSGDGGDTW-LSTGLGNGLSSYNIPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWSDT 252
S D T+ ++T LG+ ++ I + + D LF S + G++++
Sbjct: 596 LSSDTAATFTVTTTLGSSTTANAIRAQPSLAGDVWVSTDTG-----LFHSTNYGKSFTQI 650
Query: 253 GLGNGLTINRGDIAAAPNGD------FYYVNNGGAIFRSTDNGANWSQVS 296
G G + G + +GD F+ V+ +F++ D G NW +S
Sbjct: 651 GSGCTEGWSFGFGKPSSDGDYPVLFGFFTVDGVTGLFKTEDQGVNWQIIS 700
>pir||UMMS period clock protein - mouse (fragment)
emb|CAA26710.1| (X02966) unidentified reading frame (first ATG at pos. 210) [Mus
musculus]
prf||1111288A gene cp2.2 [Mus musculus]
Length = 713
Score = 40.0 bits (92), Expect = 0.049
Identities = 74/338 (21%), Positives = 108/338 (31%), Gaps = 33/338 (9%)
Query: 97 GTAAEPQYLRTDGSAVSRTTYADLFAVIGTTYGAGDGSTTFNLPQEGGQVPTPTPGETFV 156
GT + + G+A T D GT G G GS + G T T ET
Sbjct: 391 GTGSGSGTAKVTGTATGTGTGTDTGTGTGTGTGTGTGSGSGTAKVTGTATTTATVTETGT 450
Query: 157 EKQSDGYPRDVRSTAIAGLPDGTILAAFGRFSGT-TIRKSGDGGDTWLSTGLGNGLSSYN 215
K + D + + G GT G +GT T +G G T TG G G +
Sbjct: 451 AKVTG---TDTGTAKVTGTGTGT-----GTGTGTGTGTGTGTGTGTGTGTGTGTGTGT-G 501
Query: 216 IPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWSDTGLGNGLTINRGDIAAAPNGDFYY 275
+ SGT + + + + + G + TG G G G + + +G
Sbjct: 502 TGTGTGSGSGTAKVTGTDTGTAKVTGTGTGTGTGTGTGTGTGTGTGTGTGSGSGSGS--- 558
Query: 276 VNNGGAIFRSTDNGANWSQVSGQLPDNTFASIAVDSTGAVYVFAEFVTGTRVYRSANNGA 335
+G T G SG + V TG TGT + +G+
Sbjct: 559 -GSGSGSGTGTGTGLGSGSGSGTAKVTGTGTAKVTGTG---------TGTGTGTGSGSGS 608
Query: 336 NWAIAAEEGAESGLPDTLNPVAFGAGRNGDLYIISPDNNQGYRSVDNGASWSPVSGLIAD 395
+ G+ SG G+G + S + G +G
Sbjct: 609 GSGSGSGSGSGSG---------SGSGTGTGTGLGSGSGSGSGTGTGTGTGTGTGTGTGTG 659
Query: 396 FSFAYVTSTPNGSIYAV-AGTFSAGGIFASEGSGVMPD 432
S V T G+ A GT + G G+G D
Sbjct: 660 TSTVTVRGTGTGTATATGTGTGTGTGTGTGTGTGTGTD 697
>ref|NP_032862.1| (NM_008836) per-hexamer repeat gene 5 [Mus musculus]
sp|P08399|PHX5_MOUSE PER-HEXAMER REPEAT PROTEIN 5
gb|AAA88320.1| (M12039) ORF starts at 87, first start codon is found at 210.;
putative [Mus musculus]
Length = 672
Score = 40.0 bits (92), Expect = 0.049
Identities = 74/338 (21%), Positives = 108/338 (31%), Gaps = 33/338 (9%)
Query: 97 GTAAEPQYLRTDGSAVSRTTYADLFAVIGTTYGAGDGSTTFNLPQEGGQVPTPTPGETFV 156
GT + + G+A T D GT G G GS + G T T ET
Sbjct: 350 GTGSGSGTAKVTGTATGTGTGTDTGTGTGTGTGTGTGSGSGTAKVTGTATTTATVTETGT 409
Query: 157 EKQSDGYPRDVRSTAIAGLPDGTILAAFGRFSGT-TIRKSGDGGDTWLSTGLGNGLSSYN 215
K + D + + G GT G +GT T +G G T TG G G +
Sbjct: 410 AKVTG---TDTGTAKVTGTGTGT-----GTGTGTGTGTGTGTGTGTGTGTGTGTGTGT-G 460
Query: 216 IPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWSDTGLGNGLTINRGDIAAAPNGDFYY 275
+ SGT + + + + + G + TG G G G + + +G
Sbjct: 461 TGTGTGSGSGTAKVTGTDTGTAKVTGTGTGTGTGTGTGTGTGTGTGTGTGSGSGSGS--- 517
Query: 276 VNNGGAIFRSTDNGANWSQVSGQLPDNTFASIAVDSTGAVYVFAEFVTGTRVYRSANNGA 335
+G T G SG + V TG TGT + +G+
Sbjct: 518 -GSGSGSGTGTGTGLGSGSGSGTAKVTGTGTAKVTGTG---------TGTGTGTGSGSGS 567
Query: 336 NWAIAAEEGAESGLPDTLNPVAFGAGRNGDLYIISPDNNQGYRSVDNGASWSPVSGLIAD 395
+ G+ SG G+G + S + G +G
Sbjct: 568 GSGSGSGSGSGSG---------SGSGTGTGTGLGSGSGSGSGTGTGTGTGTGTGTGTGTG 618
Query: 396 FSFAYVTSTPNGSIYAV-AGTFSAGGIFASEGSGVMPD 432
S V T G+ A GT + G G+G D
Sbjct: 619 TSTVTVRGTGTGTATATGTGTGTGTGTGTGTGTGTGTD 656
>ref|NP_189520.1| (NM_113799) histone-H4-like protein [Arabidopsis thaliana]
dbj|BAB03175.1| (AP002057) gene_id:T19N8.7~unknown protein [Arabidopsis thaliana]
Length = 614
Score = 40.0 bits (92), Expect = 0.049
Identities = 75/298 (25%), Positives = 109/298 (36%), Gaps = 53/298 (17%)
Query: 130 AGDGSTTFNLPQEGGQVPTPTPGETFVEKQSDGYPRDVRSTAIAGLPDGTILAAFGRFSG 189
+G S P +GG T G++ S YP D S + AG P G+ G
Sbjct: 147 SGSPSADTGSPTDGGSYGDTT-GDSGSSAGSPSYPSDDGSGSTAGGPSGSTT------DG 199
Query: 190 TTIRKSGDGGDTWLSTGLGNGLSSYNIPIMAANDSGTILLIDRNNNSRPLFRSQDGGENW 249
+ +S GGD S+ G AA +SG+ D + + + GG
Sbjct: 200 SAGGESSMGGD---SSSAG-----------AAGESGSAATADSGDAA----GADSGGAAG 241
Query: 250 SDTGLGNGLTINRGDIAAAPNGDFYYVNNGGAIFRSTDNGANWSQVSGQLPDNTFASIAV 309
+D+ G + + G AA +GGA T G+ ++ G+ AS A
Sbjct: 242 ADS--GGAASADSGGAAAGETA------SGGAAAADTSGGS--AETGGESASGGAASGAG 291
Query: 310 DSTGAVYVFAEFVTGTRVYRSANNGANWAIAAEEGAESGLPDTLNPVAFGAGRNGDLYII 369
++G A TG +A+ G +AE G ES + GA G
Sbjct: 292 AASG-----ASAKTGGESGEAASGG-----SAETGGES--------ASAGAASGGSAETG 333
Query: 370 SPDNNQGYRSVDNGASWSPVSGLIADFSFAYVTSTPNGSIYAVAGTFSAGGIFASEGS 427
+ G S AS SG + + T + S A +G SA G AS GS
Sbjct: 334 GESGSGGAASGGESASGGATSGGSPETGGSAETGGESASGGAASGGESASGGAASSGS 391
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.314 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 301,766,192
Number of Sequences: 919193
Number of extensions: 15191604
Number of successful extensions: 25233
Number of sequences better than 5.0e-02: 25
Number of HSP's better than 0.1 without gapping: 3
Number of HSP's successfully gapped in prelim test: 22
Number of HSP's that attempted gapping in prelim test: 25109
Number of HSP's gapped (non-prelim): 79
length of query: 444
length of database: 288,098,425
effective HSP length: 123
effective length of query: 321
effective length of database: 175,037,686
effective search space: 56187097206
effective search space used: 56187097206
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 92 (40.0 bits)
Query= orf26 [7447-7622]
(174 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_274047.1| (NC_003112) hypothetical protein [Neisseri... 125 2e-028
ref|NP_108221.1| (NC_002678) hypothetical protein [Mesorhiz... 84 5e-016
ref|NP_539912.1| (NC_003317) SECRETION ACTIVATOR PROTEIN [B... 82 3e-015
ref|NP_108621.1| (NC_002678) hypothetical protein [Mesorhiz... 79 2e-014
ref|NP_540027.1| (NC_003317) SECRETION ACTIVATOR PROTEIN [B... 72 2e-012
ref|NP_103651.1| (NC_002678) hypothetical protein [Mesorhiz... 64 5e-010
ref|NP_437108.1| (NC_003078) putative protein [Sinorhizobiu... 59 2e-008
>ref|NP_274047.1| (NC_003112) hypothetical protein [Neisseria meningitidis MC58]
ref|NP_283994.1| (NC_003116) hypothetical protein NMA1230 [Neisseria meningitidis
Z2491]
pir||E81131 hypothetical protein NMB1012 [imported] - Neisseria meningitidis
(group B strain MD58, group A strain Z2491)
gb|AAF41413.1| (AE002452) hypothetical protein [Neisseria meningitidis MC58]
emb|CAB84490.1| (AL162755) hypothetical protein NMA1230 [Neisseria meningitidis
Z2491]
Length = 167
Score = 125 bits (313), Expect = 2e-028
Identities = 68/171 (39%), Positives = 101/171 (58%), Gaps = 8/171 (4%)
Query: 2 DTFDKAFDRVIGHEGGFQNDQKDRGNWTGGKVGVGENKGTKFGLAAMTYPDLDIKNLTVD 61
D F++ +RV+ HEGG+ N KD GG+ G K T A + ++ +T +
Sbjct: 3 DKFNQFINRVLSHEGGYANHPKD----PGGETNWGITKRT----AQANGYNGSMRAMTRE 54
Query: 62 EAKAIYRRDWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQRASGAHDDGIIGPKTI 121
+A +IYR+ +W+ A+ +Q FDA +NHG NA +MLQRA+G DDG+IG ++
Sbjct: 55 QAISIYRKAFWERYRADQMPEAVAFQFFDACVNHGYGNAARMLQRAAGVPDDGVIGAVSL 114
Query: 122 GAARATDLDDLLMLFLAERLEFFTNIKTFDHYGRGWSRRIAGNLRLAAEDN 172
A + +DLL+ F AERL F+T + TF +G+GW RR+A NL A+ DN
Sbjct: 115 KAINSLPENDLLLRFNAERLVFYTKLGTFTSFGKGWVRRVAQNLIHASADN 165
>ref|NP_108221.1| (NC_002678) hypothetical protein [Mesorhizobium loti]
dbj|BAB53682.1| (AP003013) hypothetical protein [Mesorhizobium loti]
Length = 283
Score = 84.3 bits (207), Expect = 5e-016
Identities = 52/158 (32%), Positives = 78/158 (48%), Gaps = 8/158 (5%)
Query: 10 RVIGHEGGFQNDQKDRGNWTGGKVGVGENKGTKFGLAAMTYPDLDIKNLTVDEAKAIYRR 69
RV+ HEGG+ N D G T V G + G T +K +T DE AIY R
Sbjct: 12 RVLAHEGGYSNHPNDPGGPTMKGVTQRVYDGYRKGKGLATR---SVKGITTDELNAIYDR 68
Query: 70 DWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQR----ASGAHDDGIIGPKTIGAAR 125
+WDA+ + Y +FD A+N G + LQ+ A DG++G T+ A +
Sbjct: 69 QYWDAVRGDDLPAGVDYVVFDGAVNSGPSRSIMWLQQALRPAYTGRIDGVLGMGTLAALK 128
Query: 126 ATDLDDLLMLFLAE-RLEFFTNIKTFDHYGRGWSRRIA 162
A +D L+ + R+ F ++ TF ++GRGW+ R+A
Sbjct: 129 ADKNNDALIDRICNARMAFLKHLSTFGNFGRGWTARVA 166
>ref|NP_539912.1| (NC_003317) SECRETION ACTIVATOR PROTEIN [Brucella melitensis]
gb|AAL52176.1| (AE009540) SECRETION ACTIVATOR PROTEIN [Brucella melitensis]
Length = 252
Score = 81.6 bits (200), Expect = 3e-015
Identities = 47/156 (30%), Positives = 71/156 (45%), Gaps = 4/156 (2%)
Query: 3 TFDKAFDRVIGHEGGFQNDQKDRGNWTGGKVGVGENKGTKFGLAAMTYPDLDIKNLTVDE 62
TF KA V EGG+ + KD GG +G T ++K LT +
Sbjct: 5 TFAKAMPHVFSEEGGYVDHPKD----PGGATNMGITLATLSAWEGRKVSKAEVKALTKTK 60
Query: 63 AKAIYRRDWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQRASGAHDDGIIGPKTIG 122
A IYR ++W+ + + + D AI+ G A KMLQ+ G DG+IG KT+
Sbjct: 61 ATDIYRENYWNKVAGDDLPAGVDHATLDFAIHSGPARALKMLQKVVGVDQDGVIGAKTLA 120
Query: 123 AARATDLDDLLMLFLAERLEFFTNIKTFDHYGRGWS 158
A R + ++ RL + + TF +G+GW+
Sbjct: 121 AVRKMAANRIINELCDARLAWLKGLGTFSTFGKGWT 156
>ref|NP_108621.1| (NC_002678) hypothetical protein [Mesorhizobium loti]
dbj|BAB54407.1| (AP003014) hypothetical protein [Mesorhizobium loti]
Length = 283
Score = 79.3 bits (194), Expect = 2e-014
Identities = 50/158 (31%), Positives = 76/158 (47%), Gaps = 8/158 (5%)
Query: 10 RVIGHEGGFQNDQKDRGNWTGGKVGVGENKGTKFGLAAMTYPDLDIKNLTVDEAKAIYRR 69
RV+ HEGG+ N D G T V G + G T +K++T E IY R
Sbjct: 12 RVLAHEGGYSNHPNDPGGPTMKGVTQRVYDGFRKGKGLATR---SVKSITTAELNEIYDR 68
Query: 70 DWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQR----ASGAHDDGIIGPKTIGAAR 125
+WDA+ + Y +FD A+N G + LQ+ A DG++G T+ A +
Sbjct: 69 QYWDAVQGDALPAGVDYVVFDGAVNSGPGRSIMWLQQALRPAYTGRIDGVLGMGTLAALK 128
Query: 126 ATDLDDLLM-LFLAERLEFFTNIKTFDHYGRGWSRRIA 162
A +D L+ + R+ F ++ TF +GRGW+ R+A
Sbjct: 129 ADKNNDALIDRICSARMAFLKHLSTFGTFGRGWTARVA 166
>ref|NP_540027.1| (NC_003317) SECRETION ACTIVATOR PROTEIN [Brucella melitensis]
gb|AAL52291.1| (AE009550) SECRETION ACTIVATOR PROTEIN [Brucella melitensis]
Length = 253
Score = 72.4 bits (176), Expect = 2e-012
Identities = 42/158 (26%), Positives = 67/158 (41%), Gaps = 4/158 (2%)
Query: 4 FDKAFDRVIGHEGGFQNDQKDRGNWTGGKVGVGENKGTKFGLAAMTYPDLDIKNLTVDEA 63
F + EGG+ ++ D GG +G T D+K LT A
Sbjct: 5 FQTVMPYIFSEEGGYADNPAD----PGGATNMGITINTLSAWEGRQVSPQDVKELTQATA 60
Query: 64 KAIYRRDWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQRASGAHDDGIIGPKTIGA 123
IY+ ++W+ + + Y +FD A+N G A K LQ+ +DGIIG +T+ A
Sbjct: 61 TQIYQVEFWNKIDGNDLPSGVDYALFDFAVNSGPGRAAKTLQKILAMPEDGIIGAQTVAA 120
Query: 124 ARATDLDDLLMLFLAERLEFFTNIKTFDHYGRGWSRRI 161
A A + ++ R + + T +G GW R+
Sbjct: 121 AAARSPEGIINALCDARAAWLRGLSTAATFGNGWLARV 158
>ref|NP_103651.1| (NC_002678) hypothetical protein [Mesorhizobium loti]
dbj|BAB49437.1| (AP002999) hypothetical protein [Mesorhizobium loti]
Length = 409
Score = 64.3 bits (155), Expect = 5e-010
Identities = 46/172 (26%), Positives = 78/172 (44%), Gaps = 10/172 (5%)
Query: 4 FDKAFDRVIGHEGGFQNDQKDRGNWTGGKVGVGENKGTKFGLAAMTYPDLDIKNLTVDEA 63
F++A V EGG+ +D KD GG G + T D+ N++ EA
Sbjct: 49 FERAQGIVREFEGGYSDDPKD----PGGATNFGIARKTLAAWRGKPVSKADVANMSYQEA 104
Query: 64 KAIYRRDWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQRASGAHD-----DGIIGP 118
K I+ ++W G + +++ A++ G+R+A LQ+A + DG IG
Sbjct: 105 KEIFFSEYWSKSSCGAMPGPLALAVYNVAVHAGVRDAATFLQKALNQNGAAVAVDGGIGG 164
Query: 119 KTIGAARATDLDDLLMLFLAERLEFFTNIKTFDHYGRGWSRRIAGNLRLAAE 170
+T+GA L D++ + + ++HY G++RR+ LRL E
Sbjct: 165 ETLGAIPKAPLPDVIGDIIDLYDAKLRAHRDYEHYKNGFNRRV-NKLRLETE 215
>ref|NP_437108.1| (NC_003078) putative protein [Sinorhizobium meliloti]
emb|CAC48968.1| (AL603644) putative protein [Sinorhizobium meliloti]
Length = 91
Score = 58.9 bits (141), Expect = 2e-008
Identities = 25/71 (35%), Positives = 40/71 (56%)
Query: 55 IKNLTVDEAKAIYRRDWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQRASGAHDDG 114
++ ++ +EA IYRR +W G+ P + Y +FD +N G A K LQ+ G +DG
Sbjct: 6 VRAMSREEAADIYRRSYWPQCGVDLLPPGLDYAVFDFGVNSGPARAVKTLQKVVGVREDG 65
Query: 115 IIGPKTIGAAR 125
+G +T+ A R
Sbjct: 66 HVGEQTLAAVR 76
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.321 0.139 5.40e-067
Gapped
Lambda K H
0.267 0.0410 5.40e-067
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 103,312,389
Number of Sequences: 919193
Number of extensions: 4205788
Number of successful extensions: 7680
Number of sequences better than 5.0e-02: 13
Number of HSP's better than 0.1 without gapping: 8
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 7660
Number of HSP's gapped (non-prelim): 13
length of query: 174
length of database: 288,098,425
effective HSP length: 112
effective length of query: 62
effective length of database: 185,148,809
effective search space: 11479226158
effective search space used: 11479226158
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 87 (38.1 bits)
Query= orf27 [7620-7819]
(198 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.320 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 91,180,404
Number of Sequences: 919193
Number of extensions: 2836423
Number of successful extensions: 6222
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 6222
Number of HSP's gapped (non-prelim): 0
length of query: 198
length of database: 288,098,425
effective HSP length: 114
effective length of query: 84
effective length of database: 183,310,423
effective search space: 15398075532
effective search space used: 15398075532
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 88 (38.5 bits)
Query= orf28 [7855-7934]
(78 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.313 0.128 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 40,670,857
Number of Sequences: 919193
Number of extensions: 1325222
Number of successful extensions: 2122
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2122
Number of HSP's gapped (non-prelim): 0
length of query: 78
length of database: 288,098,425
effective HSP length: 54
effective length of query: 24
effective length of database: 238,462,003
effective search space: 5723088072
effective search space used: 5723088072
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf29 [7934-8632]
(697 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_496184.1| (NM_063783) polycistic kidney disease prot... 42 0.023
sp|Q09625|YS8A_CAEEL Hypothetical 84.3 kDa protein ZK945.10... 42 0.023
ref|NP_506158.1| (NM_073757) F56H9.1.p [Caenorhabditis eleg... 42 0.030
>ref|NP_496184.1| (NM_063783) polycistic kidney disease protein 1 like with repeated
domains at the N-terminus [Caenorhabditis elegans]
emb|CAB70201.1| (Z48582) N-terminus contains five repeated domains and the
C-terminus shows similarity to the polycistic kidney
disease protein 1 (PIR accession number A44604),
contains similarity to Pfam domain: PF01477 (PLAT/LH2
domain), Score=130.4, E-value=1.1e-35>
emb|CAB70192.1| (Z48544) N-terminus contains five repeated domains and the
C-terminus shows similarity to the polycistic kidney
disease protein 1 (PIR accession number A44604),
contains similarity to Pfam domain: PF01477 (PLAT/LH2
domain), Score=130.4, E-value=1.1e-35>
Length = 3178
Score = 42.0 bits (97), Expect = 0.023
Identities = 55/238 (23%), Positives = 86/238 (36%), Gaps = 27/238 (11%)
Query: 470 ENTWRIFASHDFNETRSGNLGQASLHSLSTSLPDSVNVGTALPTSVTVEYNVT-----NH 524
E+T F S T S + Q S ++ TS P S + T++PT+ T E T ++
Sbjct: 321 ESTSTTFTSTASTSTSSTSTTQQSSSTI-TSSPSSTTLSTSIPTTTTPEITSTLSSLPDN 379
Query: 525 ADIRTLQLQIDGQSFTPPLPTQDGPQS---------YTATVTAISTTAPRNIITSLTGFR 575
A L +FT + T + T+T + ++TT P +T+ T
Sbjct: 380 AICSYLDETTTSTTFTTTMLTSTTTEEPSTSTTTTEVTSTSSTVTTTEPTTTLTTSTAST 439
Query: 576 RNGSALNTVTDSTAVVVPPPSVFWGRSASNNPATVDTATLQSAPRANRMTVGTGQAA--- 632
+ ++ P S S+S+ T T+T +S + TV T A
Sbjct: 440 STTEPSTSTVTTSPSTSPVTSTVTSSSSSSTTVTTPTST-ESTSTSPSSTVTTSTTAPST 498
Query: 633 --AGDYFIVLAPDSQAISKITDTV------LDQDVLSLFTKTEDVRQIAGLEYDAYVI 682
G P S A S ++ T S TK+E G D Y +
Sbjct: 499 STTGPSSSSSTPSSTASSSVSSTASSTQSSTSTQQSSTTTKSETTTSSDGTNPDFYFV 556
>sp|Q09625|YS8A_CAEEL Hypothetical 84.3 kDa protein ZK945.10 in chromosome II
pir||T21460 hypothetical protein ZK945.10 - Caenorhabditis elegans
Length = 796
Score = 42.0 bits (97), Expect = 0.023
Identities = 55/238 (23%), Positives = 86/238 (36%), Gaps = 27/238 (11%)
Query: 470 ENTWRIFASHDFNETRSGNLGQASLHSLSTSLPDSVNVGTALPTSVTVEYNVT-----NH 524
E+T F S T S + Q S ++ TS P S + T++PT+ T E T ++
Sbjct: 321 ESTSTTFTSTASTSTSSTSTTQQSSSTI-TSSPSSTTLSTSIPTTTTPEITSTLSSLPDN 379
Query: 525 ADIRTLQLQIDGQSFTPPLPTQDGPQS---------YTATVTAISTTAPRNIITSLTGFR 575
A L +FT + T + T+T + ++TT P +T+ T
Sbjct: 380 AICSYLDETTTSTTFTTTMLTSTTTEEPSTSTTTTEVTSTSSTVTTTEPTTTLTTSTAST 439
Query: 576 RNGSALNTVTDSTAVVVPPPSVFWGRSASNNPATVDTATLQSAPRANRMTVGTGQAA--- 632
+ ++ P S S+S+ T T+T +S + TV T A
Sbjct: 440 STTEPSTSTVTTSPSTSPVTSTVTSSSSSSTTVTTPTST-ESTSTSPSSTVTTSTTAPST 498
Query: 633 --AGDYFIVLAPDSQAISKITDTV------LDQDVLSLFTKTEDVRQIAGLEYDAYVI 682
G P S A S ++ T S TK+E G D Y +
Sbjct: 499 STTGPSSSSSTPSSTASSSVSSTASSTQSSTSTQQSSTTTKSETTTSSDGTNPDFYFV 556
>ref|NP_506158.1| (NM_073757) F56H9.1.p [Caenorhabditis elegans]
pir||T22808 hypothetical protein F56H9.1 - Caenorhabditis elegans
emb|CAA98949.1| (Z74473) predicted using Genefinder [Caenorhabditis elegans]
Length = 770
Score = 41.6 bits (96), Expect = 0.030
Identities = 30/119 (25%), Positives = 47/119 (39%), Gaps = 2/119 (1%)
Query: 498 STSLPDSVNVGTALPTSVTVEYNVTNHADIRTLQLQIDGQSFTPPLPTQDGPQSYTATV- 556
+T++P S T+ T+ TV T+ A T+ + T +P + T TV
Sbjct: 35 TTTVPVSTTTSTSSTTTTTVPSTTTSSATTTTVPSTTTSSTTTTTVPPTTTSTTTTTTVP 94
Query: 557 -TAISTTAPRNIITSLTGFRRNGSALNTVTDSTAVVVPPPSVFWGRSASNNPATVDTAT 614
T STT + + T + T T ST PP+ + + P T T+T
Sbjct: 95 PTTTSTTTTTTVPPTTTSTTTTTTVPPTTTSSTTTTTVPPTTTSSTTTTTVPPTTTTST 153
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.315 0.132 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 393,010,755
Number of Sequences: 919193
Number of extensions: 16972490
Number of successful extensions: 40467
Number of sequences better than 5.0e-02: 3
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 40426
Number of HSP's gapped (non-prelim): 17
length of query: 697
length of database: 288,098,425
effective HSP length: 127
effective length of query: 570
effective length of database: 171,360,914
effective search space: 97675720980
effective search space used: 97675720980
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 95 (41.2 bits)
Query= orf30 [8633-9724]
(1090 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.315 0.132 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 617,640,609
Number of Sequences: 919193
Number of extensions: 27205065
Number of successful extensions: 45361
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 45361
Number of HSP's gapped (non-prelim): 0
length of query: 1090
length of database: 288,098,425
effective HSP length: 131
effective length of query: 959
effective length of database: 167,684,142
effective search space: 160809092178
effective search space used: 160809092178
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 96 (41.6 bits)
Query= orf31 [9803-9961]
(157 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.334 0.148 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 57,689,308
Number of Sequences: 919193
Number of extensions: 1482263
Number of successful extensions: 3309
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3309
Number of HSP's gapped (non-prelim): 0
length of query: 157
length of database: 288,098,425
effective HSP length: 110
effective length of query: 47
effective length of database: 186,987,195
effective search space: 8788398165
effective search space used: 8788398165
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.6 bits)
S2: 86 (37.7 bits)
Query= orf32 [9966-10056]
(89 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.321 0.135 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 33,468,450
Number of Sequences: 919193
Number of extensions: 857689
Number of successful extensions: 2230
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2230
Number of HSP's gapped (non-prelim): 0
length of query: 89
length of database: 288,098,425
effective HSP length: 65
effective length of query: 24
effective length of database: 228,350,880
effective search space: 5480421120
effective search space used: 5480421120
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf33 [10060-10144]
(83 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.320 0.136 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 38,558,786
Number of Sequences: 919193
Number of extensions: 915232
Number of successful extensions: 1918
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 1918
Number of HSP's gapped (non-prelim): 0
length of query: 83
length of database: 288,098,425
effective HSP length: 59
effective length of query: 24
effective length of database: 233,866,038
effective search space: 5612784912
effective search space used: 5612784912
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf34 [10136-10216]
(79 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.311 0.130 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 36,208,040
Number of Sequences: 919193
Number of extensions: 848691
Number of successful extensions: 1280
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 1280
Number of HSP's gapped (non-prelim): 0
length of query: 79
length of database: 288,098,425
effective HSP length: 55
effective length of query: 24
effective length of database: 237,542,810
effective search space: 5701027440
effective search space used: 5701027440
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf35 [10206-10395]
(188 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_059586.1| (NC_002371) eac [Enterobacteria phage P22]... 39 0.027
>ref|NP_059586.1| (NC_002371) eac [Enterobacteria phage P22]
sp|Q03545|VEAC_BPP22 Eac protein
pir||S35280 eaC protein - phage P22
gb|AAC18880.1| (L06296) EaC [Enterobacteria phage P22]
gb|AAF75004.1| (AF217253) eac [Enterobacteria phage P22]
Length = 211
Score = 38.9 bits (89), Expect = 0.027
Identities = 33/166 (19%), Positives = 72/166 (42%), Gaps = 11/166 (6%)
Query: 2 FKYFQVNPDVGHYLVDKYNSTVNAERERIVSHFLSAVGAVGIVMVREW-GMPSRIEALVF 60
+ Y+ V + L+ Y+ T+N +R I++ VGA+ R W G +++ V+
Sbjct: 8 YDYYMVEGEDVKELIQSYD-TINDQRNSILTTAAEKVGAIAWTTARSWGGEGGLLQSFVW 66
Query: 61 PATHEICNMEGVKLTEHRQGQVVTF---DDNAPFAGHYYEAISGL----NAELVAYPDFS 113
+E +K + G+ V N Y + + + NA+L + P+++
Sbjct: 67 EKGYEFPCQITIKREDFLDGKRVVIARGKGNTKEGRAYNKELDAIMHNANAKLKSLPEWN 126
Query: 114 DWLHHTMGVTRYALAEKDDEAGTYKKIATH--SHLMADGRILFAVP 157
++ + G+ R + + + ++T+ H D ++FA+P
Sbjct: 127 YYITNHYGIMRTGIGGQSGRGFGFVMLSTYGGKHPQRDDCLIFAIP 172
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.319 0.134 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 107,980,291
Number of Sequences: 919193
Number of extensions: 4133389
Number of successful extensions: 7587
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 7587
Number of HSP's gapped (non-prelim): 1
length of query: 188
length of database: 288,098,425
effective HSP length: 113
effective length of query: 75
effective length of database: 184,229,616
effective search space: 13817221200
effective search space used: 13817221200
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 87 (38.1 bits)
Query= orf36 [10392-10568]
(175 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.318 0.137 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 100,113,883
Number of Sequences: 919193
Number of extensions: 3645935
Number of successful extensions: 6457
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 6457
Number of HSP's gapped (non-prelim): 0
length of query: 175
length of database: 288,098,425
effective HSP length: 112
effective length of query: 63
effective length of database: 185,148,809
effective search space: 11664374967
effective search space used: 11664374967
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 87 (38.1 bits)
Query= orf37 [10587-10650]
(62 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
sp|P06225|DPOL_BPSP2 DNA POLYMERASE >gi|67052|pir||DJBPS2 D... 51 3e-006
ref|NP_297973.1| (NC_002488) phage-related DNA polymerase [... 41 0.003
ref|NP_299570.1| (NC_002488) phage-related protein [Xylella... 41 0.003
ref|NP_299803.1| (NC_002488) phage-related DNA polymerase [... 41 0.003
ref|NP_051006.1| (NC_000935) P45 [Acyrthosiphon pisum bacte... 40 0.004
gb|AAL82293.1| (AF424782) DNA polymerase A domain [Staphylo... 40 0.007
>sp|P06225|DPOL_BPSP2 DNA POLYMERASE
pir||DJBPS2 DNA-directed DNA polymerase (EC 2.7.7.7) - phage SPO2
gb|AAA32600.1| (K02752) DNA polymerase (gene L; ttg start codon) [Bacteriophage
SPO2]
emb|CAA25691.1| (X01458) SP02 DNA polymerase (aa 1-648) [Bacteriophage SPO2]
Length = 648
Score = 50.8 bits (120), Expect = 3e-006
Identities = 22/62 (35%), Positives = 35/62 (55%)
Query: 1 MKKCENNGYPVVMHTHDELIAEVPEDFGSVEEMAALMTEREPWRAWWPIRAAGWQDDRYQ 60
+ + +N GY VMH HDE + +VP ++++ A+M E W P+ A G+ D Y+
Sbjct: 587 LMRLDNAGYKTVMHVHDEAVLDVPRGKNELDKVEAIMGEPISWAKGLPLTADGFVTDYYK 646
Query: 61 KD 62
KD
Sbjct: 647 KD 648
>ref|NP_297973.1| (NC_002488) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
pir||H82774 phage-related DNA polymerase XF0683 [imported] - Xylella fastidiosa
(strain 9a5c)
gb|AAF83493.1|AE003912_5 (AE003912) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
Length = 726
Score = 40.8 bits (94), Expect = 0.003
Identities = 22/62 (35%), Positives = 31/62 (49%), Gaps = 1/62 (1%)
Query: 1 MKKCENNGYPVVMHTHDELIAEVPED-FGSVEEMAALMTEREPWRAWWPIRAAGWQDDRY 59
M E GY +V+ HDE+I E + + +AALM PW P+ A G+ RY
Sbjct: 664 MPAIEAAGYSIVLTVHDEIITEADDHPCFNAAHLAALMATPPPWAEGLPLAAEGFDTYRY 723
Query: 60 QK 61
+K
Sbjct: 724 RK 725
>ref|NP_299570.1| (NC_002488) phage-related protein [Xylella fastidiosa 9a5c]
pir||H82574 phage-related protein XF2291 [imported] - Xylella fastidiosa
(strain 9a5c)
gb|AAF85090.1|AE004041_2 (AE004041) phage-related protein [Xylella fastidiosa 9a5c]
Length = 252
Score = 40.8 bits (94), Expect = 0.003
Identities = 22/62 (35%), Positives = 31/62 (49%), Gaps = 1/62 (1%)
Query: 1 MKKCENNGYPVVMHTHDELIAEVPED-FGSVEEMAALMTEREPWRAWWPIRAAGWQDDRY 59
M E GY +V+ HDE+I E + + +AALM PW P+ A G+ RY
Sbjct: 190 MPAIEAAGYSIVLTVHDEIITEADDHPCFNAAHLAALMATPPPWAEGLPLAAEGFDTYRY 249
Query: 60 QK 61
+K
Sbjct: 250 RK 251
>ref|NP_299803.1| (NC_002488) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
pir||C82548 phage-related DNA polymerase XF2525 [imported] - Xylella fastidiosa
(strain 9a5c)
gb|AAF85323.1|AE004059_13 (AE004059) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
Length = 726
Score = 40.8 bits (94), Expect = 0.003
Identities = 22/62 (35%), Positives = 31/62 (49%), Gaps = 1/62 (1%)
Query: 1 MKKCENNGYPVVMHTHDELIAEVPED-FGSVEEMAALMTEREPWRAWWPIRAAGWQDDRY 59
M E GY +V+ HDE+I E + + +AALM PW P+ A G+ RY
Sbjct: 664 MPAIEAAGYSIVLTVHDEIITEADDHPCFNAAHLAALMATPPPWAEGLPLAAEGFDTYRY 723
Query: 60 QK 61
+K
Sbjct: 724 RK 725
>ref|NP_051006.1| (NC_000935) P45 [Acyrthosiphon pisum bacteriophage APSE-1]
sp|Q9T1Q3|DPOL_BPAPS Probable DNA polymerase (P45)
gb|AAF03988.1|AF157835_45 (AF157835) P45 [Acyrthosiphon pisum bacteriophage APSE-1]
Length = 993
Score = 40.4 bits (93), Expect = 0.004
Identities = 21/63 (33%), Positives = 35/63 (55%), Gaps = 1/63 (1%)
Query: 1 MKKCENNGYPVVMHTHDELIAEVPE-DFGSVEEMAALMTEREPWRAWWPIRAAGWQDDRY 59
M E GY +V+ HDE+I+E P+ S E ++ L++ W P+ A G++ RY
Sbjct: 931 MPPIEKAGYEIVLTVHDEIISEAPDTPQFSAEGLSKLLSFNSDWAWDLPLSANGFETYRY 990
Query: 60 QKD 62
+K+
Sbjct: 991 RKE 993
>gb|AAL82293.1| (AF424782) DNA polymerase A domain [Staphylococcus aureus phage phi
12]
Length = 650
Score = 39.7 bits (91), Expect = 0.007
Identities = 17/60 (28%), Positives = 30/60 (49%)
Query: 3 KCENNGYPVVMHTHDELIAEVPEDFGSVEEMAALMTEREPWRAWWPIRAAGWQDDRYQKD 62
+ E G+ +V H HDE+I E+P ++E+ +M + W + + G+ Y KD
Sbjct: 591 RLEALGFKIVGHVHDEVIVEIPRGSNGLKEIETIMNKPVDWAKGLNLNSDGFTSPFYMKD 650
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.317 0.133 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 38,868,347
Number of Sequences: 919193
Number of extensions: 1180621
Number of successful extensions: 3912
Number of sequences better than 5.0e-02: 6
Number of HSP's better than 0.1 without gapping: 2
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 3907
Number of HSP's gapped (non-prelim): 6
length of query: 62
length of database: 288,098,425
effective HSP length: 38
effective length of query: 24
effective length of database: 253,169,091
effective search space: 6076058184
effective search space used: 6076058184
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 84 (37.0 bits)
Query= orf38 [10649-11364]
(714 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
sp|P06225|DPOL_BPSP2 DNA POLYMERASE >gi|67052|pir||DJBPS2 D... 223 4e-057
gb|AAL82293.1| (AF424782) DNA polymerase A domain [Staphylo... 211 2e-053
ref|NP_051006.1| (NC_000935) P45 [Acyrthosiphon pisum bacte... 154 3e-036
ref|NP_299569.1| (NC_002488) phage-related protein [Xylella... 79 2e-013
gb|AAA23103.1| (M94320) putative DNA polymerase [Citrus gre... 60 8e-008
gb|AAB25243.1| DNA polymerase homolog [bacterium-like organ... 60 8e-008
sp|Q9S1G2|DPO1_RHILE DNA polymerase I (POL I) >gi|5596366|g... 44 0.006
ref|NP_254180.1| (NC_002516) DNA polymerase I [Pseudomonas ... 43 0.010
ref|NP_384281.1| (NC_003047) PROBABLE DNA POLYMERASE I PROT... 43 0.014
ref|NP_353146.1| (NC_003062) AGR_C_169p [Agrobacterium tume... 42 0.030
>sp|P06225|DPOL_BPSP2 DNA POLYMERASE
pir||DJBPS2 DNA-directed DNA polymerase (EC 2.7.7.7) - phage SPO2
gb|AAA32600.1| (K02752) DNA polymerase (gene L; ttg start codon) [Bacteriophage
SPO2]
emb|CAA25691.1| (X01458) SP02 DNA polymerase (aa 1-648) [Bacteriophage SPO2]
Length = 648
Score = 223 bits (569), Expect = 4e-057
Identities = 197/633 (31%), Positives = 284/633 (44%), Gaps = 98/633 (15%)
Query: 72 GTPVYAEHPSTEILCLYYDLKDGKGRRAYIPGGPK-PTDLIEHIANGGPI-AAWNITFEF 129
G Y E P EIL Y D + + G P +++ + + I A+N FE
Sbjct: 20 GVYAYTEAPDFEILLFAYAFDDDPVKIIDLAQGDTLPHEVLVALTSSKVIKTAYNANFE- 78
Query: 130 WIWNMVLAKREGWPWLQLEQCYCDMAKARRFSLPGSLDAAAQALKTSP-KDKKGKQLINK 188
+AK L Q C A LPG+LD A+ALK S KDK GK LI
Sbjct: 79 ---RTCIAKHFNLMLLPA-QWRCTAVHATTLGLPGNLDGVAKALKLSAQKDKAGKALIRY 134
Query: 189 LCRPVSATKA-----RPLPRWTLYTAPQDYSDLYAYCDQDVKSEDHVSALTPDMTP--SE 241
P TKA R LP P+ + YC QDV+ E + P SE
Sbjct: 135 FSVPCKPTKANGQRVRNLPEHD----PEKWEKFKVYCIQDVEVERAIKNRISKFEPLESE 190
Query: 242 RETWLADQRINARGVQVDVESLDNALDIVDQTTRRFTMELAAITNGAVGSVSEVAKMTDF 301
+ W DQ IN RGV++DV+ + +A+ +Q E +T + + + A++ +
Sbjct: 191 HKLWALDQEINDRGVRIDVDLVKHAIACDEQYQAGLIAEAKKLTG--LPNPNSTAQLKKW 248
Query: 302 LAGLGCRMHNLTADSVSEALERDDLNPTARRVLEIREALAGANVKKLFSLKAQISSDGRL 361
L G + +L D + E +E + + T RVL +R+ +A +VKK +++ + D R+
Sbjct: 249 LEEKGLTISSLAKDKIEELIENTN-DETVHRVLRLRQEMAKTSVKKYLAMEKALCPDNRV 307
Query: 362 RNQYNFFGAG-TGRWSAGGVQLQNLTSKGPKSKTCNGCGRIVGKACTIEVMGATGLCPEC 420
R F+GA TGRW+ VQ+QNL
Sbjct: 308 RGLLQFYGASRTGRWAGRLVQVQNLPQ--------------------------------- 334
Query: 421 GGNEWTDNKDWTVDAVRWALKDLTHRNLDLIIAIWGDPITLLSGCLRGLFIAAPGKKFVC 480
NK +D R LK + ++L+ +G +LS +R FI + G +F
Sbjct: 335 -------NKIEDLDTARNLLKGGHYEAIELL---YGQVPFVLSQLVRTAFIPSEGNEFYV 384
Query: 481 CDFSAIEAVVLAALARCEWRLEVFRTHGKIYEMSASKISGVPFEEMMAYKEQNKHDHPLR 540
DFSAIEA V+A LA EWRLEVF THGKIYE SA+++ VP E + PLR
Sbjct: 385 SDFSAIEARVIAWLAGEEWRLEVFNTHGKIYEASAAQMFKVPVESI-------TKGSPLR 437
Query: 541 KSLGKVAELASGYGGWLGAWKAFGAEAHFADDDEMKKAILGWRDASPEIVEFWGGQYRRD 600
+ GKVAELA GY G GA GA + E+ + + WR A+ +IV+FW
Sbjct: 438 QK-GKVAELALGYQGGKGALIQMGALNMGLAEGELPELVKAWRTANKKIVKFW------- 489
Query: 601 PFTRKWTKEYFGLEGAAIMAILNPGKCYAVHDITYAVHDDVLYCRLPSGRFLQYHQPRLV 660
+ +E AAI A+ H +T+ +L+ +LPSGR L Y +P+L
Sbjct: 490 ----------YDVEAAAIKAVKERKPVKLQHGLTFLYESGILFVQLPSGRRLAYAKPKL- 538
Query: 661 ESDSWKGPEYQITFMGYNTNSQKGPVGWVRMDT 693
E D G E +T+ G + + W R++T
Sbjct: 539 ELDERFGKE-ALTYEGKLESGK-----WGRLNT 565
>gb|AAL82293.1| (AF424782) DNA polymerase A domain [Staphylococcus aureus phage phi
12]
Length = 650
Score = 211 bits (537), Expect = 2e-053
Identities = 199/649 (30%), Positives = 292/649 (44%), Gaps = 103/649 (15%)
Query: 61 KTGNKNGIALVGTPVYAEHPSTEILCLYYDLKDGKGRRAY----IPGGPKPTDLIEH-IA 115
+T + N I+ G Y E EIL + Y + DG A + P D IA
Sbjct: 6 ETYSSNDISKCGAYKYTEAEDFEILIIAYSI-DGGAISAIDMTKVDNEPFHADYETFKIA 64
Query: 116 NGGPIA---AWNITFEFWIWNMVLAKREGWPWLQLEQCYCDMAKARRFSLPGSLDAAAQA 172
P A+N FE LAK + E+ C M + R LP SLD +
Sbjct: 65 LFDPAVKKYAFNANFE----RTCLAKHFN-KQMPPEEWICTMVNSMRIGLPASLDKVGEV 119
Query: 173 LKT-SPKDKKGKQLINKL---CRP--VSATKARPLPRWTLYTAPQDYSDLYAYCDQDVKS 226
L+ S KDK GK LI C+P V+ + R LP L + + YC +DV+
Sbjct: 120 LRLQSQKDKAGKNLIRYFSIPCKPTKVNGGRTRNLPEHDL----EKWQQFIDYCIRDVEV 175
Query: 227 EDHVSALTPD--MTPSERETWLADQRINARGVQVDVESLDNALDIVDQTTRRFTMELAAI 284
E ++ D +T E+ W+ DQ IN RG+++ +SL +++D+ ++ ++ A
Sbjct: 176 EMAIANKIKDFPVTAIEQTYWVFDQHINDRGIKLS-KSLMLGANVLDKQSKEELLKQAKH 234
Query: 285 TNGAVGSVSEVAKMTDFLAGLGCRMHNLTADSVSEALERDDLNPTARRVLEIREALAGAN 344
G S + G + NL +V E L+ + A+++LEIR ++ +
Sbjct: 235 ITGLENPNSPTQLLAWLKDEQGLDIPNLQKKTVQEYLK--EATGKAKKMLEIRLQMSKTS 292
Query: 345 VKKLFSLKAQISSDGRLRNQYNFFGAGTGRWSAGGVQLQNLTSKGPKSKTCNGCGRIVGK 404
VKK + + SD R+R + F+GAGTGRW+ GVQLQNLT
Sbjct: 293 VKKYNKMHDMMCSDERVRGLFQFYGAGTGRWAGRGVQLQNLTKHYIS------------- 339
Query: 405 ACTIEVMGATGLCPECGGNEWTDNKDWTVDAVRWALKDLTHRNLDLIIAIWGDPITLLSG 464
D ++ R +K+ +LDL++ + P LLS
Sbjct: 340 -------------------------DTELEIARDLIKEQRFDDLDLLLNV--HPQDLLSQ 372
Query: 465 CLRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFRTHGKIYEMSASKISGVPFE 524
+R F A G + DFSAIEA V+A A+ +WRL+VF THGKIYE SAS++ VP E
Sbjct: 373 LVRTTFTAEEGNELAVSDFSAIEARVIAWYAKEQWRLDVFNTHGKIYEASASQMFNVPVE 432
Query: 525 EMMAYKEQNKHDHPLRKSLGKVAELASGYGGWLGAWKAFGAEAHFADDDEMKKAILGWRD 584
+ K D PLR+ GKV+ELA GY G GA KA GA +++E++ + WR+
Sbjct: 433 SI------TKGD-PLRQK-GKVSELALGYQGGAGALKAMGALEMGIEENELQGLVDSWRN 484
Query: 585 ASPEIVEFWGGQYRRDPFTRKWTKEYFGLEGAAIMAILNPGKCYAVHDITYAVHDDVLYC 644
A+P IV FW + AAI + K + H + + + L
Sbjct: 485 ANPNIVNFWK-----------------ACQEAAINTV-KSRKTHHTHGLRFYMKKGFLMI 526
Query: 645 RLPSGRFLQYHQPRLVESDSWKGPEYQITFMGYNTNSQKGPVGWVRMDT 693
LPSGR L Y + LV +SW + FMG + N + W ++ T
Sbjct: 527 ELPSGRALAYPK-ALVGENSW--GSQVVEFMGLDLNRK-----WSKLKT 567
>ref|NP_051006.1| (NC_000935) P45 [Acyrthosiphon pisum bacteriophage APSE-1]
sp|Q9T1Q3|DPOL_BPAPS Probable DNA polymerase (P45)
gb|AAF03988.1|AF157835_45 (AF157835) P45 [Acyrthosiphon pisum bacteriophage APSE-1]
Length = 993
Score = 154 bits (389), Expect = 3e-036
Identities = 162/622 (26%), Positives = 257/622 (41%), Gaps = 107/622 (17%)
Query: 107 PTDLIEHIANGGPIAAWNITFEFWIWNMVLAKREGWPWLQLEQCYCDMAKARRFSLPGSL 166
P DL ++ + + W+ +++ V+ KR L L + + + +A LPG+L
Sbjct: 54 PADLKAYLDDSEILTVWH---NGGMFDTVILKRVLNIDLPLSRVHDTLVQALAHGLPGAL 110
Query: 167 DAAAQALKTSP---KDKKGKQLINKLCRPVSATKARPLPRWTLYTAPQDYSDLYAYCDQD 223
+ KDK+GK LI+ LC+P K + R T T +++ Y D
Sbjct: 111 GLLCDIFNVNSDKAKDKEGKALISLLCKP--RPKNSKIQRATALTHAEEWQRFKDYAGSD 168
Query: 224 VKSEDHVSALTPD--MTPSERETWLADQRINARGVQVDVESLDNALDIVDQTTRRFTMEL 281
+ + + P+ M E E W DQ+IN RG+ +DVE +AL V+ +R +
Sbjct: 169 ILAMREIYQHLPNWNMNVHETELWQLDQKINRRGMCMDVELAKSALTAVENEQKRLSTVT 228
Query: 282 AAITNGAVGSVSEVAKMTDFLA-GLGCRMHNLTADSVSEALERDDLNPTARRVLEIREAL 340
+T+ AV + ++ + +A G + ++ A ++ + D+ P R +L +R
Sbjct: 229 QQLTDNAVQNATQRDALLQHIASAFGITLPDMQASTLQRRINDPDIPPALRELLSVRLQS 288
Query: 341 AGANVKKLFSLKAQISSDGRLRNQYNFFGAG-TGRWSAGGVQLQNLTSKGPKSKTCNGCG 399
+ K +L +S+DGRLR F GA TGRW+ Q NL T
Sbjct: 289 CTTSTSKYKALLKSVSADGRLRGTKQFCGASRTGRWAGRVFQPDNLPRPTLDPNTI---- 344
Query: 400 RIVGKACTIEVMGATGLCPECGGNEWTDNKDWTVDAVRWALKDLTHRNLDLIIAIWGDPI 459
D ++A++ +L I D +
Sbjct: 345 ------------------------------DNGIEALKAGCAEL----------ICDDIM 364
Query: 460 TLLSGCLRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFR--THGK---IYEMS 514
L S LRG IA PGKK V D S IE +LA LA W++ F +GK +Y+++
Sbjct: 365 QLTSSALRGCIIAPPGKKLVISDLSNIEGRMLAWLAGENWKVNAFSEFDNGKGDDLYKLA 424
Query: 515 ASKISGVPFEEMMAYKEQNKHDHPLRKSLGKVAELASGYGGWLGAWKAFGAEAHFADDDE 574
++ + E++ KEQ + +GKV EL GYGG + A+ F A A+ D DE
Sbjct: 425 YARAFNLLPEDVT--KEQ--------RQIGKVMELGLGYGGGVAAFLTF-ALAYGLDLDE 473
Query: 575 M------------KKAILGWRDASPEIVEFWGGQYR----RDPFTRKWTKE-------YF 611
+ K+ + W S E + +G + D R W ++
Sbjct: 474 LAEAALPNIPPGVKREAMRWYQKSVETDKTYGLSEKIFVTCDSLKRMWRNAHPQTVSFWY 533
Query: 612 GLEGAAIMAILNPGKCYAVHDITYAVHDDVLYCRLPSGRFLQYHQPRLVESDSWKGPEYQ 671
+E A AI +PG + ++ L LPSGR L Y R+ Q
Sbjct: 534 DIEDAVKQAIQSPGIAFKCRKLSVRRDKSWLRICLPSGRSLCYPSARIENG--------Q 585
Query: 672 ITFMGYNTNSQKGPVGWVRMDT 693
IT+MG N S+K W R+ T
Sbjct: 586 ITYMGTNPYSRK----WERLKT 603
>ref|NP_299569.1| (NC_002488) phage-related protein [Xylella fastidiosa 9a5c]
pir||G82574 phage-related protein XF2290 [imported] - Xylella fastidiosa
(strain 9a5c)
gb|AAF85089.1|AE004041_1 (AE004041) phage-related protein [Xylella fastidiosa 9a5c]
Length = 312
Score = 78.6 bits (192), Expect = 2e-013
Identities = 78/298 (26%), Positives = 118/298 (39%), Gaps = 48/298 (16%)
Query: 212 DYSDLYAYCDQDVKSEDHVSALTP--DMTPSERETWLADQRINARGVQVDVESLDNALDI 269
D++ Y +DV + V P + +E W DQ IN RGV +D + + A+
Sbjct: 2 DWAQFVEYARRDVAAMRDVVKRLPSHNYAGAELALWFLDQTINDRGVLIDTDLVQAAIGA 61
Query: 270 VDQTTRRFTMELAAITNGAVGSVSEVAKMTDFL-AGLGCRMHNLTADSVSEALERDDLNP 328
V++ + A+T GAV + ++ + L G + ++ +V+ ++ L
Sbjct: 62 VERAKQTLAQRTEALTGGAVQAATQRDALLQHLNTAHGVALPDMQQHTVARCVDDPALPE 121
Query: 329 TARRVLEIREALAGANVKKLFSLKAQISSDGRLRNQYNFFGAG-TGRWSAGGVQLQNLTS 387
T R +L IR + + K +L S DGRLR F GA TGRW+ Q NL
Sbjct: 122 TVRELLSIRRQASSTSTAKYQALLHCTSPDGRLRGTLQFNGASRTGRWAGRLFQPHNLP- 180
Query: 388 KGPKSKTCNGCGRIVGKACTIEVMGATGLCPECGGNEWTDNKDWTVDAVRWALKDLTHRN 447
+ + + A G +DA++ DL
Sbjct: 181 ----------------RPTLSQEVIAVG-----------------IDAMKAGCVDL---- 203
Query: 448 LDLIIAIWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFR 505
++ D + L S LR IA KK V D S IE VLA LA +L FR
Sbjct: 204 ------VFDDVMALTSSALRSCLIAPTHKKLVVADLSIIEGRVLAWLAGETPKLHAFR 255
>gb|AAA23103.1| (M94320) putative DNA polymerase [Citrus greening
disease-associated bacterium]
Length = 206
Score = 60.1 bits (144), Expect = 8e-008
Identities = 49/166 (29%), Positives = 79/166 (47%), Gaps = 26/166 (15%)
Query: 107 PTDLIEHIANGGPIA-AWNITFEFWIWNMVLAKREGWPWLQLEQCYCDMAKARRFSLPGS 165
P+DL++++ + + A N FE +L K+ + ++ C AR LP S
Sbjct: 54 PSDLLQYLRDETVMCVAHNSLFE-----RILFKKTLGIDIPSKRWICTSVLARTNGLPSS 108
Query: 166 LDAAAQALKTSP---KDKKGKQLINKLCR------PVSATKARPLPRWTLYTAPQDYSDL 216
L A AL+ K ++GK LI + C+ P T+A + W L+
Sbjct: 109 LKNACLALRFPEHLTKKEEGKALIARFCKGSIDSPPYDCTRANHVQAWALFRE------- 161
Query: 217 YAYCDQDVKSEDHV-SALTPDMTPSERETWLADQRINARGVQVDVE 261
YC +DV++ + +LTP + P ER WL DQ IN RG ++D++
Sbjct: 162 --YCKRDVEATRELFKSLTP-LNPEERTLWLLDQEINDRGYRIDLD 204
>gb|AAB25243.1| DNA polymerase homolog [bacterium-like organism, citrus greening
disease-associated, Peptide, 207 aa]
Length = 207
Score = 60.1 bits (144), Expect = 8e-008
Identities = 49/166 (29%), Positives = 79/166 (47%), Gaps = 26/166 (15%)
Query: 107 PTDLIEHIANGGPIA-AWNITFEFWIWNMVLAKREGWPWLQLEQCYCDMAKARRFSLPGS 165
P+DL++++ + + A N FE +L K+ + ++ C AR LP S
Sbjct: 55 PSDLLQYLRDETVMCVAHNSLFE-----RILFKKTLGIDIPSKRWICTSVLARTNGLPSS 109
Query: 166 LDAAAQALKTSP---KDKKGKQLINKLCR------PVSATKARPLPRWTLYTAPQDYSDL 216
L A AL+ K ++GK LI + C+ P T+A + W L+
Sbjct: 110 LKNACLALRFPEHLTKKEEGKALIARFCKGSIDSPPYDCTRANHVQAWALFRE------- 162
Query: 217 YAYCDQDVKSEDHV-SALTPDMTPSERETWLADQRINARGVQVDVE 261
YC +DV++ + +LTP + P ER WL DQ IN RG ++D++
Sbjct: 163 --YCKRDVEATRELFKSLTP-LNPEERTLWLLDQEINDRGYRIDLD 205
>sp|Q9S1G2|DPO1_RHILE DNA polymerase I (POL I)
gb|AAD45559.1|U86403_1 (U86403) DNA polymerase I [Rhizobium leguminosarum]
Length = 1016
Score = 43.9 bits (102), Expect = 0.006
Identities = 24/61 (39%), Positives = 34/61 (55%)
Query: 466 LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFRTHGKIYEMSASKISGVPFEE 525
+R FI+ PG K + D+S IE VLA +A + F I+ M+AS++ GVP E
Sbjct: 771 IRTAFISTPGHKLISADYSQIELRVLAHVAEIPQLTKAFEDGVDIHAMTASEMFGVPVEG 830
Query: 526 M 526
M
Sbjct: 831 M 831
>ref|NP_254180.1| (NC_002516) DNA polymerase I [Pseudomonas aeruginosa]
pir||F82958 DNA polymerase I PA5493 [imported] - Pseudomonas aeruginosa
(strain PAO1)
gb|AAG08878.1|AE004962_2 (AE004962) DNA polymerase I [Pseudomonas aeruginosa]
Length = 913
Score = 43.1 bits (100), Expect = 0.010
Identities = 20/61 (32%), Positives = 37/61 (59%)
Query: 466 LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFRTHGKIYEMSASKISGVPFEE 525
+R F+A G K + D+S IE ++A LA+ + L+ FR ++ +A+++ GVP E+
Sbjct: 674 IRQAFVAPQGYKLLAADYSQIELRIMAHLAKDDGLLDAFRHDLDVHRATAAEVFGVPLED 733
Query: 526 M 526
+
Sbjct: 734 V 734
>ref|NP_384281.1| (NC_003047) PROBABLE DNA POLYMERASE I PROTEIN [Sinorhizobium
meliloti]
emb|CAC41562.1| (AL591782) PROBABLE DNA POLYMERASE I PROTEIN [Sinorhizobium
meliloti]
Length = 1004
Score = 42.7 bits (99), Expect = 0.014
Identities = 24/61 (39%), Positives = 34/61 (55%)
Query: 466 LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFRTHGKIYEMSASKISGVPFEE 525
+R F+A PG K V D+S IE VLA +A + F I+ M+AS++ GVP +
Sbjct: 761 IRTAFVATPGHKLVSADYSQIELRVLAHVADIPQLRQAFADGVDIHAMTASEMFGVPVDG 820
Query: 526 M 526
M
Sbjct: 821 M 821
>ref|NP_353146.1| (NC_003062) AGR_C_169p [Agrobacterium tumefaciens] [Agrobacterium
tumefaciens str. C58 (Cereon)]
ref|NP_530819.1| (NC_003304) DNA polymerase I [Agrobacterium tumefaciens str. C58
(U. Washington)]
gb|AAK85931.1| (AE007952) AGR_C_169p [Agrobacterium tumefaciens str. C58 (Cereon)]
gb|AAL41135.1| (AE008985) DNA polymerase I [Agrobacterium tumefaciens str. C58 (U.
Washington)]
Length = 1031
Score = 41.6 bits (96), Expect = 0.030
Identities = 24/61 (39%), Positives = 33/61 (53%)
Query: 466 LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFRTHGKIYEMSASKISGVPFEE 525
+R FI+ PG K + D+S IE VLA +A F I+ M+AS++ GVP E
Sbjct: 788 IRTAFISTPGHKLLSADYSQIELRVLAHVADIPQLRNAFENGIDIHAMTASEMFGVPVEG 847
Query: 526 M 526
M
Sbjct: 848 M 848
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.319 0.135 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 441,678,739
Number of Sequences: 919193
Number of extensions: 19680397
Number of successful extensions: 40735
Number of sequences better than 5.0e-02: 14
Number of HSP's better than 0.1 without gapping: 8
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 40679
Number of HSP's gapped (non-prelim): 30
length of query: 714
length of database: 288,098,425
effective HSP length: 128
effective length of query: 586
effective length of database: 170,441,721
effective search space: 99878848506
effective search space used: 99878848506
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 95 (41.2 bits)
Query= orf39 [11363-11534]
(170 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.318 0.132 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 98,850,707
Number of Sequences: 919193
Number of extensions: 3900634
Number of successful extensions: 8824
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 8824
Number of HSP's gapped (non-prelim): 0
length of query: 170
length of database: 288,098,425
effective HSP length: 112
effective length of query: 58
effective length of database: 185,148,809
effective search space: 10738630922
effective search space used: 10738630922
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 86 (37.7 bits)
Query= orf40 [11555-11893]
(337 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.319 0.138 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 117,136,152
Number of Sequences: 919193
Number of extensions: 5115288
Number of successful extensions: 8711
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 8711
Number of HSP's gapped (non-prelim): 0
length of query: 337
length of database: 288,098,425
effective HSP length: 121
effective length of query: 216
effective length of database: 176,876,072
effective search space: 38205231552
effective search space used: 38205231552
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 91 (39.7 bits)
Query= orf41 [11897-12248]
(350 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.318 0.135 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 206,383,156
Number of Sequences: 919193
Number of extensions: 8667353
Number of successful extensions: 19229
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 19229
Number of HSP's gapped (non-prelim): 0
length of query: 350
length of database: 288,098,425
effective HSP length: 121
effective length of query: 229
effective length of database: 176,876,072
effective search space: 40504620488
effective search space used: 40504620488
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 91 (39.7 bits)
Query= orf42 [12274-12562]
(287 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.309 0.127 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 170,952,616
Number of Sequences: 919193
Number of extensions: 7307020
Number of successful extensions: 15722
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 15722
Number of HSP's gapped (non-prelim): 0
length of query: 287
length of database: 288,098,425
effective HSP length: 119
effective length of query: 168
effective length of database: 178,714,458
effective search space: 30024028944
effective search space used: 30024028944
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 90 (39.3 bits)
Query= orf43 [12656-12742]
(85 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.325 0.139 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 49,758,466
Number of Sequences: 919193
Number of extensions: 1709365
Number of successful extensions: 3548
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3548
Number of HSP's gapped (non-prelim): 0
length of query: 85
length of database: 288,098,425
effective HSP length: 61
effective length of query: 24
effective length of database: 232,027,652
effective search space: 5568663648
effective search space used: 5568663648
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 84 (37.0 bits)
Query= orf44 [12739-13290]
(550 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAB63670.1| (AJ251789) hypothetical protein [Lactobacil... 199 6e-050
ref|NP_511031.1| (NC_003291) helicase [Bacteriophage PSA] >... 189 7e-047
ref|NP_050128.1| (NC_000896) putative DEAH-family helicase ... 187 3e-046
ref|NP_049952.1| (NC_000871) putative helicase [Streptococc... 185 1e-045
ref|NP_049999.1| (NC_000872) putative helicase [Streptococc... 185 1e-045
pir||T13299 probable helicase - Streptococcus phage phi-O12... 185 1e-045
emb|CAC04160.1| (AJ292531) putative helicase [Lactococcus b... 183 4e-045
gb|AAL74161.1|AF442520_1 (AF442520) putative helicase [Stre... 183 5e-045
ref|NP_471919.1| (NC_003212) similar to DEAH-family helicas... 177 2e-043
sp|P24125|V51K_BPL79 51.5 kDa protein >gi|76364|pir||JU0445... 174 2e-042
ref|NP_607398.1| (NC_003485) hypothetical phage protein [St... 164 3e-039
ref|NP_490263.1| (NC_003276) putative helicase [Nostoc sp. ... 155 8e-037
ref|NP_231273.1| (NC_002505) helicase-related protein [Vibr... 110 5e-023
ref|NP_142208.1| (NC_000961) hypothetical protein [Pyrococc... 77 5e-013
gb|AAF12716.1|AF066865_14 (AF066865) unknown [bacteriophage... 66 1e-009
ref|NP_559418.1| (NC_003364) helicase, possible DNA repair ... 65 2e-009
ref|NP_602261.1| (NC_003450) COG1061:DNA or RNA helicases o... 61 4e-008
ref|NP_248512.1| (NC_000909) ATP dependent RNA helicase, pu... 55 2e-006
sp|O02494|IF4A_CRYPV Eukaryotic initiation factor 4A (eIF-4... 50 6e-005
ref|NP_421775.1| (NC_002696) excinuclease ABC, subunit B [C... 48 3e-004
ref|NP_375993.1| (NC_003106) 621aa long conserved hypotheti... 44 0.006
gb|AAM02248.1| (AE010392) Lhr-like Superfamily II helicase ... 40 0.050
>emb|CAB63670.1| (AJ251789) hypothetical protein [Lactobacillus casei bacteriophage
A2]
Length = 455
Score = 199 bits (506), Expect = 6e-050
Identities = 139/422 (32%), Positives = 205/422 (47%), Gaps = 52/422 (12%)
Query: 1 MFKLRDYQQLAYNRI---MAAWQEYRSVLAVLPTGAGKTVIFSKIIH---DHVGAAAAIV 54
MF+L YQ+ N+ +AA +SVL V P G+GK+VI ++I + G +V
Sbjct: 1 MFQLHPYQKELVNQAREKLAAGN--KSVLLVSPAGSGKSVIIAEIARLAVERSGHVMFMV 58
Query: 55 HRREIVGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLT 114
HR+E+V QI Q Q I +Q T
Sbjct: 59 HRQELVNQIV------------------------------------QTFQADEIDLQATT 82
Query: 115 SKSTMNDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXX 174
+ A + + + +L + DE HH + + + K + L +A+P R +
Sbjct: 83 IMTVGKIANRLNRLPRPSLIITDESHHSLAK-TYRKIYSFYADVPRLGFSASPWRMNGQG 141
Query: 175 XXXXXXXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKAR 234
++M+EGP+ WLI+ +L+ + Y AP T ID++ L GD+ K++ A
Sbjct: 142 LGNVY----ESMVEGPSVKWLIDHQYLAPYDYYAP-TLIDVQKLQTSSTGDYTNKSMDAA 196
Query: 235 VVESHIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERD 294
V ++ I GDVV HY+ ++AIV+A ++ ++Q+ E F+ AG TA +T G+RD
Sbjct: 197 VPKA-IFGDVVSHYQHLAGGRQAIVYAHSIEASKQVVEAFQSAGITAVHADAKTPKGQRD 255
Query: 295 RLLEEFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEY 354
R++ +F+ G I IL NVDL EGF+VP V +I+ RPT SL +Q R +R GK
Sbjct: 256 RIMADFKAGKITILSNVDLISEGFNVPDVGVIIMLRPTASLVLDIQQSMRGMRYKPGKR- 314
Query: 355 AIIIDPVRNWERHGPPTAVRTWTLKGREKGQSSGSDTVPQKTCDACTIPYEAYNKACPYC 414
+IIID V N R G P WTL R K + + P KTCDAC K CP C
Sbjct: 315 SIIIDHVANAYRFGLPDTEHEWTLNDRPKKKKRDNTGPPIKTCDACYAVIPVQCKICPIC 374
Query: 415 GH 416
GH
Sbjct: 375 GH 376
>ref|NP_511031.1| (NC_003291) helicase [Bacteriophage PSA]
emb|CAC85606.1| (AJ312240) helicase [Bacteriophage PSA]
Length = 418
Score = 189 bits (480), Expect = 7e-047
Identities = 126/388 (32%), Positives = 195/388 (49%), Gaps = 47/388 (12%)
Query: 32 GAGKTVIFSKIIH---DHVGAAAAIVHRREIVGQISLSLASFGVKHXXXXXXXXXXXXXX 88
GAGK+VI S+II + +VHR+E++ QI +L V
Sbjct: 9 GAGKSVILSEIIRMTTHNKNYVLFLVHRKELIDQIRNTLIMNEVD--------------- 53
Query: 89 XXXXXXGQSYIDQNAQCGVISVQTLTSKSTMNDATIMRWVKQVTLSVYDEGHHYVEQGIW 148
SY+ VQT+ + + + +L + DE HH + +
Sbjct: 54 -------MSYVKLGM------VQTIVRR--------LNKTSEPSLIIIDESHHVLANS-Y 91
Query: 149 AKAVHLMENAKLLFVTATPERADXXXXXXXXXXFAQTMIEGPTTHWLIEQGFLSKFVYRA 208
K +H AK++ TATP R + T+IE WLIE FL+ + Y A
Sbjct: 92 KKIIHHFSKAKVVGFTATPVRINGGGLGDIN----DTLIEKVNVKWLIENQFLAHYKYYA 147
Query: 209 PQTDIDLRDLAVGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKRAIVFATDVDTAE 268
P+T + L V + G+F+ +L + + I GDV++HY+K + ++AI++A+ + +E
Sbjct: 148 PET-VQTETLNVKRTGEFDMTSLDDQFNKRMIWGDVIKHYQKLADGEQAILYASSIYQSE 206
Query: 269 QMAEEFRKAGYTAAAVSGETESGERDRLLEEFEQGTIQILINVDLFDEGFDVPAVECVIL 328
+MA F AG ++A + G+T RD ++++F +G +++L N+DL EGFDVP VI+
Sbjct: 207 KMAASFNAAGISSAHIDGKTPKPIRDDIIKQFREGELKVLCNLDLIGEGFDVPDCSTVIM 266
Query: 329 ARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRTWTLKGREKGQSSG 388
RPTQSL+ ++Q R +R +GK A IID V N +R G P RTW+L+ R KG +S
Sbjct: 267 LRPTQSLSLYIQQSMRGMRYKQGKT-ATIIDHVGNVKRFGLPDMERTWSLEPR-KGSNST 324
Query: 389 SDTVPQKTCDACTIPYEAYNKACPYCGH 416
P K C C + K C +CGH
Sbjct: 325 KAEAPVKICKECFMTVSQTAKKCEHCGH 352
>ref|NP_050128.1| (NC_000896) putative DEAH-family helicase [Lactobacillus
bacteriophage phi adh]
emb|CAB52498.1| (AJ131519) putative DEAH-family helicase [Lactobacillus
bacteriophage phi adh]
Length = 455
Score = 187 bits (475), Expect = 3e-046
Identities = 132/443 (29%), Positives = 214/443 (47%), Gaps = 50/443 (11%)
Query: 4 LRDYQQLAYNRIMAAWQE-YRSVLAVLPTGAGKTVIFSKI---IHDHVGAAAAIVHRREI 59
+R YQ+ + + + + +++VL P G+GK++ +++ I + +VHRRE+
Sbjct: 5 IRSYQKKLIDDLRLSLKHGHKNVLIQSPAGSGKSITMAELAKRITSNGERVLFVVHRREL 64
Query: 60 VGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKSTM 119
V QI + +S+GV C + VQT + +
Sbjct: 65 VKQIKNTFSSWGVNMNL----------------------------CEIYMVQTASRR--- 93
Query: 120 NDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXXX 179
+ + + DE HH + + + K + +NA +L TATP R
Sbjct: 94 -----LEKLMTPNYILVDEAHHSLAK-TYKKINNHFKNAHILGFTATPVRLSGKGFKDIY 147
Query: 180 XXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVESH 239
++ GP WLIE +L+ + Y + ID L GD+ K+++ ++
Sbjct: 148 ----DDLVLGPKISWLIENHYLAPYTYYSVNL-IDQTKLKKSSTGDYTHKSIE-NAGKNI 201
Query: 240 IVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEE 299
+ GDV+Q Y K+ N +AI+++ V + +Q+A+EF K A V G+T+ +RD+ +++
Sbjct: 202 VYGDVIQSYHKFANNTKAIIYSYSVHSCQQIAKEFNKNNIPAKEVDGKTKKEDRDKAMQD 261
Query: 300 FEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIID 359
F G I+IL+N +L+ EG DVP E VI+ RPTQSL+ F+Q R +R K+ AIIID
Sbjct: 262 FRDGKIKILVNAELYGEGVDVPDCETVIMLRPTQSLSLFIQQSMRCMRYQPDKQ-AIIID 320
Query: 360 PVRNWERHGPPTAVRTWTLKGREK--GQSSGSDTVPQKTCDACTIPYEAYNKACPYCGHV 417
V N+ R G P R WTL+ R K + GSD + KTC C A CP CG+
Sbjct: 321 QVANYTRFGLPDMDRVWTLEDRAKHPQREGGSDGIAIKTCPNCFGVIMASYHKCPLCGYS 380
Query: 418 NVYPDRSAPDKVDGDLVELDLDA 440
R + +L +++LDA
Sbjct: 381 FEAEFRKLAEDKRAELEKINLDA 403
>ref|NP_049952.1| (NC_000871) putative helicase [Streptococcus thermophilus
bacteriophage Sfi19]
gb|AAC97921.1| (AF077306) putative helicase; gp443 [Streptococcus thermophilus
bacteriophage Sfi19]
gb|AAD44071.1|AF115102_30 (AF115102) orf443 gp [Streptococcus thermophilus bacteriophage
Sfi19]
Length = 443
Score = 185 bits (470), Expect = 1e-045
Identities = 132/423 (31%), Positives = 207/423 (48%), Gaps = 54/423 (12%)
Query: 3 KLRDYQQLAYNRI-MAAWQEYRSVLAVLPTGAGKTVIFSKIIHDHVGAAAAIV---HRRE 58
+LR+YQ N I + + + ++ P +GKTV+ + I I+ HR+E
Sbjct: 2 ELRNYQSDLVNDIKQSILRGNKRIMVQSPPRSGKTVVMAHIAKGATDKGNTILFFSHRKE 61
Query: 59 IVGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKST 118
I Q+ + GV D N ++++ ++T +
Sbjct: 62 INEQVVNTFKRNGV---------------------------DMN----LVTIDSVTKIAR 90
Query: 119 MNDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXX 178
D +++ ++ + DE HH V+ + K + N+ +L T TP R D
Sbjct: 91 NLDR-----IQEPSIILIDEAHH-VKAKTYLKIIEYYSNSIVLMFTGTPARLDGSGFDD- 143
Query: 179 XXXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVES 238
A ++ G WL + G ++ F Y AP ID +L + G+F K++ ++
Sbjct: 144 ---IADDIVLGKPVKWLQDNGNIAPFKYYAPSL-IDTTNLKK-RGGEFTKKSVDD-TMKR 197
Query: 239 HIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLE 298
I GDV++HY K + K+AIV+ V+ +E ++ F + GYT+ A+SG+T RDR ++
Sbjct: 198 VIYGDVIRHYEKLAKGKQAIVYTHSVEASESVSNTFNEQGYTSIAISGKTPPEVRDRAMQ 257
Query: 299 EFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIII 358
F G + I++N +LF EG D+P V+ I+ RPTQSL+ +LQ RAL EGK AIII
Sbjct: 258 AFRDGKLTIMVNCELFTEGIDLPNVDVCIMLRPTQSLSLYLQFAMRALNPREGKT-AIII 316
Query: 359 DPVRNWERHGPPTAVRTWTLKGREKGQSS---GSDTVPQKTCDACTIPYEAYNKACPYCG 415
D V N +RHG P A R W+LKG K + G T +TCD C + + + CP CG
Sbjct: 317 DHVGNVDRHGLPNADREWSLKGVNKTKKKLKLGEPTT--RTCDECYATFWSAERICPMCG 374
Query: 416 HVN 418
H N
Sbjct: 375 HEN 377
>ref|NP_049999.1| (NC_000872) putative helicase [Streptococcus thermophilus
bacteriophage Sfi21]
ref|NP_056709.1| (NC_002214) putative helicase [Streptococcus thermophilus
bacteriophage Sfi11]
gb|AAC72435.1| (AF004379) orf443 [Streptococcus thermophilus bacteriophage Sfi21]
gb|AAD44104.1|AF115103_34 (AF115103) orf443 gp [Streptococcus thermophilus bacteriophage
Sfi21]
gb|AAF63056.1|AF158600_10 (AF158600) putative helicase [Streptococcus thermophilus
bacteriophage Sfi11]
gb|AAF63085.1|AF158601_13 (AF158601) putative helicase [Streptococcus thermophilus
bacteriophage SFi18]
Length = 443
Score = 185 bits (470), Expect = 1e-045
Identities = 132/423 (31%), Positives = 207/423 (48%), Gaps = 54/423 (12%)
Query: 3 KLRDYQQLAYNRI-MAAWQEYRSVLAVLPTGAGKTVIFSKIIHDHVGAAAAIV---HRRE 58
+LR+YQ N I + + + ++ P +GKTV+ + I I+ HR+E
Sbjct: 2 ELRNYQSDLVNDIKQSILRGNKRIMVQSPPRSGKTVVMAHIAKGATDKGNTILFFSHRKE 61
Query: 59 IVGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKST 118
I Q+ + GV D N ++++ ++T +
Sbjct: 62 INEQVVNTFKRNGV---------------------------DMN----LVTIDSVTKIAR 90
Query: 119 MNDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXX 178
D +++ ++ + DE HH V+ + K + N+ +L T TP R D
Sbjct: 91 NLDR-----IQEPSIILIDEAHH-VKAKTYLKIIEYYSNSIVLMFTGTPARLDGSGFDD- 143
Query: 179 XXXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVES 238
A ++ G WL + G ++ F Y AP ID +L + G+F K++ ++
Sbjct: 144 ---IADDIVLGKPVKWLQDNGNIAPFKYYAPSL-IDTTNLKK-RGGEFTKKSVDD-TMKR 197
Query: 239 HIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLE 298
I GDV++HY K + K+AIV+ V+ +E ++ F + GYT+ A+SG+T RDR ++
Sbjct: 198 VIYGDVIRHYEKLAKGKQAIVYTHSVEASESVSNTFNEQGYTSIAISGKTPPEVRDRAMQ 257
Query: 299 EFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIII 358
F G + I++N +LF EG D+P V+ I+ RPTQSL+ +LQ RAL EGK AIII
Sbjct: 258 AFRDGKLTIMVNCELFTEGIDLPNVDVCIMLRPTQSLSLYLQFAMRALNPREGKT-AIII 316
Query: 359 DPVRNWERHGPPTAVRTWTLKGREKGQSS---GSDTVPQKTCDACTIPYEAYNKACPYCG 415
D V N +RHG P A R W+LKG K + G T +TCD C + + + CP CG
Sbjct: 317 DHVGNVDRHGLPNADREWSLKGVNKTKKKLKLGEPTT--RTCDECYATFWSAERICPMCG 374
Query: 416 HVN 418
H N
Sbjct: 375 HEN 377
>pir||T13299 probable helicase - Streptococcus phage phi-O1205
gb|AAC79526.1| (U88974) ORF10 [Streptococcus thermophilus temperate bacteriophage
O1205]
Length = 443
Score = 185 bits (470), Expect = 1e-045
Identities = 131/423 (30%), Positives = 208/423 (48%), Gaps = 54/423 (12%)
Query: 3 KLRDYQQLAYNRI-MAAWQEYRSVLAVLPTGAGKTVIFSKIIHDHVGAAAAIV---HRRE 58
+LR+YQ N I + + + ++ P +GKTV+ + I I+ HR+E
Sbjct: 2 ELRNYQSDLVNDIKQSILRGNKRIMVQSPPRSGKTVVMAHIAKGATDKGNTILFFSHRKE 61
Query: 59 IVGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKST 118
I Q+ + GV D N ++++ ++T +
Sbjct: 62 INEQVVNTFKRNGV---------------------------DMN----LVTIDSVTKIAR 90
Query: 119 MNDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXX 178
D +++ ++ + DE HH V+ + K + N+ +L T TP R D
Sbjct: 91 NLDR-----IQEPSIILIDEAHH-VKAKTYLKIIEYYSNSIVLMFTGTPARLDGSGFDD- 143
Query: 179 XXXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVES 238
A ++ G + WL + G ++ F Y AP ID +L + G+F K++ ++
Sbjct: 144 ---IADDIVLGKSVKWLQDNGNIAPFKYYAPSL-IDTTNLKK-RGGEFTKKSVDD-TMKR 197
Query: 239 HIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLE 298
I GDV++HY K + K+AIV+ V+ +E ++ F + GYT+ A+SG+T RDR ++
Sbjct: 198 VIYGDVIRHYEKLAKGKQAIVYTHSVEASESVSNTFNEQGYTSIAISGKTPPEVRDRAMQ 257
Query: 299 EFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIII 358
F G + I++N +LF EG D+P V+ I+ RPTQSL+ +LQ RAL EGK AIII
Sbjct: 258 AFRDGKLTIMVNCELFTEGIDLPNVDVCIMLRPTQSLSLYLQFAMRALNPREGKT-AIII 316
Query: 359 DPVRNWERHGPPTAVRTWTLKGREKGQSS---GSDTVPQKTCDACTIPYEAYNKACPYCG 415
D V N +RHG P A R W++KG K + G T +TCD C + + + CP CG
Sbjct: 317 DHVGNVDRHGLPNADREWSIKGVNKTKKKLKLGEPTT--RTCDECYATFWSAERICPMCG 374
Query: 416 HVN 418
H N
Sbjct: 375 HEN 377
>emb|CAC04160.1| (AJ292531) putative helicase [Lactococcus bacteriophage phi31]
Length = 448
Score = 183 bits (465), Expect = 4e-045
Identities = 145/442 (32%), Positives = 212/442 (47%), Gaps = 56/442 (12%)
Query: 4 LRDYQQLAYNRIMAAWQE-YRSVLAVLPTGAGKTVIF---SKIIHDHVGAAAAIVHRREI 59
LRDYQ+ I +++ + RS++ P +GKTV+ SK D HR+EI
Sbjct: 3 LRDYQEELVESIKSSFLKGNRSIIVQSPPRSGKTVVMADISKGATDKKNHVLFFSHRKEI 62
Query: 60 VGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKSTM 119
Q+ + +F + Y+ + SVQ+L K
Sbjct: 63 NDQV---VKTFELNQV-------------------NMEYVT------IGSVQSLVRK--- 91
Query: 120 NDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXXX 179
+ + + + DE HH ++ + K + NA LF T TP R +
Sbjct: 92 -----IDELPLPDIILVDEAHH-IKANSYKKILEAFPNALKLFFTGTPIRLNGQGFED-- 143
Query: 180 XXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVESH 239
A +I G + WL E G ++ F Y AP ID L +GDF K++ +
Sbjct: 144 --MADDLITGKSIKWLQEHGNIAPFKYYAPNI-IDTSQLKK-TSGDFTQKSMD-EAFKRA 198
Query: 240 IVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEE 299
I GDV+ HY K + K+AI +A +V TA+ ++EEF +AG TA V G+T ER+ ++ +
Sbjct: 199 IYGDVIAHYNKLSKGKQAICYAHNVATAQHISEEFNQAGITAEVVHGKTPKTEREAIMNK 258
Query: 300 FEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIID 359
F G I +LINV+LF EG D+P V I+ RPTQSL+ FLQ R L GK AI+ID
Sbjct: 259 FRAGEILVLINVELFTEGVDLPDVTTCIMLRPTQSLSLFLQFAMRPLNPKPGKT-AILID 317
Query: 360 PVRNWERHGPPTAVRTWTLKGREKGQS----SGSDTVPQKTCDACTIPYEAYN-KACPYC 414
V N+ RHG P R WTL G K +S G T+ K C+ C +++ N + CPYC
Sbjct: 318 HVGNYTRHGLPNEDREWTLSGISKKRSEYNTKGELTI--KQCEMCFGCFDSSNARTCPYC 375
Query: 415 GHVNVYPDRSAPDKVDGDLVEL 436
GH +R + + +L E+
Sbjct: 376 GHEPELTERELENIKEIELQEI 397
>gb|AAL74161.1|AF442520_1 (AF442520) putative helicase [Streptococcus thermophilus
bacteriophage k3]
Length = 443
Score = 183 bits (464), Expect = 5e-045
Identities = 131/423 (30%), Positives = 207/423 (47%), Gaps = 54/423 (12%)
Query: 3 KLRDYQQLAYNRI-MAAWQEYRSVLAVLPTGAGKTVIFSKIIHDHVGAAAAIV---HRRE 58
+LR+YQ N I + + + ++ P +GKTV+ + I ++ HR+E
Sbjct: 2 ELRNYQNDLVNGIKQSILRGNKRIMVQSPPRSGKTVVMAHIAKGATDKGNTVLFFSHRKE 61
Query: 59 IVGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKST 118
I Q+ + GV D N ++++ ++T +
Sbjct: 62 INEQVVNTFKRNGV---------------------------DMN----LVTIDSVTKVAR 90
Query: 119 MNDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXX 178
D + + ++ + DE HH V+ + K + N+ +L T TP R D
Sbjct: 91 NLDR-----ISEPSIILIDEAHH-VKAKTYLKIIEYYINSIVLMFTGTPARLDGSGFDD- 143
Query: 179 XXXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVES 238
A ++ G + WL E G ++ F Y AP ID +L + G+F K++ ++
Sbjct: 144 ---IADDIVLGKSVKWLQENGNIAPFKYYAPSL-IDTTNLKK-RGGEFTKKSVDD-TMKR 197
Query: 239 HIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLE 298
I GDV++HY K + K+AIV+ V+ +E ++ F+K GYT+ A+SG+T R+R ++
Sbjct: 198 VIYGDVIRHYEKLAKGKQAIVYTHSVEASESVSNTFKKNGYTSIAISGKTPPEVRERAMQ 257
Query: 299 EFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIII 358
F + I++N +LF EG D+P V+ I+ RPTQSL+ +LQ RAL EGK AIII
Sbjct: 258 TFRDRELTIMVNCELFTEGIDLPNVDVCIMLRPTQSLSLYLQFAMRALNPREGKT-AIII 316
Query: 359 DPVRNWERHGPPTAVRTWTLKGREKGQSS---GSDTVPQKTCDACTIPYEAYNKACPYCG 415
D V N +RHG P A R W+LKG K + G T +TCD C + + + CP CG
Sbjct: 317 DHVGNVDRHGLPNADREWSLKGINKTKKKLKLGEPTT--RTCDECYATFWSAERICPLCG 374
Query: 416 HVN 418
H N
Sbjct: 375 HEN 377
>ref|NP_471919.1| (NC_003212) similar to DEAH-family helicase [Listeria innocua]
emb|CAC97816.1| (AL596173) similar to DEAH-family helicase [Listeria innocua]
Length = 418
Score = 177 bits (450), Expect = 2e-043
Identities = 121/388 (31%), Positives = 188/388 (48%), Gaps = 47/388 (12%)
Query: 32 GAGKTVIFSKIIH---DHVGAAAAIVHRREIVGQISLSLASFGVKHXXXXXXXXXXXXXX 88
GAGK+VI S+II + +VHR+E++ QI +L V
Sbjct: 9 GAGKSVILSEIIRMTTRNKNNVLFLVHRKELIDQIRNTLTMNDVD--------------- 53
Query: 89 XXXXXXGQSYIDQNAQCGVISVQTLTSKSTMNDATIMRWVKQVTLSVYDEGHHYVEQGIW 148
+++ VQT+ + + + L + DE HH + +
Sbjct: 54 -------MKFVNLGM------VQTVVRR--------LEKTSEPALIIIDESHHVLANS-Y 91
Query: 149 AKAVHLMENAKLLFVTATPERADXXXXXXXXXXFAQTMIEGPTTHWLIEQGFLSKFVYRA 208
K ++ NAK++ TATP R + +IE WLIE FL+ + Y A
Sbjct: 92 KKIINHFSNAKVVGFTATPVRINGGGLGDIN----DMLIEKVNVKWLIENQFLAPYKYFA 147
Query: 209 PQTDIDLRDLAVGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKRAIVFATDVDTAE 268
P+ + L + + G+F+ L + + I GDV++HY+K ++AI++A+ + +E
Sbjct: 148 PEI-VQTETLDIKRTGEFDMTGLDDQFNKRMIWGDVIKHYQKLANGEQAILYASSLYQSE 206
Query: 269 QMAEEFRKAGYTAAAVSGETESGERDRLLEEFEQGTIQILINVDLFDEGFDVPAVECVIL 328
+MA F G T+A + G+T RD +++ F +G I++L N+DL EGFDVP VI+
Sbjct: 207 KMAMSFESVGITSAHIDGKTPKSIRDDIIQRFREGEIKVLCNLDLIGEGFDVPDCSTVIM 266
Query: 329 ARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRTWTLKGREKGQSSG 388
RPTQSL+ ++Q R +R GK AIIID V N R G P R W+L+ + KG +S
Sbjct: 267 LRPTQSLSLYIQQSMRGMRYRTGKT-AIIIDHVGNVNRFGLPDMERKWSLEAK-KGSNSN 324
Query: 389 SDTVPQKTCDACTIPYEAYNKACPYCGH 416
P K C C + + N C +CGH
Sbjct: 325 KAEAPVKICPDCFMTVLSTNIKCSHCGH 352
>sp|P24125|V51K_BPL79 51.5 kDa protein
pir||JU0445 51K protein GP51C - Lactococcus phage mi7-9
gb|AAA32363.1| (M36388) conserved bacteriophage encoded protein [Lactococcus
bacteriophage (ISOLATE 7-9)]
Length = 452
Score = 174 bits (442), Expect = 2e-042
Identities = 129/394 (32%), Positives = 188/394 (46%), Gaps = 55/394 (13%)
Query: 30 PTGAGKTVIFS---KIIHDHVGAAAAIVHRREIVGQISLSLASFGVKHXXXXXXXXXXXX 86
P G+GK+V+ S K+ + G +VHR+E++ QI+ S GV
Sbjct: 5 PPGSGKSVVISEITKLATEKGGRVLFLVHRKELIDQITNSFKVHGVP------------- 51
Query: 87 XXXXXXXXGQSYIDQNAQCGVISVQTLTSKSTMNDATIMRWVKQVTLSVYDEGHHYVEQG 146
+ V+ LT N +++R + TL + DEGHH +
Sbjct: 52 --------------------LNQVELLTVGKAKNRLSVLR---KPTLIITDEGHHG-KAA 87
Query: 147 IWAKAVHLMENAKLLFVTATPERADXXXXXXXXXXFAQTMIEGPTTHWLIEQGFLSKFVY 206
+ K + L TATP R MIEG T WLI L+ + Y
Sbjct: 88 TYQKIYEFYADVPRLGFTATPWRMSGDGFKDTYDY----MIEGKTVEWLINNKRLAPYQY 143
Query: 207 RAPQTDIDLRDLAVGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKRAIVFATDVDT 266
+ + ID+ L + KNGD++ +++ + ++ I GDVVQ Y K ++AI++A V+
Sbjct: 144 YSLPS-IDISKLRI-KNGDYSNQSIDDALGKT-IFGDVVQEYVKHANGQKAILYAHSVEA 200
Query: 267 AEQMAEEFRKAGYTAAAVSGETESGERDRLLEEFEQGTIQILINVDLFDEGFDVPAVECV 326
++ A+EF++AG A V +T ERD+L+ +F G I++L NVDL EGFDVP
Sbjct: 201 SQSFAKEFQEAGIKAVHVDAKTPKNERDKLMLDFRNGRIKVLCNVDLISEGFDVPDCTVT 260
Query: 327 ILARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRTWT--LKG--RE 382
IL RPT+SL FLQ R++R K AIIID V NW HG P W +G ++
Sbjct: 261 ILCRPTKSLVLFLQQSMRSMRYQHRKT-AIIIDNVMNWHAHGLPDTHHDWKDYFEGGWKK 319
Query: 383 KGQSSGSDTVPQKTCDACTIPYEAYNKACPYCGH 416
KGQ + V K C C+ + K C CGH
Sbjct: 320 KGQ---KNIVQAKQCPDCSAMWPLSQKMCNLCGH 350
>ref|NP_607398.1| (NC_003485) hypothetical phage protein [Streptococcus pyogenes
MGAS8232]
gb|AAL97897.1| (AE010051) hypothetical phage protein [Streptococcus pyogenes
MGAS8232]
Length = 455
Score = 164 bits (414), Expect = 3e-039
Identities = 120/414 (28%), Positives = 189/414 (44%), Gaps = 50/414 (12%)
Query: 25 VLAVLPTGAGKTVIFSKIIHDHV---GAAAAIVHRREIVGQISLSLASFGVKHXXXXXXX 81
++ V P G+GK+V+ S I G +VHR+E++ QI+ S G
Sbjct: 1 MMIVSPPGSGKSVVISDIAKSATQKNGHVLFLVHRKELIDQITNSFKFHG---------- 50
Query: 82 XXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKSTMNDATIMRWVKQVTLSVYDEGHH 141
ID N + + T+ D + + TL + DEGHH
Sbjct: 51 -----------------IDMNK----VDLMTVGKAKNRLDK-----LTKPTLIITDEGHH 84
Query: 142 YVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXXXXXFAQTMIEGPTTHWLIEQGFL 201
+ + + + TATP R M+ G T WLI L
Sbjct: 85 G-KASTYQIIYEYFSDVPRIGFTATPWRLSGDGFTDTY----DVMVLGKTVEWLINNNKL 139
Query: 202 SKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKRAIVFA 261
+ + Y + + ID L V +NGD++ K++ + I GDVVQ Y K ++AI++A
Sbjct: 140 APYDYYSVLS-IDTAKLKV-QNGDYSNKSIDESFGKK-IFGDVVQEYIKKANGQKAILYA 196
Query: 262 TDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEEFEQGTIQILINVDLFDEGFDVP 321
V+ ++ A+EF+ G A +T +RD+++++F G IQ++ NVDL EGFDVP
Sbjct: 197 HSVEASQAFAKEFQSMGINAIHADAKTPKAKRDKIMKDFRDGKIQVICNVDLISEGFDVP 256
Query: 322 AVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRTWT--LK 379
IL RPT+SL FLQ R++R K+ AII+D V NW HG P W +
Sbjct: 257 DCTVTILCRPTKSLVLFLQQSMRSMRYQPNKK-AIILDHVGNWNIHGLPDTPHHWENYFR 315
Query: 380 GREKGQSSGSDTVPQKTCDACTIPYEAYNKACPYCGHVNVYPDRSAPDKVDGDL 433
G K +S+ ++TV K C C+ + + C C H ++ ++++ +L
Sbjct: 316 GGWKKESNKTNTVHAKECPVCSALWPLSQQLCELCNHDFGLKEKQEKERIEAEL 369
>ref|NP_490263.1| (NC_003276) putative helicase [Nostoc sp. PCC 7120]
dbj|BAB78241.1| (AP003600) ORF_ID:alr7157~putative helicase [Nostoc sp. PCC 7120]
Length = 629
Score = 155 bits (393), Expect = 8e-037
Identities = 121/421 (28%), Positives = 182/421 (42%), Gaps = 56/421 (13%)
Query: 1 MFKLRDYQQLAYNRIMAAWQE-YRSVLAVLPTGAGKTVIFSKIIHDHVG---AAAAIVHR 56
MF LRDYQQ ++ AAW R VL L TG GKTVIF++I + HR
Sbjct: 1 MFNLRDYQQDLISKTFAAWSSGIRKVLLQLSTGGGKTVIFAEIASKMTAQGEGVLVVAHR 60
Query: 57 REIVGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSK 116
E++ Q + L + Y ++ + S+QTL +
Sbjct: 61 EELILQAAEKLTA----------------VTKLQPGIIKAGYKSTDSPLQIASIQTLARR 104
Query: 117 STMNDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXX 176
T A ++ + DE HH + K + +A +L +TATP R D
Sbjct: 105 QTYPSAQLV---------IIDEAHH-SSANSYRKLLDAYPHALVLGLTATPRRED----G 150
Query: 177 XXXXXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVV 236
+I +T LI G+L+ + A V + DF K L+ V
Sbjct: 151 YGLRDIFDHLICSVSTKELITLGYLTDYKLIA---GFKYSRHKVPQKRDFTRKELE-EVA 206
Query: 237 ESHIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRL 296
+ +V++ ++ + K+ ++FA +V ++Q+A F G T + GET ER +
Sbjct: 207 SDYKPSEVLKQWQNFCAGKKTVIFAVNVMHSKQIAAAFCADGITCEHLDGETPHNERQAI 266
Query: 297 LEEFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAI 356
L F G Q++ N + EGFD P ++ARPT S+ +LQM+GR LR GK+YA
Sbjct: 267 LNRFRSGQTQVISNCAILTEGFDCPDSSAAVIARPTSSVTLWLQMIGRVLRPAPGKDYAT 326
Query: 357 IIDPVRNWERHGPPTAVRTWTLKGREKGQSSGSDTVPQKTCDACTIPYEAYNKACPYCGH 416
I+D NW R G P R W+L+ +CD P ++ CP+C H
Sbjct: 327 ILDMTDNWFRLGRPCDNRKWSLE--------------PVSCD----PDTQGSRCCPHCHH 368
Query: 417 V 417
V
Sbjct: 369 V 369
>ref|NP_231273.1| (NC_002505) helicase-related protein [Vibrio cholerae]
pir||B82176 helicase-related protein VC1636 [imported] - Vibrio cholerae (group
O1 strain N16961)
gb|AAF94787.1| (AE004241) helicase-related protein [Vibrio cholerae]
Length = 657
Score = 110 bits (274), Expect = 5e-023
Identities = 111/441 (25%), Positives = 186/441 (42%), Gaps = 74/441 (16%)
Query: 1 MFKLRDYQQLAYNRIMAAWQEYRS-VLAVLPTGAGKTVIFSKIIHDHVGAAAAIVHRREI 59
M+ LR YQ + ++ ++++ + + VLPTGAGK+++ +++ G + H +E+
Sbjct: 64 MYTLRPYQADSVKAVVHYFRQHSTPAVIVLPTGAGKSLVIAELARLARGRVLVLAHVKEL 123
Query: 60 VGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKSTM 119
V Q +G+ G+ DQ Q SVQ++
Sbjct: 124 VEQNHAKYEGYGL-------------TGAIFSAGLGRKETDQ--QVVFASVQSVVRN--- 165
Query: 120 NDATIMRWVKQVTLSVYDEGHHYVEQ--GIWAKAV-HLME---NAKLLFVTATPERADXX 173
+ + Q +L V DE H + + K + HL+E K+L +TATP R
Sbjct: 166 ----LSEFQNQFSLLVIDECHRVPDDKNSSYQKVITHLLELNPGMKVLGLTATPYRLGMG 221
Query: 174 XXXXXXXX----------FAQTMIEGPTTHWLIEQGFLSKF-VYRAPQTDIDLRDLAVGK 222
F + E P H+L+++GFL+ + P D L
Sbjct: 222 WIYQYHTRGLVRSEEPRFFRDCIFELPI-HYLLDEGFLTPAQLIDTPVMSYDFSQLKPAN 280
Query: 223 NGDFNAKALKARVVESH-----IVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKA 277
G + L + +S IV +++ + + ++FA V AE++
Sbjct: 281 TGRYRESELDLVIEQSQRATPQIVAQIIELAKA---RQGVMIFAATVRHAEEIYRLLPTE 337
Query: 278 GYTAAAVSGETESGERDRLLEEFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAK 337
AA V G+T + ERDR+++ F+Q I+ L+NV + GFD P V+ + + RPT+S++
Sbjct: 338 H--AALVIGDTPTPERDRIIQAFKQQQIKFLVNVSVLTTGFDAPHVDLIAILRPTESVSL 395
Query: 338 FLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRTWTLKGREKGQSSGSDTVPQKTC 397
+ Q+VGR LR+ GK +++D N + L E G D P
Sbjct: 396 YQQIVGRGLRLAAGKSDCLVLDYAGN-----------QYDLYQPEVG-----DPKPDSDS 439
Query: 398 DACTIPYEAYNKACPYCGHVN 418
+ TIP CP CG N
Sbjct: 440 EIITIP-------CPACGFNN 453
>ref|NP_142208.1| (NC_000961) hypothetical protein [Pyrococcus horikoshii]
pir||H71243 probable helicase protein PH0210 - Pyrococcus horikoshii
dbj|BAA29279.1| (AP000001) 444aa long hypothetical protein [Pyrococcus horikoshii]
Length = 444
Score = 77.0 bits (188), Expect = 5e-013
Identities = 91/359 (25%), Positives = 154/359 (42%), Gaps = 37/359 (10%)
Query: 2 FKLRDYQQLAYNRIMAAWQEYRSVLAVLPTGAGKTVIFSKIIHDHVGAAAAIVHRREIVG 61
+KLR YQ+ A + A +E VLA LP G+GKTV+ +IIH+ +A IVH +E++
Sbjct: 93 YKLRKYQKKA---VKLAIKEKMGVLA-LPVGSGKTVVGLRIIHEINKSALVIVHTKELLY 148
Query: 62 QISLSLASF-GVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKSTMN 120
Q + + GV+ V +QTL S+ T
Sbjct: 149 QWANKVREILGVEPGIIGDNKWS------------------EGPITVAMIQTLLSRGTD- 189
Query: 121 DATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXXXX 180
+ + + ++DE H + K + ++ATP R
Sbjct: 190 -----KLQNKYAIVMFDECHRTSAAEKFYKVGISLPQVYRFGLSATPWRRLRGEEMKIEG 244
Query: 181 XFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVESHI 240
+ E LI++GFL+K + + D + LA K + +A+ +
Sbjct: 245 VVGPIIYEVKAED-LIKEGFLAKPKFEVIEYDSKMPALA-DKYKELYEEAVMENEERNRA 302
Query: 241 VGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEEF 300
+ + K G R ++ +D E + + R G A +S ++S R +LE++
Sbjct: 303 IVEKAIELAKQGH--RVLIDVKRIDHGEILVKMLRDRGINAEFLS--SQSPNRWEILEKY 358
Query: 301 EQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIID 359
++G I +L++ L EG D+P + +ILA +S +Q +GRALR G E A+I+D
Sbjct: 359 KKGEIPVLVST-LLKEGVDIPEISAIILAGGGKSDVMTIQTIGRALRPKPGGE-AVIVD 415
>gb|AAF12716.1|AF066865_14 (AF066865) unknown [bacteriophage TPW22]
Length = 205
Score = 65.9 bits (159), Expect = 1e-009
Identities = 46/136 (33%), Positives = 67/136 (48%), Gaps = 8/136 (5%)
Query: 137 DEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXXXXXFAQTMIEGPTTHWLI 196
DE HH ++ + K + NA LF T TP R + A +I G + WL
Sbjct: 78 DEAHH-IKANSYKKILEAFPNALKLFFTGTPIRLNGQGFED----MADDLITGKSIKWLQ 132
Query: 197 EQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKR 256
E G ++ F Y AP ID L +GDF K++ + I GDV+ HY K + K+
Sbjct: 133 EHGNIAPFKYYAPNI-IDTSQLKK-TSGDFTQKSID-EAFKRAIYGDVIAHYNKLSKGKQ 189
Query: 257 AIVFATDVDTAEQMAE 272
AI +A +V TA+ ++E
Sbjct: 190 AICYAHNVATAQHISE 205
>ref|NP_559418.1| (NC_003364) helicase, possible DNA repair rad25 [Pyrobaculum
aerophilum]
gb|AAL63600.1| (AE009830) helicase, possible DNA repair rad25 [Pyrobaculum
aerophilum]
Length = 662
Score = 65.1 bits (157), Expect = 2e-009
Identities = 63/242 (26%), Positives = 104/242 (42%), Gaps = 32/242 (13%)
Query: 133 LSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXXXX-------XFAQT 185
L ++DE HH + A+ L E+ L ++ATPER D ++
Sbjct: 387 LVIFDEAHHVPAETFKEVALSL-ESPFRLALSATPEREDQNEHLIYEAVGPPVYRASYRS 445
Query: 186 MIEG----PTTHWLIEQGFL--SKFVYRA-PQTD-IDLRDLAVGKNGDFNAKALKARVVE 237
MIE P H+ I + +Y + PQ + I LR+ A AKA K +
Sbjct: 446 MIEAGLVVPVEHYRIYVRMTREEEALYASLPQNNAIVLRNAA--------AKASKKIPIA 497
Query: 238 SHIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLL 297
+V V + +VF +D AE++ + R++G A ++ +E G R+
Sbjct: 498 VRVVVQEV------ALGSKVLVFTQFIDQAEELYKRLRESGIAAELIT--SEEGNREAAF 549
Query: 298 EEFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAII 357
F G ++++ + DEG DVP E ++ T S + +Q GR +R GK A +
Sbjct: 550 RRFSNGLSKVVVTTTVLDEGVDVPDAEVAVIVSGTGSRRQMIQRAGRVVRATAGKRAARV 609
Query: 358 ID 359
+
Sbjct: 610 YE 611
>ref|NP_602261.1| (NC_003450) COG1061:DNA or RNA helicases of superfamily II
[Corynebacterium glutamicum]
Length = 1643
Score = 60.8 bits (146), Expect = 4e-008
Identities = 68/255 (26%), Positives = 99/255 (38%), Gaps = 52/255 (20%)
Query: 158 AKLLFVTATPERADXXXXXXXXXXFAQTM------IEGPTTHWL-----IEQGFLSKFVY 206
AK L++TATP D A+ I GP H L +E+G L+ +
Sbjct: 360 AKRLYMTATPRLFDDSVKGKAADHSAEVSSMDDEAIYGPEFHRLGFGEAVEKGLLTDYKV 419
Query: 207 RAPQTDIDLRDLAVGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENK----------- 255
D + A+ G + L + S I+G ++ G+ +
Sbjct: 420 VVMTVDEQVAASALTVLGSTPGEELTLDMT-SAIIGAWNGLAKRSGKEQDTKTGFSSSDA 478
Query: 256 ---RAIVFATDVDTAEQMAEEFRKA--GYTAAA--------------------VSGETES 290
RA+ FA D+ T++Q+AE F + YT V G +
Sbjct: 479 AMERAVAFARDIKTSQQIAESFPRVVNAYTTELEVKNDDVDEHNLNLSVACQHVDGSMNA 538
Query: 291 GERDRLLEEFEQGT----IQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRAL 346
ER+ L + T +IL N EG DVPA++ VI P S+ +Q VGR +
Sbjct: 539 LERNSRLTWLKAPTQSMETKILTNARCLSEGVDVPALDSVIFFNPRNSMVDVVQSVGRVM 598
Query: 347 RIMEGKEYAIIIDPV 361
R GK Y II PV
Sbjct: 599 RKSPGKNYGYIILPV 613
>ref|NP_248512.1| (NC_000909) ATP dependent RNA helicase, putative [Methanococcus
jannaschii]
sp|Q58900|YF05_METJA Putative ATP-dependent RNA helicase MJ1505
gb|AAB99518.1| (U67591) ATP dependent RNA helicase, putative [Methanococcus
jannaschii]
Length = 778
Score = 55.1 bits (131), Expect = 2e-006
Identities = 35/124 (28%), Positives = 63/124 (50%), Gaps = 8/124 (6%)
Query: 241 VGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESG-------ER 293
V D+V++ + +++R I+FA DT E++ + G A G+ E+
Sbjct: 339 VVDMVKNILEKNKDERIIIFAQYRDTVEKIVNLLTQNGIKAIRFIGQANKEGKGMSQKEQ 398
Query: 294 DRLLEEFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKE 353
+E F++ +L++ + +EG D+P+V +I P S +F+Q GRA+R GK
Sbjct: 399 IEAIERFKKEG-SVLVSTSVSEEGIDIPSVNYIIFYEPVPSEIRFIQRRGRAMRGEGGKV 457
Query: 354 YAII 357
Y +I
Sbjct: 458 YVLI 461
>sp|O02494|IF4A_CRYPV Eukaryotic initiation factor 4A (eIF-4A) (eIF4A)
gb|AAB58726.1| (AF001211) translation initiation factor [Cryptosporidium parvum]
gb|AAB58799.1| (AF001378) translation initiation factor [Cryptosporidium parvum]
Length = 405
Score = 50.1 bits (118), Expect = 6e-005
Identities = 29/117 (24%), Positives = 59/117 (49%), Gaps = 5/117 (4%)
Query: 256 RAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEEFEQGTIQILINVDLFD 315
+AI++ +Q+ ++ R+ +T +++ G+ + +R+ ++ +F G+ ++LI DL
Sbjct: 272 QAIIYCNTRRRVDQLTKQMRERDFTCSSMHGDMDQKDREVIMRQFRSGSSRVLITTDLLA 331
Query: 316 EGFDVPAVECVILARPTQSLAKFLQMVGRALRI-MEGKEYAIIIDP----VRNWERH 367
G DV V VI S ++ +GR+ R +G + D +R+ ERH
Sbjct: 332 RGIDVQQVSLVINYDLPVSPETYIHRIGRSGRFGKKGVSINFVTDDDIVCLRDIERH 388
>ref|NP_421775.1| (NC_002696) excinuclease ABC, subunit B [Caulobacter crescentus
CB15]
gb|AAK24943.1| (AE005961) excinuclease ABC, subunit B [Caulobacter crescentus
CB15]
Length = 749
Score = 47.8 bits (112), Expect = 3e-004
Identities = 59/275 (21%), Positives = 107/275 (38%), Gaps = 29/275 (10%)
Query: 241 VGDVVQHYRKW-GENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEE 299
V DVV R+ + R +V AE + E + G + + ++ ER ++ +
Sbjct: 478 VDDVVDEIRQTIQKGYRTLVTVLTKKMAEDLTEYLTEQGIRVRYMHSDVDTIERIEIIRD 537
Query: 300 FEQGTIQILINVDLFDEGFDVPAVECVILA-------RPTQSLAKFLQMVGRALRIMEGK 352
G +L+ ++L EG D+P EC ++A +S +Q +GRA R ++GK
Sbjct: 538 LRLGHFDVLVGINLLREGLDIP--ECGLVAILDADKEGFLRSETSLIQTIGRAARNVDGK 595
Query: 353 EYAIIIDPVRNWERHGPPTAVRTWTLKGREKGQSSGSDTVPQKTCDACTIPYEAYNKAC- 411
+ ER TA R ++V + D PYE ++
Sbjct: 596 VILYADRVTGSMERAMAETARRREKQHAYNLEHGITPESVKRDIKDILNSPYERGDRVLV 655
Query: 412 ----------PYCGHVNVYPDRSAPDKVDGDLVELDLDAWNALFAEIERAKMSDDDYELD 461
P+ G + K+ L+ + L EI+R K+ D ++ +
Sbjct: 656 PMGMSETDDRPFSGDNFKAALKDLEAKMREAAANLEFETAARLRDEIKRMKLMDLEFANE 715
Query: 462 M-------IRRGVPQIGRGQELRRHRAAKYRREVL 489
+ + + +P+ R ELR +A +R+ L
Sbjct: 716 VLTAPGEAVDKAMPKRVRA-ELRAEQAEAFRKSRL 749
>ref|NP_375993.1| (NC_003106) 621aa long conserved hypothetical protein [Sulfolobus
tokodaii]
dbj|BAB65102.1| (AP000981) 621aa long conserved hypothetical protein [Sulfolobus
tokodaii]
Length = 621
Score = 43.5 bits (101), Expect = 0.006
Identities = 29/107 (27%), Positives = 50/107 (46%), Gaps = 14/107 (13%)
Query: 255 KRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEEFEQGTIQILINVDLF 314
K A+VF ++AE +AE RK A RD+++E F +G +++L++
Sbjct: 215 KNALVFVRSRNSAETLAETLRKFSIPALPYHSGLPYDVRDKVIESFMKGEVRVLVSTTAL 274
Query: 315 DEGFDVPAVECVI--LARPTQ------------SLAKFLQMVGRALR 347
+G ++P V ++ P S+A+F Q+ GRA R
Sbjct: 275 GQGVNLPVYATVFYDISLPDSDDKGEFKGWRDLSVAEFKQIAGRAGR 321
>gb|AAM02248.1| (AE010392) Lhr-like Superfamily II helicase [Methanopyrus kandleri
AV19]
Length = 910
Score = 40.4 bits (93), Expect = 0.050
Identities = 61/291 (20%), Positives = 106/291 (35%), Gaps = 50/291 (17%)
Query: 253 ENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLE---EFEQGTIQILI 309
E +VF AE +A K Y V S R++ +E F++G I +L+
Sbjct: 242 ERDSVLVFTNTRQMAELLATRL-KTEYDDIEVEIHHSSISREKRMEVEKRFKKGEIDVLV 300
Query: 310 NVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGP 369
+ G D+ V+ V+ + + + +Q VGRA R + E +I N +
Sbjct: 301 CTSSLELGIDIGHVDLVVQYGSPRQVTRLVQRVGRAGRRRKRAEGLVI---TSNPDDLAE 357
Query: 370 PTAVRTWTLKGREKGQSSGS---DTVPQKTCDAC------TIPY--EAYNKACPYCGHVN 418
+ LKG + D + + C T+ Y E + +A PY H++
Sbjct: 358 AAVICRRALKGSLEPTEIPEGCLDVLAHQLVGLCLDGHQVTVDYALEVFRRAYPY-RHLD 416
Query: 419 VYPDRSAPD--------KVDGDLVELDLDAWNALFAEIE----------------RAKMS 454
R + +V GD V DAW ++ + +
Sbjct: 417 AETLREVAEFLDDIEVLRVRGDRVYRTKDAWKYYYSNLSMIPDERHYRVVTESGGHVSVL 476
Query: 455 DDDYELDMIRRGV-------PQIGRGQELRRHRAAKYRREVLENTVRWWVG 498
D+ + L+ +R G+ P I + + R+ E +E + WVG
Sbjct: 477 DEPFVLEYLRPGIKFICAGRPWIVQDVDHDRYEVLVTPAEAVEGAIPSWVG 527
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.321 0.136 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 289,393,974
Number of Sequences: 919193
Number of extensions: 11422833
Number of successful extensions: 28078
Number of sequences better than 5.0e-02: 682
Number of HSP's better than 0.1 without gapping: 520
Number of HSP's successfully gapped in prelim test: 162
Number of HSP's that attempted gapping in prelim test: 27076
Number of HSP's gapped (non-prelim): 1053
length of query: 550
length of database: 288,098,425
effective HSP length: 125
effective length of query: 425
effective length of database: 173,199,300
effective search space: 73609702500
effective search space used: 73609702500
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 93 (40.4 bits)
Query= orf45 [13289-13458]
(168 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_416862.1| (NC_000913) orf, hypothetical protein [Esc... 96 1e-019
sp|P76514|YFDR_ECOLI Hypothetical protein yfdR 96 1e-019
ref|NP_599062.1| (NC_003444) unknown [Shigella flexneri bac... 95 3e-019
ref|NP_420264.1| (NC_002696) conserved hypothetical protein... 58 4e-008
ref|NP_531749.1| (NC_003304) conserved hypothetical protein... 54 6e-007
ref|NP_354075.1| (NC_003062) AGR_C_1942p [Agrobacterium tum... 54 6e-007
dbj|BAA34643.1| (AB020211) unnamed protein product [Roseoba... 54 8e-007
ref|NP_108031.1| (NC_002678) hypothetical protein [Mesorhiz... 53 1e-006
ref|NP_385209.1| (NC_003047) CONSERVED HYPOTHETICAL PROTEIN... 52 3e-006
>ref|NP_416862.1| (NC_000913) orf, hypothetical protein [Escherichia coli K12]
pir||F65009 hypothetical protein b2361 - Escherichia coli (strain K-12)
gb|AAC75420.1| (AE000324) orf, hypothetical protein [Escherichia coli K12]
Length = 187
Score = 95.9 bits (237), Expect = 1e-019
Identities = 53/116 (45%), Positives = 73/116 (62%), Gaps = 2/116 (1%)
Query: 19 DKFDMPLATIAYHLSHINRFNGAVGQ-YSVAQHCVQVAALLPQDLKLAGLLHDATEAVLC 77
+K D+ + IA LS+I RF G + YSVAQH V + L+PQ+ L+HDATEA
Sbjct: 27 NKDDIVINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQ 86
Query: 78 DVPAPLKRMLPDYQRIEERLQDAVDRRFNV-KTRHKRVREADLSMLAAEARDFNLD 132
D+PAPLKR+LPDY+R+EE++ + ++ + V+ ADL MLA E RD LD
Sbjct: 87 DIPAPLKRLLPDYKRMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLD 142
>sp|P76514|YFDR_ECOLI Hypothetical protein yfdR
Length = 178
Score = 95.9 bits (237), Expect = 1e-019
Identities = 53/116 (45%), Positives = 73/116 (62%), Gaps = 2/116 (1%)
Query: 19 DKFDMPLATIAYHLSHINRFNGAVGQ-YSVAQHCVQVAALLPQDLKLAGLLHDATEAVLC 77
+K D+ + IA LS+I RF G + YSVAQH V + L+PQ+ L+HDATEA
Sbjct: 18 NKDDIVINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQ 77
Query: 78 DVPAPLKRMLPDYQRIEERLQDAVDRRFNV-KTRHKRVREADLSMLAAEARDFNLD 132
D+PAPLKR+LPDY+R+EE++ + ++ + V+ ADL MLA E RD LD
Sbjct: 78 DIPAPLKRLLPDYKRMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLD 133
>ref|NP_599062.1| (NC_003444) unknown [Shigella flexneri bacteriophage V]
gb|AAL89432.1| (U82619) unknown [Shigella flexneri bacteriophage V]
Length = 178
Score = 94.7 bits (234), Expect = 3e-019
Identities = 52/116 (44%), Positives = 73/116 (62%), Gaps = 2/116 (1%)
Query: 19 DKFDMPLATIAYHLSHINRFNGAVGQ-YSVAQHCVQVAALLPQDLKLAGLLHDATEAVLC 77
+K D+ + IA LS+I RF G + YSVAQH V + L+PQ+ L+HDATEA
Sbjct: 18 NKDDIDINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQ 77
Query: 78 DVPAPLKRMLPDYQRIEERLQDAVDRRFNV-KTRHKRVREADLSMLAAEARDFNLD 132
D+PAPLKR+LPDY+++EE++ + ++ + V+ ADL MLA E RD LD
Sbjct: 78 DIPAPLKRLLPDYKQMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLD 133
>ref|NP_420264.1| (NC_002696) conserved hypothetical protein [Caulobacter crescentus
CB15]
gb|AAK23432.1| (AE005819) conserved hypothetical protein [Caulobacter crescentus
CB15]
Length = 250
Score = 57.8 bits (138), Expect = 4e-008
Identities = 44/121 (36%), Positives = 69/121 (56%), Gaps = 16/121 (13%)
Query: 22 DMPLATIAYHLSHINRFNG-AVGQ--YSVAQHCV---QVAALLPQDL----KLAGLLHDA 71
D+ + IA+ L+ + R+NG VG +SVAQH + ++AA + DL +LA LLHDA
Sbjct: 68 DIEIEDIAHGLARVARWNGQTVGDHGFSVAQHSLVVEEIAAHIKPDLEPRWRLAALLHDA 127
Query: 72 TEAVLCDVPAPLKRML-PDYQRIEERLQDAVDRRFNVKTR-----HKRVREADLSMLAAE 125
+E V+ D+ +P K L Y+ E RL+DA+ RF + + K +++AD + E
Sbjct: 128 SEYVIGDMISPFKAALGVSYKDFETRLEDAIHIRFGLPVKTPTPIKKLIKQADRACAFFE 187
Query: 126 A 126
A
Sbjct: 188 A 188
>ref|NP_531749.1| (NC_003304) conserved hypothetical protein [Agrobacterium
tumefaciens str. C58 (U. Washington)]
gb|AAL42065.1| (AE009069) conserved hypothetical protein [Agrobacterium
tumefaciens str. C58 (U. Washington)]
Length = 207
Score = 53.9 bits (128), Expect = 6e-007
Identities = 39/124 (31%), Positives = 68/124 (54%), Gaps = 18/124 (14%)
Query: 22 DMPLATIAYHLSHINRFNGAVG---QYSVAQHCVQVAALL--------PQDLKLAGLLHD 70
D+ +A IA+ L+ + R+NG ++VAQHC+ V + P ++++A LLHD
Sbjct: 28 DVEIADIAHGLARVARWNGQTRGDHAFTVAQHCLIVETIFCRMCPGATPDEMQMA-LLHD 86
Query: 71 ATEAVLCDVPAPLKRML-PDYQRIEERLQDAVDRRFNV-----KTRHKRVREADLSMLAA 124
A E V+ D+ +P K ++ Y+ +E+RL+ AV RF + + R+++AD
Sbjct: 87 APEYVIGDMISPFKSVVGGGYKTVEKRLEAAVHLRFGLPPHASRELKDRIKKADTVAAFF 146
Query: 125 EARD 128
EA +
Sbjct: 147 EATE 150
>ref|NP_354075.1| (NC_003062) AGR_C_1942p [Agrobacterium tumefaciens] [Agrobacterium
tumefaciens str. C58 (Cereon)]
gb|AAK86860.1| (AE008035) AGR_C_1942p [Agrobacterium tumefaciens str. C58
(Cereon)]
Length = 226
Score = 53.9 bits (128), Expect = 6e-007
Identities = 39/124 (31%), Positives = 68/124 (54%), Gaps = 18/124 (14%)
Query: 22 DMPLATIAYHLSHINRFNGAVG---QYSVAQHCVQVAALL--------PQDLKLAGLLHD 70
D+ +A IA+ L+ + R+NG ++VAQHC+ V + P ++++A LLHD
Sbjct: 47 DVEIADIAHGLARVARWNGQTRGDHAFTVAQHCLIVETIFCRMCPGATPDEMQMA-LLHD 105
Query: 71 ATEAVLCDVPAPLKRML-PDYQRIEERLQDAVDRRFNV-----KTRHKRVREADLSMLAA 124
A E V+ D+ +P K ++ Y+ +E+RL+ AV RF + + R+++AD
Sbjct: 106 APEYVIGDMISPFKSVVGGGYKTVEKRLEAAVHLRFGLPPHASRELKDRIKKADTVAAFF 165
Query: 125 EARD 128
EA +
Sbjct: 166 EATE 169
>dbj|BAA34643.1| (AB020211) unnamed protein product [Roseobacter denitrificans]
Length = 195
Score = 53.5 bits (127), Expect = 8e-007
Identities = 35/97 (36%), Positives = 53/97 (54%), Gaps = 11/97 (11%)
Query: 22 DMPLATIAYHLSHINRFNGAVG---QYSVAQHCVQVAALL-------PQDLKLAGLLHDA 71
D+ + IA L+ + R+NG YSVA+H + V L P +LA LLHDA
Sbjct: 23 DVEIEDIARGLAFVARWNGQTSGAHAYSVAEHSLLVETLFTRIADKSPAKWRLAALLHDA 82
Query: 72 TEAVLCDVPAPLKRML-PDYQRIEERLQDAVDRRFNV 107
E V+ D+ +P+K + P Y+ ++ERL A+ RF +
Sbjct: 83 PEYVIGDMISPVKAAVGPGYEALDERLTAAIHLRFGL 119
>ref|NP_108031.1| (NC_002678) hypothetical protein [Mesorhizobium loti]
dbj|BAB54176.1| (AP003012) hypothetical protein [Mesorhizobium loti]
Length = 211
Score = 53.1 bits (126), Expect = 1e-006
Identities = 37/97 (38%), Positives = 58/97 (59%), Gaps = 11/97 (11%)
Query: 22 DMPLATIAYHLSHINRFNGAVG---QYSVAQHCVQVAAL----LPQDL---KLAGLLHDA 71
D+ ++ IA+ L+ + R+NG +SVAQH + V AL +PQ +LA LLHDA
Sbjct: 31 DIEISDIAHGLARVARWNGQTSGEHAFSVAQHSLLVEALFNDLVPQASATDRLAALLHDA 90
Query: 72 TEAVLCDVPAPLKRML-PDYQRIEERLQDAVDRRFNV 107
E V+ D+ +P K ++ Y+ E RLQ A+ +RF++
Sbjct: 91 PEYVIGDMISPFKSVMGGSYKDCELRLQRAIHQRFSL 127
>ref|NP_385209.1| (NC_003047) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti]
emb|CAC45682.1| (AL591786) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti]
Length = 202
Score = 51.6 bits (122), Expect = 3e-006
Identities = 36/112 (32%), Positives = 63/112 (56%), Gaps = 15/112 (13%)
Query: 22 DMPLATIAYHLSHINRFNGAVG---QYSVAQHCVQVAALLPQDLK------LAGLLHDAT 72
D+ L+ IA+ L+ + R+NG +SVAQH + V + + + L LLHDA
Sbjct: 25 DVELSDIAHGLARVARWNGQTSGDHAFSVAQHSLVVEEIFRRTNRCDAQDCLMALLHDAP 84
Query: 73 EAVLCDVPAPLKRML-PDYQRIEERLQDAVDRRFNV-----KTRHKRVREAD 118
E V+ D+ +P K ++ Y+ +E+RL++AV RF + + +R+++AD
Sbjct: 85 EYVIGDMISPFKAVVGGGYKTVEKRLENAVHLRFGLPAHTPRELKERIKKAD 136
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.324 0.139 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 78,813,900
Number of Sequences: 919193
Number of extensions: 2875408
Number of successful extensions: 6513
Number of sequences better than 5.0e-02: 12
Number of HSP's better than 0.1 without gapping: 3
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 6501
Number of HSP's gapped (non-prelim): 12
length of query: 168
length of database: 288,098,425
effective HSP length: 111
effective length of query: 57
effective length of database: 186,068,002
effective search space: 10605876114
effective search space used: 10605876114
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 86 (37.7 bits)
Query= orf46 [13458-13629]
(170 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.319 0.133 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 96,103,402
Number of Sequences: 919193
Number of extensions: 3660078
Number of successful extensions: 9533
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 9533
Number of HSP's gapped (non-prelim): 0
length of query: 170
length of database: 288,098,425
effective HSP length: 112
effective length of query: 58
effective length of database: 185,148,809
effective search space: 10738630922
effective search space used: 10738630922
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 86 (37.7 bits)
Query= orf47 [13644-13756]
(111 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_335121.1| (NC_002755) transcriptional regulator, put... 48 2e-005
ref|NP_215195.1| (NC_000962) hypothetical protein Rv0681 [M... 46 6e-005
emb|CAC95193.1| (AJ416905) hypothetical transcriptional reg... 42 9e-004
ref|NP_266283.1| (NC_002662) transcription regulator [Lacto... 42 0.002
pir||T34657 probable transcription regulator - Streptomyces... 41 0.002
ref|NP_241219.1| (NC_002570) BH0353~unknown conserved prote... 40 0.003
ref|NP_252662.1| (NC_002516) probable transcriptional regul... 39 0.008
ref|NP_149192.1| (NC_001988) HTH transcriptional regulator ... 39 0.010
ref|NP_355336.1| (NC_003062) AGR_C_4325p [Agrobacterium tum... 39 0.013
ref|NP_251575.1| (NC_002516) probable transcriptional regul... 38 0.022
emb|CAC03642.1| (AL391338) putative TetR-family transcripti... 37 0.029
ref|NP_534677.1| (NC_003305) transcriptional regulator, Tet... 37 0.029
ref|NP_511232.1| (NC_003292) tetracycline repressor protein... 37 0.029
pir||RPECR1 tet repressor protein (Tn 1721) - Escherichia c... 37 0.049
>ref|NP_335121.1| (NC_002755) transcriptional regulator, putative [Mycobacterium
tuberculosis CDC1551]
gb|AAK44935.1| (AE006965) transcriptional regulator, putative [Mycobacterium
tuberculosis CDC1551]
Length = 218
Score = 47.8 bits (112), Expect = 2e-005
Identities = 34/101 (33%), Positives = 51/101 (49%), Gaps = 8/101 (7%)
Query: 2 AKKITKRPAPAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLK 61
A K RPA R E IV+ L R GW+S+T A+A +G PSL H ++ L+
Sbjct: 19 AVKSVARPAKLSR-ESIVEGALTFLDREGWDSLTINALATQLGTKGPSLYNHVDSLEDLR 77
Query: 62 RAIVRAAVRANDGLVVAM-AVQAGDWSRDKTDELLYIRGKH 101
RA+ +R D ++ + V AG R + D +L + G +
Sbjct: 78 RAV---RIRVIDDIITMLNRVGAG---RARDDAVLVMAGAY 112
>ref|NP_215195.1| (NC_000962) hypothetical protein Rv0681 [Mycobacterium tuberculosis
H37Rv]
pir||B70827 hypothetical protein Rv0681 - Mycobacterium tuberculosis (strain
H37RV)
emb|CAA17464.1| (AL021943) hypothetical protein Rv0681 [Mycobacterium tuberculosis
H37Rv]
Length = 196
Score = 46.2 bits (108), Expect = 6e-005
Identities = 32/95 (33%), Positives = 49/95 (50%), Gaps = 8/95 (8%)
Query: 8 RPAPAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRA 67
RPA R E IV+ L R GW+S+T A+A +G PSL H ++ L+RA+
Sbjct: 3 RPAKLSR-ESIVEGALTFLDREGWDSLTINALATQLGTKGPSLYNHVDSLEDLRRAV--- 58
Query: 68 AVRANDGLVVAM-AVQAGDWSRDKTDELLYIRGKH 101
+R D ++ + V AG R + D +L + G +
Sbjct: 59 RIRVIDDIITMLNRVGAG---RARDDAVLVMAGAY 90
>emb|CAC95193.1| (AJ416905) hypothetical transcriptional regulator [Rhizobium
leguminosarum]
Length = 216
Score = 42.4 bits (98), Expect = 9e-004
Identities = 20/64 (31%), Positives = 38/64 (59%)
Query: 2 AKKITKRPAPAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLK 61
AK + + ++EQI++A L V +G+ T +AE G++ P+L+Y++RT +
Sbjct: 7 AKTLRRTRIQEEKEEQILEAALDVFSASGFRGSTIDQIAEVAGMSKPNLLYYFRTKEAMH 66
Query: 62 RAIV 65
RA++
Sbjct: 67 RALI 70
>ref|NP_266283.1| (NC_002662) transcription regulator [Lactococcus lactis subsp.
lactis]
gb|AAK04225.1|AE006251_1 (AE006251) transcription regulator [Lactococcus lactis subsp.
lactis]
Length = 191
Score = 41.6 bits (96), Expect = 0.002
Identities = 18/52 (34%), Positives = 36/52 (68%)
Query: 12 AVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRA 63
A+ +++IVQA + +A + G ++++ +AE+ GI APSL H++ M +++ A
Sbjct: 5 ALTRDKIVQATIELAGKIGLSNVSFPRLAEYFGIKAPSLYNHFKNMEEVRVA 56
>pir||T34657 probable transcription regulator - Streptomyces coelicolor
emb|CAA22378.1| (AL034446) putative transcriptional regulator [Streptomyces
coelicolor A3(2)]
Length = 189
Score = 41.2 bits (95), Expect = 0.002
Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 3/72 (4%)
Query: 9 PAPAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRAA 68
P + +++V A +A TG+ES+T A+A H G+ SL H R + L+ VR A
Sbjct: 2 PRVGLTTDRVVAAAADLADETGFESVTVSALARHFGVKDASLYTHVRNLQDLR---VRVA 58
Query: 69 VRANDGLVVAMA 80
+ A L+ +A
Sbjct: 59 LLAGGELIEEIA 70
>ref|NP_241219.1| (NC_002570) BH0353~unknown conserved protein [Bacillus
halodurans]
dbj|BAB04072.1| (AP001508) BH0353~unknown conserved protein [Bacillus halodurans]
Length = 197
Score = 40.4 bits (93), Expect = 0.003
Identities = 22/73 (30%), Positives = 39/73 (53%)
Query: 13 VRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRAAVRAN 72
V KE++++A G S+T K+VAE G+ ++ YH+R+ QL IV + +
Sbjct: 7 VTKEELIEAAKRCIVEKGIHSLTLKSVAEGAGVTQGTVYYHFRSKDQLMLEIVESMCTSA 66
Query: 73 DGLVVAMAVQAGD 85
+ +M+ Q G+
Sbjct: 67 WHSLESMSKQEGE 79
>ref|NP_252662.1| (NC_002516) probable transcriptional regulator [Pseudomonas
aeruginosa]
pir||C83149 probable transcription regulator PA3973 [imported] - Pseudomonas
aeruginosa (strain PAO1)
gb|AAG07360.1|AE004815_4 (AE004815) probable transcriptional regulator [Pseudomonas
aeruginosa]
Length = 215
Score = 39.3 bits (90), Expect = 0.008
Identities = 27/82 (32%), Positives = 44/82 (52%), Gaps = 3/82 (3%)
Query: 1 MAKKITKR--PAPAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMT 58
M ++T+R + +E+I+Q GL G+ ++T +A+A+ GIA SL H+R
Sbjct: 1 MVYRVTERRLQRDSALRERILQLGLRRVVEGGFAALTMQALADDAGIATGSLYRHFRGKG 60
Query: 59 QLKRAIV-RAAVRANDGLVVAM 79
+L I RA+ R D L V +
Sbjct: 61 ELAAEIFRRASQREVDALAVVL 82
>ref|NP_149192.1| (NC_001988) HTH transcriptional regulator TetR family
[Clostridium acetobutylicum]
gb|AAK76774.1|AE001438_27 (AE001438) HTH transcriptional regulator TetR family [Clostridium
acetobutylicum]
Length = 187
Score = 38.9 bits (89), Expect = 0.010
Identities = 17/47 (36%), Positives = 29/47 (61%)
Query: 15 KEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLK 61
KE+I+Q +A G + +T++ +AE + I +PSL H+ M+ LK
Sbjct: 8 KEKIIQIAFSLADEIGLDKVTFQKLAEKLDIKSPSLYNHFSNMSDLK 54
>ref|NP_355336.1| (NC_003062) AGR_C_4325p [Agrobacterium tumefaciens]
[Agrobacterium tumefaciens str. C58 (Cereon)]
ref|NP_533056.1| (NC_003304) transcriptional regulator, TetR family [Agrobacterium
tumefaciens str. C58 (U. Washington)]
gb|AAK88121.1| (AE008152) AGR_C_4325p [Agrobacterium tumefaciens str. C58
(Cereon)]
gb|AAL43372.1| (AE009186) transcriptional regulator, TetR family [Agrobacterium
tumefaciens str. C58 (U. Washington)]
Length = 213
Score = 38.5 bits (88), Expect = 0.013
Identities = 16/52 (30%), Positives = 32/52 (60%)
Query: 14 RKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIV 65
++E I++A L V G+ T +AE G++ P+++Y++RT + RA++
Sbjct: 19 KQETILEAALSVFSTNGFRGSTIDQIAEAAGMSKPNVLYYFRTKEAMHRALI 70
>ref|NP_251575.1| (NC_002516) probable transcriptional regulator [Pseudomonas
aeruginosa]
pir||C83286 probable transcriptional regulator PA2885 [imported] -
Pseudomonas aeruginosa (strain PAO1)
gb|AAG06273.1|AE004714_10 (AE004714) probable transcriptional regulator [Pseudomonas
aeruginosa]
Length = 198
Score = 37.7 bits (86), Expect = 0.022
Identities = 18/75 (24%), Positives = 39/75 (52%)
Query: 11 PAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRAAVR 70
P + +++Q + R G+E T + +A VGI + S+ +H+++ ++ R+++ +
Sbjct: 13 PESARGKLLQTAAHLFRSKGYERTTVRDLASAVGIQSGSIFHHFKSKDEILRSVMEETIL 72
Query: 71 ANDGLVVAMAVQAGD 85
N L+ A A D
Sbjct: 73 YNTALMRAALADAED 87
>emb|CAC03642.1| (AL391338) putative TetR-family transcriptional regulator.
[Streptomyces coelicolor A3(2)]
Length = 190
Score = 37.4 bits (85), Expect = 0.029
Identities = 32/104 (30%), Positives = 51/104 (48%), Gaps = 10/104 (9%)
Query: 4 KITKRPAPAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRA 63
K +R A A R +QI+ A + G ++++ + +AE VGI APSL H+ +A
Sbjct: 2 KSGERSAVADRAQQIMSAARELLDAEGPDALSMRRIAERVGIRAPSLYKHF-----ADKA 56
Query: 64 IVRAAVRANDGLVVAMAVQAGDWSRDKTDELLYIRG----KHAL 103
+ A ++A GL + VQ + D L + G +HAL
Sbjct: 57 ALVAGLQA-QGLTLLAEVQEAAEAEIGADPPLLVLGYAYRRHAL 99
>ref|NP_534677.1| (NC_003305) transcriptional regulator, TetR family [Agrobacterium
tumefaciens str. C58 (U. Washington)]
gb|AAL44993.1| (AE009349) transcriptional regulator, TetR family [Agrobacterium
tumefaciens str. C58 (U. Washington)]
Length = 190
Score = 37.4 bits (85), Expect = 0.029
Identities = 20/72 (27%), Positives = 37/72 (50%), Gaps = 11/72 (15%)
Query: 12 AVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAI------- 64
A R+++I+ A ++A R GW S+T + +A+ + + P L H+ + A+
Sbjct: 12 ADREKRIIAAARMLAERDGWASVTVRRLAQEIEYSQPVLYAHFENRDAIVGAVALEGFGE 71
Query: 65 ----VRAAVRAN 72
+RA+VR N
Sbjct: 72 LGPTLRASVRRN 83
>ref|NP_511232.1| (NC_003292) tetracycline repressor protein [IncN plasmid R46]
sp|P03039|TER3_ECOLI TETRACYCLINE REPRESSOR PROTEIN CLASS C
pir||RPECYS tetracycline repressor - Escherichia coli plasmid pSC101
gb|AAA25677.1| (M36272) tetracycline resistance element repressor protein (tetR)
[Plasmid pSC101]
gb|AAG43219.1| (AY046276) tetracycline repressor protein [IncN plasmid R46]
prf||1101269A repressor tet [Salmonella enterica subsp. enterica serovar
Panama]
Length = 219
Score = 37.4 bits (85), Expect = 0.029
Identities = 15/60 (25%), Positives = 33/60 (55%)
Query: 13 VRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRAAVRAN 72
+++E +++ L + G E +T + +AE +G+ P+L +H++ L A+ A + N
Sbjct: 4 LQREAVIRTALELLNDVGMEGLTTRRLAERLGVQQPALYWHFKNKRALLDALAEAMLTIN 63
>pir||RPECR1 tet repressor protein (Tn 1721) - Escherichia coli plasmid RP1
emb|CAA24908.1| (X00006) tetR regulatory protein [unidentified bacterium]
gb|AAB67685.1| (U75324) TetR [Cloning vector pJB3Tc20]
gb|AAB82322.1| (U82001) TetR [Expression vector pJB866]
Length = 216
Score = 36.6 bits (83), Expect = 0.049
Identities = 19/69 (27%), Positives = 34/69 (48%), Gaps = 2/69 (2%)
Query: 18 IVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRAAVRANDGLVV 77
+++A L + G + +T + +AE +G+ P+L +H+R L A+ A + N V
Sbjct: 9 VIRAALDLLNEVGVDGLTTRKLAERLGVQQPALYWHFRNKRALLDALAEAMLAENHSTSV 68
Query: 78 AMAVQAGDW 86
A DW
Sbjct: 69 PRA--DDDW 75
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.321 0.131 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 55,920,324
Number of Sequences: 919193
Number of extensions: 1736924
Number of successful extensions: 3990
Number of sequences better than 5.0e-02: 27
Number of HSP's better than 0.1 without gapping: 21
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 3969
Number of HSP's gapped (non-prelim): 27
length of query: 111
length of database: 288,098,425
effective HSP length: 87
effective length of query: 24
effective length of database: 208,128,634
effective search space: 4995087216
effective search space used: 4995087216
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 83 (36.6 bits)
Query= orf48 [13774-13858]
(83 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.323 0.137 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 43,811,467
Number of Sequences: 919193
Number of extensions: 1420069
Number of successful extensions: 2117
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2117
Number of HSP's gapped (non-prelim): 0
length of query: 83
length of database: 288,098,425
effective HSP length: 59
effective length of query: 24
effective length of database: 233,866,038
effective search space: 5612784912
effective search space used: 5612784912
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 84 (37.0 bits)
Query= orf49 [13876-13947]
(70 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.318 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 40,666,443
Number of Sequences: 919193
Number of extensions: 1181701
Number of successful extensions: 1994
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 1994
Number of HSP's gapped (non-prelim): 0
length of query: 70
length of database: 288,098,425
effective HSP length: 46
effective length of query: 24
effective length of database: 245,815,547
effective search space: 5899573128
effective search space used: 5899573128
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 84 (37.0 bits)
Query= orf50 [13945-13998]
(52 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.340 0.156 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 24,850,366
Number of Sequences: 919193
Number of extensions: 650993
Number of successful extensions: 3522
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3522
Number of HSP's gapped (non-prelim): 0
length of query: 52
length of database: 288,098,425
effective HSP length: 28
effective length of query: 24
effective length of database: 262,361,021
effective search space: 6296664504
effective search space used: 6296664504
T: 11
A: 40
X1: 15 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf51 [13994-14115]
(120 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.326 0.139 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 56,115,016
Number of Sequences: 919193
Number of extensions: 1903534
Number of successful extensions: 6421
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 6421
Number of HSP's gapped (non-prelim): 0
length of query: 120
length of database: 288,098,425
effective HSP length: 96
effective length of query: 24
effective length of database: 199,855,897
effective search space: 4796541528
effective search space used: 4796541528
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 83 (36.6 bits)
Query= orf52 [14113-14260]
(146 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.327 0.141 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 72,648,722
Number of Sequences: 919193
Number of extensions: 2292330
Number of successful extensions: 5068
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 5068
Number of HSP's gapped (non-prelim): 0
length of query: 146
length of database: 288,098,425
effective HSP length: 109
effective length of query: 37
effective length of database: 187,906,388
effective search space: 6952536356
effective search space used: 6952536356
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 85 (37.4 bits)
Query= orf53 [14259-14422]
(162 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.323 0.141 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 87,289,747
Number of Sequences: 919193
Number of extensions: 3244195
Number of successful extensions: 4869
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4869
Number of HSP's gapped (non-prelim): 0
length of query: 162
length of database: 288,098,425
effective HSP length: 111
effective length of query: 51
effective length of database: 186,068,002
effective search space: 9489468102
effective search space used: 9489468102
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 86 (37.7 bits)
Query= orf54 [14426-14493]
(66 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.322 0.141 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 39,155,623
Number of Sequences: 919193
Number of extensions: 1267224
Number of successful extensions: 2391
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2391
Number of HSP's gapped (non-prelim): 0
length of query: 66
length of database: 288,098,425
effective HSP length: 42
effective length of query: 24
effective length of database: 249,492,319
effective search space: 5987815656
effective search space used: 5987815656
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf55 [14494-14558]
(63 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.327 0.139 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 24,553,366
Number of Sequences: 919193
Number of extensions: 455914
Number of successful extensions: 898
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 898
Number of HSP's gapped (non-prelim): 0
length of query: 63
length of database: 288,098,425
effective HSP length: 39
effective length of query: 24
effective length of database: 252,249,898
effective search space: 6053997552
effective search space used: 6053997552
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 84 (37.0 bits)
Query= orf56 [14558-14636]
(77 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.321 0.136 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 41,244,835
Number of Sequences: 919193
Number of extensions: 1300766
Number of successful extensions: 2914
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2914
Number of HSP's gapped (non-prelim): 0
length of query: 77
length of database: 288,098,425
effective HSP length: 53
effective length of query: 24
effective length of database: 239,381,196
effective search space: 5745148704
effective search space used: 5745148704
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf57 [14656-14704]
(47 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.321 0.140 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 23,988,848
Number of Sequences: 919193
Number of extensions: 568199
Number of successful extensions: 1291
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 1291
Number of HSP's gapped (non-prelim): 0
length of query: 47
length of database: 288,098,425
effective HSP length: 23
effective length of query: 24
effective length of database: 266,956,986
effective search space: 6406967664
effective search space used: 6406967664
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf58 [14703-14796]
(92 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.319 0.130 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 49,535,502
Number of Sequences: 919193
Number of extensions: 1626927
Number of successful extensions: 3066
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3066
Number of HSP's gapped (non-prelim): 0
length of query: 92
length of database: 288,098,425
effective HSP length: 68
effective length of query: 24
effective length of database: 225,593,301
effective search space: 5414239224
effective search space used: 5414239224
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf59 [14794-14910]
(115 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.318 0.136 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 55,455,584
Number of Sequences: 919193
Number of extensions: 1895909
Number of successful extensions: 15447
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 15447
Number of HSP's gapped (non-prelim): 0
length of query: 115
length of database: 288,098,425
effective HSP length: 91
effective length of query: 24
effective length of database: 204,451,862
effective search space: 4906844688
effective search space used: 4906844688
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 83 (36.6 bits)
Query= orf60 [14909-15047]
(137 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAC01493.1| (AL391017) putative polypeptide deformylase... 80 3e-015
ref|NP_600815.1| (NC_003450) COG0242:N-formylmethionyl-tRNA... 70 3e-012
ref|NP_213407.1| (NC_000918) polypeptide deformylase [Aquif... 69 1e-011
ref|NP_438782.1| (NC_000907) polypeptide deformylase (def) ... 68 1e-011
ref|NP_248709.1| (NC_002516) polypeptide deformylase [Pseud... 68 2e-011
sp|Q9XAQ2|DEF_STRCO Polypeptide deformylase (PDF) (Formylme... 67 2e-011
ref|NP_273168.1| (NC_003112) polypeptide deformylase [Neiss... 67 4e-011
ref|NP_601934.1| (NC_003450) COG0242:N-formylmethionyl-tRNA... 67 4e-011
ref|NP_219861.1| (NC_000117) Polypeptide Deformylase [Chlam... 66 7e-011
emb|CAB94606.1| (AL359214) putative polypeptide deformylase... 63 6e-010
ref|NP_246498.1| (NC_002663) Def [Pasteurella multocida] >g... 63 6e-010
ref|NP_487119.1| (NC_003272) polypeptide deformylase [Nosto... 62 7e-010
ref|NP_354552.1| (NC_003062) AGR_C_2856p [Agrobacterium tum... 59 8e-009
ref|NP_269070.1| (NC_002737) putative polypeptide deformyla... 57 3e-008
ref|NP_607075.1| (NC_003485) putative polypeptide deformyla... 57 3e-008
ref|NP_345996.1| (NC_003028) polypeptide deformylase [Strep... 52 8e-007
ref|NP_360311.1| (NC_003103) polypeptide deformylase [EC:3.... 51 2e-006
ref|NP_229461.1| (NC_000853) polypeptide deformylase [Therm... 50 4e-006
ref|NP_561676.1| (NC_003366) polypeptide deformylase [Clost... 47 4e-005
sp|O08450|DEF_CLOBE Polypeptide deformylase (PDF) (Formylme... 44 3e-004
>emb|CAC01493.1| (AL391017) putative polypeptide deformylase [Streptomyces
coelicolor A3(2)]
Length = 217
Score = 80.5 bits (197), Expect = 3e-015
Identities = 49/132 (37%), Positives = 63/132 (47%), Gaps = 6/132 (4%)
Query: 9 PELHAIAAEVPH-GEDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPTFK---- 63
P LHA AEV G ++ LV D NQVG R+ V CP +
Sbjct: 60 PVLHARCAEVTDFGPELAALVEDLFATMYAAHGVGLAANQVGEAVRVFVYDCPDDEDERH 119
Query: 64 -GCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIAAKGLTA 122
G V+NP + V PEGCLS PG +R ++ VV G+ + +P+T+ G A
Sbjct: 120 LGHVVNPRLVETGGVVVRGPEGCLSLPGLEAGTERYDEAVVTGFTVAGEPVTVRGTGFFA 179
Query: 123 FCLQHEIDHLNG 134
CLQHE DHL G
Sbjct: 180 RCLQHECDHLEG 191
>ref|NP_600815.1| (NC_003450) COG0242:N-formylmethionyl-tRNA deformylase
[Corynebacterium glutamicum]
Length = 169
Score = 70.5 bits (171), Expect = 3e-012
Identities = 43/122 (35%), Positives = 55/122 (44%), Gaps = 4/122 (3%)
Query: 18 VPHGEDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPT----FKGCVINPIITR 73
V E + L+ D NQVGVL+R+ V +G VINP+
Sbjct: 22 VDFDESLSTLIDDMFDTMEDAGGVGLAANQVGVLRRVFVFDTSHQEGGLRGHVINPVWEP 81
Query: 74 HTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIAAKGLTAFCLQHEIDHLN 133
T+ EGCLS P + R V + G D D P+ A GL A C+QHE DHL+
Sbjct: 82 LTEDTQTGKEGCLSIPDVSAETTRYETVRLSGQDRDGNPVGFVANGLLARCIQHETDHLD 141
Query: 134 GV 135
GV
Sbjct: 142 GV 143
>ref|NP_213407.1| (NC_000918) polypeptide deformylase [Aquifex aeolicus]
sp|O66847|DEF_AQUAE Polypeptide deformylase (PDF) (Formylmethionine deformylase)
pir||C70352 polypeptide deformylase - Aquifex aeolicus
gb|AAC06802.1| (AE000696) polypeptide deformylase [Aquifex aeolicus]
Length = 169
Score = 68.6 bits (166), Expect = 1e-011
Identities = 39/121 (32%), Positives = 61/121 (50%), Gaps = 7/121 (5%)
Query: 22 EDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLR------CPTFKGCVINPIITRHT 75
++VK+L+ D NQ+GV ++V+ P K +INP I +
Sbjct: 25 KEVKNLIRDMFDTMYEAEGVGLAANQIGVPLSVMVIDTSPKEDAPPLKLVLINPEI-KEG 83
Query: 76 DGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIAAKGLTAFCLQHEIDHLNGV 135
+G + EGCLS+PG +V +R KV V + +P+ + +G A QHE+DHL G+
Sbjct: 84 EGKIKYKEGCLSFPGLSVEVERFQKVKVNALNEHGEPVELTLEGFPAIVFQHELDHLKGI 143
Query: 136 T 136
T
Sbjct: 144 T 144
>ref|NP_438782.1| (NC_000907) polypeptide deformylase (def) [Haemophilus influenzae
Rd]
sp|P44786|DEF_HAEIN Polypeptide deformylase (PDF) (Formylmethionine deformylase)
pir||D64082 N-formylmethionylaminoacyl-tRNA deformylase (EC 3.5.1.27) -
Haemophilus influenzae (strain Rd KW20)
gb|AAC22282.1| (U32745) polypeptide deformylase (def) [Haemophilus influenzae Rd]
Length = 169
Score = 68.2 bits (165), Expect = 1e-011
Identities = 40/92 (43%), Positives = 51/92 (54%), Gaps = 4/92 (4%)
Query: 47 QVGVLKRIIVLRCPTFKG---CVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVV 103
QV +L+RII + K +INP I ++G EGCLS PG R+ KV V
Sbjct: 51 QVDILQRIITIDVEGDKQNQFVLINPEILA-SEGETGIEEGCLSIPGFRALVPRKEKVTV 109
Query: 104 EGYDMDWQPITIAAKGLTAFCLQHEIDHLNGV 135
D D + T+ A GL A C+QHEIDHLNG+
Sbjct: 110 RALDRDGKEFTLDADGLLAICIQHEIDHLNGI 141
>ref|NP_248709.1| (NC_002516) polypeptide deformylase [Pseudomonas aeruginosa]
sp|Q9I7A8|DEF_PSEAE Polypeptide deformylase (PDF) (Formylmethionine deformylase)
pir||H83643 polypeptide deformylase PA0019 [imported] - Pseudomonas aeruginosa
(strain PAO1)
gb|AAG03409.1|AE004441_10 (AE004441) polypeptide deformylase [Pseudomonas aeruginosa]
Length = 168
Score = 67.8 bits (164), Expect = 2e-011
Identities = 49/138 (35%), Positives = 61/138 (43%), Gaps = 4/138 (2%)
Query: 1 MKILKDDAPELHAIAAEVPHGED-VKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRC 59
+ IL+ P L IA V +D V+ L+ D QV V KRI+V+
Sbjct: 4 LNILEFPDPRLRTIAKPVEVVDDAVRQLIDDMFETMYEAPGIGLAATQVNVHKRIVVMDL 63
Query: 60 PTFKG---CVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIA 116
K INP T+ EGCLS PG R KV ++ D D P
Sbjct: 64 SEDKSEPRVFINPEFEPLTEDMDQYQEGCLSVPGFYENVDRPQKVRIKALDRDGNPFEEV 123
Query: 117 AKGLTAFCLQHEIDHLNG 134
A+GL A C+QHE DHLNG
Sbjct: 124 AEGLLAVCIQHECDHLNG 141
>sp|Q9XAQ2|DEF_STRCO Polypeptide deformylase (PDF) (Formylmethionine deformylase)
pir||T34626 probable polypetide deformylase - Streptomyces coelicolor
emb|CAB44533.1| (AL078618) putative polypetide deformylase [Streptomyces
coelicolor]
Length = 208
Score = 67.4 bits (163), Expect = 2e-011
Identities = 41/122 (33%), Positives = 56/122 (45%), Gaps = 8/122 (6%)
Query: 21 GEDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPTFKGC-----VINPIITRHT 75
G D+ L+ D NQVGV R+ V CP +G ++NP++
Sbjct: 48 GPDLAALIDDMFRTMYVAEGAGLAANQVGVDLRLFVYDCPDDEGVRHVGHLVNPVLDALD 107
Query: 76 DGH---VYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIAAKGLTAFCLQHEIDHL 132
+ EGCLS PG +A R ++ VV G D D P+ + G A CL HE DH+
Sbjct: 108 PAARRLLDEGEGCLSVPGAVMAVPRPDRAVVRGLDKDGVPLLVEGTGYFARCLAHETDHV 167
Query: 133 NG 134
NG
Sbjct: 168 NG 169
>ref|NP_273168.1| (NC_003112) polypeptide deformylase [Neisseria meningitidis MC58]
ref|NP_283013.1| (NC_003116) polypeptide deformylase [Neisseria meningitidis Z2491]
sp|Q9JQN0|DEF_NEIMA Polypeptide deformylase (PDF) (Formylmethionine deformylase)
pir||E81238 formylmethionine deformylase (EC 3.5.1.31) NMA0164 [imported] -
Neisseria meningitidis (group B strain MD58, group A
strain Z2491)
gb|AAF40569.1| (AE002369) polypeptide deformylase [Neisseria meningitidis MC58]
emb|CAB83478.1| (AL162752) polypeptide deformylase [Neisseria meningitidis Z2491]
Length = 167
Score = 66.6 bits (161), Expect = 4e-011
Identities = 44/139 (31%), Positives = 63/139 (44%), Gaps = 5/139 (3%)
Query: 1 MKILKDDAPELHAIAAEVPH-GEDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRC 59
+ IL+ LH +A V E ++ L+ D QV V +R++V+
Sbjct: 4 LNILQYPDERLHTVAKPVEQVDERIRKLIADMFETMYESRGIGLAATQVDVHERVVVMDL 63
Query: 60 PTFKG---CVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIA 116
+ INP+I DG EGCLS PG R +V VE + + T+
Sbjct: 64 TEDRSEPRVFINPVIVEK-DGETTYEEGCLSVPGIYDTVTRAERVKVEALNEKGEKFTLE 122
Query: 117 AKGLTAFCLQHEIDHLNGV 135
A GL A C+QHE+DHL G+
Sbjct: 123 ADGLLAICVQHELDHLMGI 141
>ref|NP_601934.1| (NC_003450) COG0242:N-formylmethionyl-tRNA deformylase
[Corynebacterium glutamicum]
Length = 193
Score = 66.6 bits (161), Expect = 4e-011
Identities = 40/125 (32%), Positives = 58/125 (46%), Gaps = 15/125 (12%)
Query: 23 DVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPT-----FKGCVINPIITRHT-- 75
++++L+ D NQ+GV KRI V CP KGC INP++
Sbjct: 27 ELQELIADMYETMDVANGVGLAANQIGVSKRIFVYDCPDDEGVMHKGCFINPVLETSEIP 86
Query: 76 ------DGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIAAKGLTAFCLQHEI 129
DG EGCLS PG+ R + V G + + +++ A+G A C QHE+
Sbjct: 87 ETMPADDGS--DEEGCLSVPGEGFPTGRAHWAKVTGLNEKGEEVSVEAEGFLARCFQHEV 144
Query: 130 DHLNG 134
HL+G
Sbjct: 145 GHLDG 149
>ref|NP_219861.1| (NC_000117) Polypeptide Deformylase [Chlamydia trachomatis]
sp|O84357|DEF_CHLTR Polypeptide deformylase (PDF) (Formylmethionine deformylase)
pir||B71526 probable polypeptide deformylase - Chlamydia trachomatis (serotype
D, strain UW3/Cx)
gb|AAC67948.1| (AE001308) Polypeptide Deformylase [Chlamydia trachomatis]
Length = 181
Score = 65.9 bits (159), Expect = 7e-011
Identities = 43/141 (30%), Positives = 65/141 (45%), Gaps = 12/141 (8%)
Query: 7 DAPELHAIAAEVPH-GEDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPT---- 61
D+P L +AA V ++++ LVLD QVG + ++
Sbjct: 9 DSPILRKVAAPVTEITDELRQLVLDMSETMAFYKGVGLAAPQVGQSISLFIMGVERELED 68
Query: 62 -------FKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPIT 114
F INP+IT+ ++ VY EGCLS PG R +K+ V ++D Q +
Sbjct: 69 GELVFCDFPRVFINPVITQKSEQLVYGNEGCLSIPGLRGEVARPDKITVSAKNLDGQQFS 128
Query: 115 IAAKGLTAFCLQHEIDHLNGV 135
+A +G A + HE DHL+GV
Sbjct: 129 LALEGFLARIVMHETDHLHGV 149
>emb|CAB94606.1| (AL359214) putative polypeptide deformylase [Streptomyces
coelicolor A3(2)]
Length = 216
Score = 62.8 bits (151), Expect = 6e-010
Identities = 46/138 (33%), Positives = 57/138 (40%), Gaps = 15/138 (10%)
Query: 9 PELHAIAAEVPH-GEDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPTFKGC-- 65
P LH +V GE+ + LV D NQ+GV K++ V CP +G
Sbjct: 50 PVLHKECEDVTDFGEEFQQLVADMFASQRTAEGVGLAANQIGVSKKVFVYDCPDDEGVRH 109
Query: 66 ---VINPIIT------RHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIA 116
V NP + R D S EGCLS P R + V G D P+ +
Sbjct: 110 VGVVCNPRLVELPADRRRLDD---SNEGCLSVPTAYAPLARPDYAEVTGQDEKGNPVKVR 166
Query: 117 AKGLTAFCLQHEIDHLNG 134
G A CLQHE DHL G
Sbjct: 167 GTGYFARCLQHETDHLYG 184
>ref|NP_246498.1| (NC_002663) Def [Pasteurella multocida]
sp|P57948|DEF_PASMU Polypeptide deformylase (PDF) (Formylmethionine deformylase)
gb|AAK03643.1| (AE006193) Def [Pasteurella multocida]
Length = 170
Score = 62.8 bits (151), Expect = 6e-010
Identities = 38/117 (32%), Positives = 59/117 (49%), Gaps = 4/117 (3%)
Query: 22 EDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPTFKG---CVINPIITRHTDGH 78
++++++V + QV + +RII + K +INP I + G
Sbjct: 26 DEIREIVDNMFETMYLEEGIGLAATQVNIHQRIITIDVEGTKENQYVLINPEII-DSCGE 84
Query: 79 VYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIAAKGLTAFCLQHEIDHLNGV 135
EGCLS PG R+ KV ++ D + T++A+GL A C+QHEIDHLNG+
Sbjct: 85 TGIEEGCLSLPGFRGFVPRKEKVTIKALDRHGEEYTLSAEGLLAICIQHEIDHLNGI 141
>ref|NP_487119.1| (NC_003272) polypeptide deformylase [Nostoc sp. PCC 7120]
dbj|BAB74778.1| (AP003591) polypeptide deformylase [Nostoc sp. PCC 7120]
Length = 187
Score = 62.4 bits (150), Expect = 7e-010
Identities = 39/119 (32%), Positives = 57/119 (47%), Gaps = 5/119 (4%)
Query: 22 EDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPTFKGC-----VINPIITRHTD 76
++ + L+ D QVG+ K++IV+ C +INP I + +
Sbjct: 39 DETRQLIRDMLQTMYSSDGIGLAAPQVGINKQLIVIDCEPDNPANPPLILINPTIKQVSR 98
Query: 77 GHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIAAKGLTAFCLQHEIDHLNGV 135
+ EGCLS PG + KR V V D + +P T+ A L C+QHE+DHLNGV
Sbjct: 99 EICSAQEGCLSIPGVYMDVKRPEVVEVAYKDENGRPQTLKATDLLGRCIQHEMDHLNGV 157
>ref|NP_354552.1| (NC_003062) AGR_C_2856p [Agrobacterium tumefaciens] [Agrobacterium
tumefaciens str. C58 (Cereon)]
ref|NP_532239.1| (NC_003304) polypeptide deformylase [Agrobacterium tumefaciens str.
C58 (U. Washington)]
gb|AAK87337.1| (AE008078) AGR_C_2856p [Agrobacterium tumefaciens str. C58
(Cereon)]
gb|AAL42555.1| (AE009113) polypeptide deformylase [Agrobacterium tumefaciens str.
C58 (U. Washington)]
Length = 164
Score = 58.9 bits (141), Expect = 8e-009
Identities = 43/127 (33%), Positives = 60/127 (46%), Gaps = 2/127 (1%)
Query: 11 LHAIAAEVPHGED-VKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRC-PTFKGCVIN 68
L I A V +D +++LV D +GVL+R+ VL P IN
Sbjct: 14 LSGICAPVTVFDDHLRELVTDLIDTMRAAPGVGITAAHIGVLQRVFVLELTPGTILTYIN 73
Query: 69 PIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIAAKGLTAFCLQHE 128
P IT H+ + EG +S PG T +R + V V D+ T A+G A C+QHE
Sbjct: 74 PEITSHSPQTMRHVEGSVSMPGFTDEVERPSTVEVRFQDITGAEQTETAEGFHAICIQHE 133
Query: 129 IDHLNGV 135
ID L+G+
Sbjct: 134 IDQLDGI 140
>ref|NP_269070.1| (NC_002737) putative polypeptide deformylase [Streptococcus
pyogenes] [Streptococcus pyogenes M1 GAS]
gb|AAK33791.1| (AE006536) putative polypeptide deformylase [Streptococcus pyogenes
M1 GAS]
Length = 136
Score = 57.0 bits (136), Expect = 3e-008
Identities = 32/92 (34%), Positives = 49/92 (52%), Gaps = 2/92 (2%)
Query: 46 NQVGVLKRIIVLRCPTFKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEG 105
N +G KRI+++ + NP++ G + E CLS G + R +K+ VE
Sbjct: 47 NMIGEQKRIVIVSMGFIDLVMFNPVMVSKK-GIYQTKESCLSLSGYRKTQ-RYDKITVEY 104
Query: 106 YDMDWQPITIAAKGLTAFCLQHEIDHLNGVTI 137
D +W+P ++ GLTA QHE+DHL G+ I
Sbjct: 105 LDHNWRPKRLSLTGLTAQICQHELDHLEGILI 136
>ref|NP_607075.1| (NC_003485) putative polypeptide deformylase [Streptococcus
pyogenes MGAS8232]
gb|AAL97574.1| (AE010021) putative polypeptide deformylase [Streptococcus pyogenes
MGAS8232]
Length = 136
Score = 57.0 bits (136), Expect = 3e-008
Identities = 32/92 (34%), Positives = 49/92 (52%), Gaps = 2/92 (2%)
Query: 46 NQVGVLKRIIVLRCPTFKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEG 105
N +G KRI+++ + NP++ G + E CLS G + R +K+ VE
Sbjct: 47 NMIGEQKRIVIVSMGFIDLVMFNPVMVSKK-GIYQTKESCLSLSGYRKTQ-RYDKITVEY 104
Query: 106 YDMDWQPITIAAKGLTAFCLQHEIDHLNGVTI 137
D +W+P ++ GLTA QHE+DHL G+ I
Sbjct: 105 LDHNWRPKRLSLTGLTAQICQHELDHLEGILI 136
>ref|NP_345996.1| (NC_003028) polypeptide deformylase [Streptococcus pneumoniae
TIGR4]
ref|NP_359001.1| (NC_003098) Formylmethionine deformylase [Streptococcus pneumoniae
R6]
gb|AAK75636.1| (AE007449) polypeptide deformylase [Streptococcus pneumoniae TIGR4]
gb|AAL00212.1| (AE008510) Formylmethionine deformylase [Streptococcus pneumoniae
R6]
gb|AAK13237.1| (AY014508) peptide deformylase-like protein [Streptococcus
pneumoniae]
Length = 136
Score = 52.4 bits (124), Expect = 8e-007
Identities = 31/89 (34%), Positives = 43/89 (47%), Gaps = 2/89 (2%)
Query: 46 NQVGVLKRIIVLRCPTFKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEG 105
N +GV KR+I+ + NP++ +G + EGCLS G + KR + +
Sbjct: 47 NMIGVQKRVIIFNLGLVPVVMFNPVLLSF-EGSYEAEEGCLSLVGVR-STKRYETIRLAY 104
Query: 106 YDMDWQPITIAAKGLTAFCLQHEIDHLNG 134
D WQ TI G A QHE+DHL G
Sbjct: 105 RDSKWQEQTITLTGFPAQICQHELDHLEG 133
>ref|NP_360311.1| (NC_003103) polypeptide deformylase [EC:3.5.1.31] [Rickettsia
conorii]
gb|AAL03212.1| (AE008626) polypeptide deformylase [EC:3.5.1.31] [Rickettsia
conorii]
Length = 183
Score = 50.8 bits (120), Expect = 2e-006
Identities = 34/95 (35%), Positives = 49/95 (50%), Gaps = 4/95 (4%)
Query: 46 NQVGVLKRIIVLRC----PTFKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKV 101
N VG+LKRI V+ + INP IT ++ EG LS+PG + R +
Sbjct: 54 NMVGILKRIAVVDLHENNKSSPIVFINPNITYFSEEKQTFIEGSLSFPGIEASITRSKAI 113
Query: 102 VVEGYDMDWQPITIAAKGLTAFCLQHEIDHLNGVT 136
V+ D + +AA+G A +QHEI++LNG T
Sbjct: 114 KVKYLDYNGNKQELAAEGFLATVIQHEIEYLNGKT 148
>ref|NP_229461.1| (NC_000853) polypeptide deformylase [Thermotoga maritima]
sp|P96113|DEF_THEMA Polypeptide deformylase (PDF) (Formylmethionine deformylase)
pir||C72224 polypeptide deformylase - Thermotoga maritima (strain MSB8)
emb|CAA71356.1| (Y10306) polypeptide deformylase [Thermotoga maritima]
gb|AAD36728.1|AE001808_3 (AE001808) polypeptide deformylase [Thermotoga maritima]
Length = 164
Score = 50.1 bits (118), Expect = 4e-006
Identities = 31/91 (34%), Positives = 44/91 (48%)
Query: 47 QVGVLKRIIVLRCPTFKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGY 106
QVG+ +R V+ VINP I + EGCLS+P V +R ++ V+
Sbjct: 49 QVGISQRFFVMDVGNGPVAVINPEILEIDPETEVAEEGCLSFPEIFVEIERSKRIKVKYQ 108
Query: 107 DMDWQPITIAAKGLTAFCLQHEIDHLNGVTI 137
+ + + +G A QHE DHLNGV I
Sbjct: 109 NTRGEYVEEELEGYAARVFQHEFDHLNGVLI 139
>ref|NP_561676.1| (NC_003366) polypeptide deformylase [Clostridium perfringens]
dbj|BAB80466.1| (AP003188) polypeptide deformylase [Clostridium perfringens]
Length = 136
Score = 46.6 bits (109), Expect = 4e-005
Identities = 30/93 (32%), Positives = 47/93 (50%), Gaps = 4/93 (4%)
Query: 46 NQVGVLKRIIVLRCPTFKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVV-VE 104
N +GV KRI+V ++NP+I + + + E CLS G K R +++ VE
Sbjct: 47 NMIGVKKRILVFSIGNIIVPMVNPVILKKEKSYE-TEESCLSLTG--FRKTTRYEIIEVE 103
Query: 105 GYDMDWQPITIAAKGLTAFCLQHEIDHLNGVTI 137
D +++ G TA +QHE+DH G+ I
Sbjct: 104 YLDRNFKKHKETFSGFTAQIIQHEVDHFEGIII 136
>sp|O08450|DEF_CLOBE Polypeptide deformylase (PDF) (Formylmethionine deformylase)
emb|CAB09662.1| (Z96934) formylmethionine deformylase; peptide deformylase
[Clostridium beijerinckii]
Length = 136
Score = 43.9 bits (102), Expect = 3e-004
Identities = 32/94 (34%), Positives = 45/94 (47%), Gaps = 6/94 (6%)
Query: 46 NQVGVLKRIIVLRCPTFKGCVINPIITRHTDGHVYSPEGCLSYPG--KTVAKKRRNKVVV 103
N +GV KRI+V +INP+I + + + E CLS G KT KR + V
Sbjct: 47 NMIGVKKRILVFTVGNLIVPMINPVILKKEKPYE-TEESCLSLIGFRKT---KRYETIEV 102
Query: 104 EGYDMDWQPITIAAKGLTAFCLQHEIDHLNGVTI 137
D ++ G TA +QHE+DH G+ I
Sbjct: 103 TYLDRNFNKKKQVFNGFTAQIIQHEMDHFEGIII 136
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.322 0.141 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 73,786,096
Number of Sequences: 919193
Number of extensions: 2521587
Number of successful extensions: 4113
Number of sequences better than 5.0e-02: 103
Number of HSP's better than 0.1 without gapping: 88
Number of HSP's successfully gapped in prelim test: 15
Number of HSP's that attempted gapping in prelim test: 4001
Number of HSP's gapped (non-prelim): 104
length of query: 137
length of database: 288,098,425
effective HSP length: 113
effective length of query: 24
effective length of database: 184,229,616
effective search space: 4421510784
effective search space used: 4421510784
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 83 (36.6 bits)
Query= orf61 [15046-15141]
(94 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.320 0.137 2.88e-153
Gapped
Lambda K H
0.267 0.0410 2.18e+252
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 44,660,447
Number of Sequences: 919193
Number of extensions: 1388488
Number of successful extensions: 4619
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4619
Number of HSP's gapped (non-prelim): 0
length of query: 94
length of database: 288,098,425
effective HSP length: 70
effective length of query: 24
effective length of database: 223,754,915
effective search space: 5370117960
effective search space used: 5370117960
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf62 [15123-15259]
(135 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.324 0.140 2.18e+252
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 95,817,587
Number of Sequences: 919193
Number of extensions: 3978823
Number of successful extensions: 6867
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 6867
Number of HSP's gapped (non-prelim): 0
length of query: 135
length of database: 288,098,425
effective HSP length: 111
effective length of query: 24
effective length of database: 186,068,002
effective search space: 4465632048
effective search space used: 4465632048
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 83 (36.6 bits)
Query= orf63 [15268-16060]
(791 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_489430.1| (NC_003273) hypothetical protein [Nostoc s... 99 2e-019
gb|AAB00938.1| (L31763) virulence-associated protein E [Dic... 85 3e-015
ref|NP_371315.1| (NC_002758) hypothetical protein [Staphylo... 85 4e-015
gb|AAL67615.1| (U93688) orf15 [Staphylococcus aureus] 85 4e-015
gb|AAL04136.1|AF410775_12 (AF410775) unknown [Staphylococcu... 84 6e-015
ref|NP_569125.1| (NC_003374) hypothetical protein [Gluconob... 83 1e-014
pir||S49113 hypothetical protein 2 - Microcystis aeruginosa... 81 5e-014
ref|NP_279342.1| (NC_002607) Vng0215c [Halobacterium sp. NR... 55 2e-006
>ref|NP_489430.1| (NC_003273) hypothetical protein [Nostoc sp. PCC 7120]
dbj|BAB77438.1| (AP003604) ORF_ID:all8519~hypothetical protein [Nostoc sp. PCC
7120]
Length = 836
Score = 98.6 bits (244), Expect = 2e-019
Identities = 87/317 (27%), Positives = 147/317 (45%), Gaps = 20/317 (6%)
Query: 466 KSDIKGMIAYMAHRYAHNPVKAWVESAP--WDGLDHVGLLFSHITLTPDEDRPLCEHLFR 523
K K ++ +A +++PV +++S + +D V LL + PL L +
Sbjct: 341 KEQAKSVVLELALANSYSPVVEYLQSVERRYPNVD-VSLLDKLAERYFGTNDPLHAALMK 399
Query: 524 KWMRAAVSAGVGDQAGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRTDSVLLDVKDKDSVK 583
+ + AAV+ + + + +Q + K+ F+ L E F TD L+ +KD V
Sbjct: 400 RTLIAAVARAFEPGCKHDEITILQGKQKSL-KSTFWEILAGEEFFTDD--LNGTEKDEVL 456
Query: 584 TAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYPRMTAYVGSVNEA 643
YW++E E + ++ + + + LKAFLSR KD +R PYG +PR + VG+ N
Sbjct: 457 KISQYWILEYAEFENSYKKKDVSQLKAFLSRKKDSMRRPYGTDIEDFPRPSILVGTTNLD 516
Query: 644 EFLVDTTSNRRFIPMKVAALNHQHRVNVQQ---VWAQALQEARSG---MATYVEAHEVAQ 697
EFL D T RRF +KV ++ Q+ +WA A+ R+G T VE +
Sbjct: 517 EFLYDPTGERRFWVIKVLFKKIPIKMLQQERDLIWAAAVAAYRNGEQWRLTEVEDEWLDA 576
Query: 698 RNASFQATSAIDDLLSERLNEATGERNVHMTVTALLK---RCGLHNPTKRDLNDAGKWLR 754
N +Q+T ++ + A E ++V +L + L +K + N LR
Sbjct: 577 ANKQYQSTDTWEEAVM-----AWAELQKEVSVGEILSDVLKIELAKQSKAEQNRVAAILR 631
Query: 755 SNGYERCVRTGVRGFML 771
S+G+ R R V G ++
Sbjct: 632 SHGWTRGDRKRVNGRLI 648
>gb|AAB00938.1| (L31763) virulence-associated protein E [Dichelobacter nodosus]
Length = 437
Score = 85.1 bits (209), Expect = 3e-015
Identities = 71/250 (28%), Positives = 121/250 (48%), Gaps = 20/250 (8%)
Query: 465 PKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHITLTPDEDRPLCEHLFRK 524
P I I +A + P++ ++ES WDG + F E +
Sbjct: 96 PIDSIDMGIKMLAREHIFCPIQQYLESLQWDGKPRIATAFHDFLGV--ERNTYTIGISTL 153
Query: 525 WMRAAVSAGVGDQAGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRTDSVLLDVKDKDSVKT 584
+ RA V+ + V+V +QG +GK+ F R L E + + + K++++
Sbjct: 154 FFRALVARVYQPAIKFDHVIVLQGDQG-IGKSSFLRILGGEWYSDG--IRKFEGKEAIEA 210
Query: 585 AISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYPRMTAYVGSVNEAE 644
+ E+ EL G FS++E +KAF++RT+D R+ YGR +PR T +VG+ N +
Sbjct: 211 IQGVLIGEIPELQG-FSKAEIEEIKAFITRTEDRARMAYGRRVQSFPRRTVFVGTTNADD 269
Query: 645 FLVDTTSNRRFIPMKV-AALNHQHRVNV-QQVWAQALQEARSGMATYV----------EA 692
+L D T NRRF P+ AL+ + V +Q++A+A+ ++ A YV E
Sbjct: 270 YLRDATGNRRFFPIICHKALDFEALAEVREQLFAEAVVHYKA--APYVPLILQGKEALEQ 327
Query: 693 HEVAQRNASF 702
++AQ+ AS+
Sbjct: 328 AKLAQQAASY 337
>ref|NP_371315.1| (NC_002758) hypothetical protein [Staphylococcus aureus subsp.
aureus Mu50]
dbj|BAB56953.1| (AP003360) hypothetical protein [Staphylococcus aureus subsp.
aureus Mu50]
Length = 489
Score = 84.7 bits (208), Expect = 4e-015
Identities = 65/234 (27%), Positives = 107/234 (44%), Gaps = 14/234 (5%)
Query: 452 LSEIESIGNLNHYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHITLTP 511
+S I+ + N+ + + I +I A++ +P+K+ +ES WDG+ + LF I
Sbjct: 95 ISHIDKLYNVQ-FSRDLIDTVIEKEAYQNRFHPIKSMIESKSWDGIKRIETLF--IDYLG 151
Query: 512 DEDRPLCEHLFRKWMRAAVSAGVGDQAGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRTDS 571
ED + +KWM AV+ + +++ QG VGK+ L +
Sbjct: 152 AEDNHYNREVTKKWMMGAVARIYQPGIKYDSMIILYGGQG-VGKSTAVSKLGGHWYNQS- 209
Query: 572 VLLDVKDKDSVKTAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYP 631
+ K + K W+ E+ EL F +S +K F+S D R YG+ ++P
Sbjct: 210 -IKTFKGDEVYKKLQGSWICEIEELSA-FQKSTIEDIKGFISAIVDIYRASYGKRTERHP 267
Query: 632 RMTAYVGSVNEAEFLVDTTSNRRFIPMKV-------AALNHQHRVNVQQVWAQA 678
R +VG+ N EFL D T NRRF P+ + + V VQQ++A+A
Sbjct: 268 RQCVFVGTTNNYEFLKDQTGNRRFFPITTDKNKATKSPFDDLTPVVVQQMFAEA 321
>gb|AAL67615.1| (U93688) orf15 [Staphylococcus aureus]
Length = 477
Score = 84.7 bits (208), Expect = 4e-015
Identities = 65/234 (27%), Positives = 107/234 (44%), Gaps = 14/234 (5%)
Query: 452 LSEIESIGNLNHYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHITLTP 511
+S I+ + N+ + + I +I A++ +P+K+ +ES WDG+ + LF I
Sbjct: 87 ISHIDKLYNVQ-FSRDLIDTVIEKEAYQNRFHPIKSMIESKSWDGIKRIETLF--IDYLG 143
Query: 512 DEDRPLCEHLFRKWMRAAVSAGVGDQAGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRTDS 571
ED + +KWM AV+ + +++ QG VGK+ L +
Sbjct: 144 AEDNHYNREVTKKWMMGAVARIYQPGIKYDSMIILYGGQG-VGKSTAVSKLGGHWYNQS- 201
Query: 572 VLLDVKDKDSVKTAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYP 631
+ K + K W+ E+ EL F +S +K F+S D R YG+ ++P
Sbjct: 202 -IKTFKGDEVYKKLQGSWICEIEELSA-FQKSTIEDIKGFISAIVDIYRASYGKRTERHP 259
Query: 632 RMTAYVGSVNEAEFLVDTTSNRRFIPMKV-------AALNHQHRVNVQQVWAQA 678
R +VG+ N EFL D T NRRF P+ + + V VQQ++A+A
Sbjct: 260 RQCVFVGTTNNYEFLKDQTGNRRFFPITTDKNKATKSPFDDLTPVVVQQMFAEA 313
>gb|AAL04136.1|AF410775_12 (AF410775) unknown [Staphylococcus aureus]
Length = 474
Score = 84.0 bits (206), Expect = 6e-015
Identities = 59/207 (28%), Positives = 96/207 (45%), Gaps = 7/207 (3%)
Query: 452 LSEIESIGNLNHYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHITLTP 511
+S I+ + N+ + + I +I A++ +P+K+ +ES WDG+ + LF I
Sbjct: 84 ISHIDKLYNVQ-FSRDLIDTVIEKEAYQNRFHPIKSMIESKSWDGIKRIETLF--IDYLG 140
Query: 512 DEDRPLCEHLFRKWMRAAVSAGVGDQAGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRTDS 571
ED + +KWM AV+ + +++ QG VGK+ L +
Sbjct: 141 AEDNHYNREVTKKWMMGAVARIYQPGIKYDSMIILYGGQG-VGKSTAVSKLGGHWYNQS- 198
Query: 572 VLLDVKDKDSVKTAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYP 631
+ K + K W+ E+ EL F +S +K F+S D R YG+ ++P
Sbjct: 199 -IKTFKGDEVYKKLQGSWICEIEELSA-FQKSTIEDIKGFISAIVDIYRASYGKRTERHP 256
Query: 632 RMTAYVGSVNEAEFLVDTTSNRRFIPM 658
R +VG+ N EFL D T NRRF P+
Sbjct: 257 RQCVFVGTTNNYEFLKDQTGNRRFFPI 283
>ref|NP_569125.1| (NC_003374) hypothetical protein [Gluconobacter oxydans]
emb|CAD21834.1| (AJ428837) hypothetical protein [Gluconobacter oxydans]
Length = 416
Score = 82.8 bits (203), Expect = 1e-014
Identities = 73/248 (29%), Positives = 108/248 (43%), Gaps = 35/248 (14%)
Query: 463 HYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGL-LFSHITLTPDEDRPLCEHL 521
++ + I +A A++P++ ++ WDG+ + L + E L
Sbjct: 99 NFSTQQVSAAIQSVAEENAYHPIRDYLTGLEWDGVQRLDTWLHDFLGAEDSEYTAGVGRL 158
Query: 522 FRKWMRAAVSAGVGDQAGC--EPVLVFVDEQGG---------VGKTRFFRTLCPEPFRTD 570
F M A V + GC + VL+F QG G +R+F P P R
Sbjct: 159 FLIGMVARVM-----EPGCKNDHVLIFEGNQGIRKSTACKVLAGGSRYFSDSLP-PIREG 212
Query: 571 SVLLDVKDKDSVKTAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKY 630
S KD WLVE+ E+ SR+++ ALKAFL+RT + R YGR +
Sbjct: 213 S-------KDLASHLAGKWLVEVAEMSAA-SRADTEALKAFLTRTHETYRPAYGRNEITQ 264
Query: 631 PRMTAYVGSVNEAEFLVDTTSNRRFIPMKVAALNHQHRVNVQQVWAQALQEARSGMATYV 690
PR VG+ N E+L D+T +RR P+K VN + LQEAR + +
Sbjct: 265 PRQCVLVGTTNRKEYLRDSTGDRRVWPVKCG-------VNGEPCDIDGLQEARDQL--FA 315
Query: 691 EAHEVAQR 698
EA + QR
Sbjct: 316 EAFHLYQR 323
>pir||S49113 hypothetical protein 2 - Microcystis aeruginosa
emb|CAA82191.1| (Z28337) ORF2 [Microcystis aeruginosa]
Length = 502
Score = 80.9 bits (198), Expect = 5e-014
Identities = 67/218 (30%), Positives = 106/218 (47%), Gaps = 21/218 (9%)
Query: 555 KTRFFRTLCPEPFRTDSVLLDVKDKDSVKTAISYWLVELGELDGTFSRSESNALKAFLSR 614
K+ FF+TL E F TD+++ DKD + T W++E ELD +++ +K+FLSR
Sbjct: 278 KSTFFKTLGGEYF-TDAMMST--DKDGLLTMAKNWIIEWSELDHFTAKTYHGIIKSFLSR 334
Query: 615 TKDDIRLPYGRTNMKYPRMTAYVGSVNEAEFLVDTTSNRRFIPMKVAALNHQHRVNVQQV 674
+ D R PY R M +PR + VGS N FL D T NRRF + + +H++N+
Sbjct: 335 SDDFYREPYAREAMSHPRCSMIVGSTNRDTFLNDPTGNRRF---WIIPIPKKHKINID-- 389
Query: 675 WAQALQEARSGMAT--YVEAHEV----------AQRNASFQATSAIDDLLSERLNEATGE 722
+ QA +E G A Y++ A+ N ++ + +D L+E L+ + +
Sbjct: 390 YLQAYREKLLGWAVWRYLDGEPCQLPEEFKALQAEANKQWENNDSWEDELAEFLDRES-D 448
Query: 723 RNVHMTVTALLKRCGLHNPTKRDLNDAGKWLRSNGYER 760
+V + L R N + D G LR G+ R
Sbjct: 449 LSVKECLQRLEDRGYPINFGRSDEMRMGDILRKAGFSR 486
>ref|NP_279342.1| (NC_002607) Vng0215c [Halobacterium sp. NRC-1]
gb|AAG18822.1| (AE004986) Vng0215c [Halobacterium sp. NRC-1]
Length = 857
Score = 55.5 bits (132), Expect = 2e-006
Identities = 33/112 (29%), Positives = 55/112 (48%), Gaps = 11/112 (9%)
Query: 175 EHDSAERVEITDQEAQTVREALSYIDPGDKREEWVKVGLALKSGFGDDPQGLSLFDEWSS 234
++D E ++ +D + +ALS+++P +EW+K+G A+ + G LF++WS
Sbjct: 212 DYDGDEWLDKSD-----IEDALSHVNPDCGYDEWLKLGFAVHD-WDSGSTGKRLFEQWSK 265
Query: 235 GALWQDGDMPANYVPEHIETQWHSFKAEGGRTIATVYHKAIEGGWQPPAGIN 286
G D N +I+ W S G T+ T+ H A +GGW P N
Sbjct: 266 GGSKWD-----NQAERNIQDIWDSASEGQGVTVGTLIHYAKDGGWTVPTRSN 312
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.317 0.134 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 467,303,236
Number of Sequences: 919193
Number of extensions: 20590572
Number of successful extensions: 54834
Number of sequences better than 5.0e-02: 14
Number of HSP's better than 0.1 without gapping: 10
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 54811
Number of HSP's gapped (non-prelim): 17
length of query: 791
length of database: 288,098,425
effective HSP length: 128
effective length of query: 663
effective length of database: 170,441,721
effective search space: 113002861023
effective search space used: 113002861023
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 95 (41.2 bits)
Query= orf64 [16258-16489]
(230 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
919,193 sequences; 288,098,425 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Apr 9, 2002 12:51 PM
Number of letters in database: 288,098,425
Number of sequences in database: 919,193
Lambda K H
0.319 0.132 2.31e-152
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 108,266,773
Number of Sequences: 919193
Number of extensions: 3883020
Number of successful extensions: 11952
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 11952
Number of HSP's gapped (non-prelim): 0
length of query: 230
length of database: 288,098,425
effective HSP length: 116
effective length of query: 114
effective length of database: 181,472,037
effective search space: 20687812218
effective search space used: 20687812218
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 89 (38.9 bits)