National Center for Biotechnology Information (NCBI)
welcome to the blast network service.
PEPTIDE SEQUENCE DATABASES
nr Non-redundant GenBank CDS translations+PDB+SwissProt+PIR
month All new or revised GenBank CDS translation+PDB+SwissProt+PIR
sequences released in the last 30 days
pdb PDB protein sequences
yeast Yeast (Saccharomyces cerevisiae) protein sequences.
kabat Kabat Sequences of Proteins of Immunological Interest
alu * Translations of Select Alu Repeats from REPBASE
swissprot SwissProt sequences
NUCLEOTIDE SEQUENCE DATABASES
nr Non-redundant GenBank+EMBL+DDBJ+PDB sequences
(but no EST, STS, GSS, or HTGS sequences)
month All new or revised GenBank+EMBL+DDBJ+PDB sequences released in
the last 30 days
yeast Yeast (Saccharomyces cerevisiae) genomic nucleotide sequences.
est + Non-redundant Database of GenBank+EMBL+DDBJ EST Division
sts + Non-redundant Database of GenBank+EMBL+DDBJ STS Division
htgs High Throughput Genomic Sequences
pdb PDB nucleotide sequences
vector Vector subset of GenBank
mito * Database of mitochondrial sequences, Rel. 1.0, July 1995
gss Genome Survey Sequences (includes single-pass genomic data, exon-
trapped sequences, and Alu PCR primers.
kabat Kabat Sequences of Nucleic Acid of Immunological Interest
epd Eukaryotic Promotor Database
alu *+ Select Alu Repeats from REPBASE
BLASTP 2.2.2 [Dec-14-2001]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= orf1 [1-217]
(213 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.314 0.129 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 90,918,893
Number of Sequences: 884779
Number of extensions: 2942183
Number of successful extensions: 6784
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 6784
Number of HSP's gapped (non-prelim): 0
length of query: 213
length of database: 277,083,050
effective HSP length: 115
effective length of query: 98
effective length of database: 175,333,465
effective search space: 17182679570
effective search space used: 17182679570
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 88 (38.5 bits)
Query= orf2 [211-887]
(675 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_518974.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED... 205 1e-051
dbj|BAA89640.1| (AB036666) similar to terminase large subun... 203 5e-051
dbj|BAA89621.1| (AB036665) similar to terminase large subun... 201 3e-050
ref|NP_297798.1| (NC_002488) phage-related terminase large ... 115 2e-024
ref|NP_298001.1| (NC_002488) phage-related terminase large ... 96 1e-018
ref|NP_461544.1| (NC_003197) Gifsy-1 prophage: similar to t... 93 1e-017
ref|NP_310206.1| (NC_002695) putative terminase large subun... 92 1e-017
ref|NP_287368.1| (NC_002655) putative DNA packaging protein... 92 2e-017
ref|NP_040581.1| (NC_001416) A (DNA packaging;641) [bacteri... 91 3e-017
sp|P36693|TERL_BPP21 TERMINASE LARGE SUBUNIT (GP2) >gi|5407... 89 1e-016
>ref|NP_518974.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD14555.1| (AL646061) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 660
Score = 205 bits (521), Expect = 1e-051
Identities = 178/621 (28%), Positives = 282/621 (44%), Gaps = 59/621 (9%)
Query: 29 LAPSEWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
L SEW++R R + + S PG + PY R ++DC P SP+ V K AQ+GAT
Sbjct: 26 LTVSEWSDRHRMLSSKASAEPGRWRTGRTPYLRAIMDCLSPTSPIERVVFMKAAQLGAT- 84
Query: 89 SVLENIIAYGIDYVKTASM-IFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGT 147
+ N I Y I + M ++ TV ++ KR I P++E S + A +
Sbjct: 85 EMGSNWIGYVIHHAPGPMMAVWPTV--EMAKRNSKQRIDPLIEESA----VLAERIAPAR 138
Query: 148 KR-SGATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLT 206
R SG T E+ GGVL GA S +RS PV +L DEV G+P V +GD + L
Sbjct: 139 SRDSGNTILAKEF-RGGVLVMTGANSAVGLRSMPVRYLFLDEVDGYPLDVEGEGDAISLA 197
Query: 207 ETRTNSFSATRKVLDLSTPLPAGNDTITQRFEKGDQRYWEVPCKHCGGFQRLYFRGNAKK 266
E RT +F A RK+ +STP AG TI + ++ DQR + VPC HC Q L F
Sbjct: 198 EARTRTF-ARRKIFIVSTPTIAGASTIEREYDASDQRRYFVPCPHCDHRQWLRFE----- 251
Query: 267 GQGRLIWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGHWVPTAKPSRPDFRSYHL 326
+L W T+G P++ Y+C C + K + +G W + +HL
Sbjct: 252 ---QLRW-TKG---EPETAAYICEACFEPIHEHHKAWMLAQGEWRAMVETGGRT-AGFHL 303
Query: 327 SAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYELKSNRVKPREV 386
S++Y+P RSW+EIA AW + +A ++ F N +LGE + ++ +
Sbjct: 304 SSLYSPVGWRSWREIAAAWESAVSKESGSAA---AIKTFRNTELGETW-VEEGEAPDWQR 359
Query: 387 YAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAPSADHSGYVPYV 446
RR+ Y +G VP + AG ++ + V V WA G ++
Sbjct: 360 LLERREDYPIGTVPAGGLLLSAGADVQKDRIEVSV-----------WAFG---RGKEAWL 405
Query: 447 IDYEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASELTDVVYEYCNDW 506
+++ + GD T WK+L E++ + + G +AR+ +D T Y +
Sbjct: 406 VEHRVLMGD--TARDAVWKRLAELVEEQWTHASGATMPLARLALDTGFATQEAYAFVRAC 463
Query: 507 GE-NVIPIRG--RDLPIKGAQIKY---FNRQMSEKGVEYLSVTVDLYKDRWSPTLRK--E 558
G+ V+ ++G R + G N + +G++ +V V + K + LRK +
Sbjct: 464 GDARVMAVKGTARGAALIGTPTAVDVTRNGKKLRRGIKVFTVAVGIAKLEFYNNLRKAAD 523
Query: 559 WSGQG---EMPRGMFNAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGGSRNEM 615
+ G P G + P I++++L++L E RD + I + W + RNE
Sbjct: 524 VAKDGATIAFPTGFVHLP-KIDAEFLQQLCAEQLITRRDRSGFPI--RAWQKM-RERNEA 579
Query: 616 WDTLIYNTAIFESMVLEACED 636
D +Y A + L+ E+
Sbjct: 580 LDCYVYARAAASAAGLDRFEE 600
>dbj|BAA89640.1| (AB036666) similar to terminase large subunit of phage lambda
[Wolbachia sp. wKue]
Length = 609
Score = 203 bits (516), Expect = 5e-051
Identities = 177/634 (27%), Positives = 280/634 (43%), Gaps = 83/634 (13%)
Query: 29 LAPSEWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
L SEWA R + + PG + + PY +E++D P SP V KGAQ+G T
Sbjct: 17 LKVSEWANEYRVLAPTAASEPGKWRTERTPYLKEIMDSLSPSSPTEKVVFMKGAQIGGTE 76
Query: 89 SVLENIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTK 148
+ N I Y ID M+ ++ KR P++E + +++ ++
Sbjct: 77 AG-NNWIGYIIDQTP-GPMLVVQPTVEMGKRWSKGRFAPLIESTPC---LKSKVKDPRSR 131
Query: 149 RSGATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTET 208
SG T + E+ GG V+ GA S +RS PV +L DE+ +P G +GDP+ L+
Sbjct: 132 DSGNTVQSKEFPGGIVVIT-GANSSVALRSMPVKYLFLDEIDAYPGDSGGEGDPVLLSIA 190
Query: 209 RTNSFSATRKVLDLSTPLPAGNDTITQRFEKGDQRYWEVPCKHCGGFQRLYFRGNAKKGQ 268
RTN+F A RK+ +STP G I + FE D+RY+ VPC HC +Q L +
Sbjct: 191 RTNTF-ARRKIFLVSTPTIHGISRIEKEFEATDKRYFFVPCPHCNYYQVLKW-------- 241
Query: 269 GRLIWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGHWVPTAKPSRPDFRSYHLSA 328
++ WE + YVC C G++ N K + G W PT + + + + +HLS+
Sbjct: 242 SQIKWENSDS----RTAHYVCTECSGKIENHQKTEMLDRGEWRPTNR-VKGEKKGFHLSS 296
Query: 329 MYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYELKSNRVKPREVYA 388
+Y+P SWQ+ +D +A + L+V+ N LGE + K ++++
Sbjct: 297 LYSPVGWYSWQQAV-------EDFLHAKESEQLLKVWINTTLGETWVDKGEVPDWKQLF- 348
Query: 389 HRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAPSADHSGYVPYVID 448
+RR+ + +G VP + AG VDVQ L V + W S + + ID
Sbjct: 349 NRREFFQIGTVPRREVVLTAG---------VDVQKDRLEVEVVAWGKSRE-----SWSID 394
Query: 449 YEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASELTDVVYEYC--NDW 506
Y+ EGD T GE W KL+E+++ G Y I+ + +DA T VY + +
Sbjct: 395 YQVFEGD--TGGGEVWGKLSELLNHHFIGENGLEYMISMMAVDAGYATQEVYNWVRGHQG 452
Query: 507 GENVIPIRG-------------RDLPIKGAQIKYFNRQMSEKGVEYLSVTVDLYKDRWSP 553
V+ ++G D+ + G ++K +G++ V V + K
Sbjct: 453 SGRVMAVKGVNKALVPLSSPSRVDITVGGQKLK--------RGIKLWPVGVSILKSELFQ 504
Query: 554 TLRKEWSGQGEMPRGMFNAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGGSRN 613
L +G+ G + P +Y K+LT E + +Q W + RN
Sbjct: 505 LLNILKEEEGKALPGYCHFP-EYAPEYFKQLTAE---QLVSKVVKGYTKQEWQKV-RERN 559
Query: 614 EMWDTLIYNTAIFESMVLEACEDILGLEALVWPE 647
E+ D IY A LG++ WPE
Sbjct: 560 EVLDCRIY---------ARAASIALGIDR--WPE 582
>dbj|BAA89621.1| (AB036665) similar to terminase large subunit of lambda
[bacteriophage WO]
Length = 610
Score = 201 bits (510), Expect = 3e-050
Identities = 175/629 (27%), Positives = 278/629 (43%), Gaps = 75/629 (11%)
Query: 29 LAPSEWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
L SEWA R + + PG + + PY +E++D P SP V KGAQ+G T
Sbjct: 17 LKVSEWANEYRVLAATAASEPGKWRTERTPYLKEIMDSLSPSSPAEKVIFMKGAQIGGTE 76
Query: 89 SVLENIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTK 148
+ N I Y ID M+ ++ KR P++E + +++ ++
Sbjct: 77 AG-NNWIGYIIDQTP-GPMLVVQPTVEMGKRWSKGRFAPLIESTPC---LKSKVKDPRSR 131
Query: 149 RSGATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTET 208
SG T + E+ GG V+ GA S +RS PV +L DE+ +P G +GDP+ L+
Sbjct: 132 DSGNTVQSKEFPGGIVVIT-GANSSVGLRSMPVKYLFLDEIDAYPGDSGGEGDPVLLSIA 190
Query: 209 RTNSFSATRKVLDLSTPLPAGNDTITQRFEKGDQRYWEVPCKHCGGFQRLYFRGNAKKGQ 268
RTN+F A RK+ +STP G I + FE D+RY+ VPC HC +Q L +
Sbjct: 191 RTNTF-ARRKIFLVSTPTIHGISRIEKEFEATDKRYFFVPCPHCNYYQVLKW-------- 241
Query: 269 GRLIWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGHWVPTAKPSRPDFRSYHLSA 328
++ WE + P++ Y+C CG ++ N K + G W T + + + +HLS+
Sbjct: 242 SQIKWEDKN----PNTAHYICIECGKKIENHQKTEMLDRGEWRAT-EAKEGEKKGFHLSS 296
Query: 329 MYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYELKSNRVKPREVYA 388
+Y+P W +W +D +A + L+V+ N LGE + K ++++
Sbjct: 297 LYSPV---GW----YSWTQAVEDFLHAKESEQLLKVWINTTLGETWVDKGEVPDWKQLF- 348
Query: 389 HRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAPSADHSGYVPYVID 448
+RR+ + +G VP + AG VDVQ L V + W S ++ + ID
Sbjct: 349 NRREFFPVGTVPRREVVLTAG---------VDVQEDRLEVEVVAWGKSREN-----WSID 394
Query: 449 YEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASELTDVVYEYCNDWGE 508
Y EGD T GE W KL+E+++ G Y I+ + +DA T VY +
Sbjct: 395 YRVFEGD--TGGGEVWGKLSELLNHHFIGENGLEYMISMMAVDAGYATQEVYNWVRGHQG 452
Query: 509 NVIPIRGRDLPIKGAQIKYFNR----------QMSEKGVEYLSVTVDLYKDRWSPTLRKE 558
+ GR + +KG + + Q ++G++ V V + K L
Sbjct: 453 S-----GRVMAVKGVALVPLSSPSRVDITVGGQKLKRGIKLWPVGVSILKSELFQLLNVL 507
Query: 559 WSGQGEMPRGMFNAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGGSRNEMWDT 618
+G+ G + P +Y K+LT E + +Q W + RNE+ D
Sbjct: 508 KEEEGKALPGYCHFP-EYAPEYFKQLTAE---QLVSKVVKGYTKQEWQKV-RERNEVLDC 562
Query: 619 LIYNTAIFESMVLEACEDILGLEALVWPE 647
IY A LG++ WPE
Sbjct: 563 RIY---------ARAASIALGIDR--WPE 580
>ref|NP_297798.1| (NC_002488) phage-related terminase large subunit [Xylella
fastidiosa 9a5c]
pir||D82798 phage-related terminase large subunit XF0508 [imported] - Xylella
fastidiosa (strain 9a5c)
gb|AAF83318.1|AE003899_10 (AE003899) phage-related terminase large subunit [Xylella
fastidiosa 9a5c]
Length = 699
Score = 115 bits (287), Expect = 2e-024
Identities = 116/472 (24%), Positives = 183/472 (38%), Gaps = 49/472 (10%)
Query: 32 SEWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATVSVL 91
S+WA+ R I + PG + D P RE+++C ++PV V+ K Q+GAT +
Sbjct: 35 SQWADAYRKIARGSGAEPGQWRTDRHPPLREIMNCLSDHTPVQQVSFMKSGQIGAT-EIG 93
Query: 92 ENIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTKRSG 151
N + Y ID + ++ V D + F +L+ L + + N+ +
Sbjct: 94 INWVCYVIDRGIDSMIVTQPVKDLARTWTVAKFDPGVLDMPPLLNKLTTNNTFE------ 147
Query: 152 ATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTETRTN 211
+ GG L+ A S ++R + DE+ +P+ + + G + R
Sbjct: 148 ------KQYPGGTLFVKWANSSSQLRQITACYAFLDEIDEYPRNLNNQGTADQQIAARIM 201
Query: 212 SFSATRKVLDLSTPLPAGNDTITQRFEKGDQRYWEVPCKHCGGFQRLYFRGNAKKGQGRL 271
S K+ TP AG I F GDQR++ + C HCG Q L + G
Sbjct: 202 SHGERGKIYRACTPTVAGGSAIEASFLDGDQRHYHIHCPHCGSEQILDL--EHLQPDGTF 259
Query: 272 IWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGHWVPTAKPSRPDFRSYHLSAMYA 331
G I + H ++ E G W P + PD RSY+L A YA
Sbjct: 260 ACAVNGCI--------IQEHHKNTILKERGTG--GTAFWHPHNPSAPPDHRSYYLWAAYA 309
Query: 332 PY-YARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYELKSNRVKPREVYAHR 390
P SW++IA W + A +D +L F N LG ++ + + EV
Sbjct: 310 PLGLGLSWKQIADKWAE-------AKRDPSKLPGFTNLILGLPFQGERDARNAHEVATLA 362
Query: 391 RDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAPSADHSGYVPYVIDYE 450
G VP + AG VD+ H V I A G Y++DY
Sbjct: 363 EPGVYRGLVPRGGLVLAAG---------VDLAHDRAEVHLI-----ATGRGQRRYIVDYA 408
Query: 451 HVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASELTDVVYEY 502
++ D + E + L + + G I+ V ID T+ V ++
Sbjct: 409 VIDLDPTVL--ESYADLDAYLRGTWKTASGIDMPISAVAIDGGNWTETVAQF 458
>ref|NP_298001.1| (NC_002488) phage-related terminase large subunit [Xylella
fastidiosa 9a5c]
ref|NP_299778.1| (NC_002488) phage-related terminase large subunit [Xylella
fastidiosa 9a5c]
pir||C82551 phage-related terminase large subunit XF2500 XF0711 [imported] -
Xylella fastidiosa (strain 9a5c)
gb|AAF83521.1|AE003914_2 (AE003914) phage-related terminase large subunit [Xylella
fastidiosa 9a5c]
gb|AAF85298.1|AE004057_11 (AE004057) phage-related terminase large subunit [Xylella
fastidiosa 9a5c]
Length = 580
Score = 95.9 bits (237), Expect = 1e-018
Identities = 142/561 (25%), Positives = 225/561 (39%), Gaps = 68/561 (12%)
Query: 76 VAVKKGAQVGATVSVLENIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLA 135
V +K GAQVG T+ L + Y I+ + M+ D+ T F + + GL
Sbjct: 8 VTLKWGAQVGKTMLAL-CVQGYCIEMAPRSQMMLQPSQGDLQAWLETKFSPLIAANQGLQ 66
Query: 136 HLIQANDFSKGTKRSGATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQK 195
HLI +K R G + ++ GG L + SP MR P ++ DE+ G+ +
Sbjct: 67 HLI-----AKPRGRDGVNNQRMKSYPGGFLMFAWSGSPKTMRGRSAPLIVCDEIDGYERT 121
Query: 196 VGSDGDPMKLTETRTNSFSATRKVLDLSTPLPAGNDTITQRFEKGDQRYWEVPCKHCGGF 255
+G P+ L R +F R +L++STP G I + GDQR + V C CG
Sbjct: 122 --DEGHPVSLLWQRAATFGDERFLLEISTPTIEGASYIDDAYRAGDQRRFYVRCPACGYE 179
Query: 256 QRLYFRGNAKKGQGRLIWETEGGILV-----PDSVRYVCPHCGGEMINEDKVAFMGEGHW 310
Q L + + G+ + + + P + RYVC CG + ++A + + W
Sbjct: 180 QTLEWEHVSWVGRQS---DPDADLAAIDAHQPHTARYVCQGCGVCWDDGQRIAAVRQARW 236
Query: 311 VPTAKPSRPDFRSYHLSAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDL 370
+KP SY L+ +Y+ + R I Q ++D +LQ F N L
Sbjct: 237 -QASKPFN-GHASYELNELYSTF--RRQSAIVQDYLD--------KLKHQDLQTFTNVSL 284
Query: 371 GEAYELKSNRVKPREVYAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLT 430
+ + + ++ RR L +VP GG +T +D+Q L V
Sbjct: 285 ARVWSETAEQADIDDLL--RRLETYLADVP-------MGGVF--LTAGIDMQTDRLEVEI 333
Query: 431 IGWAPSADHSGYVPYVIDYEHVEGDCKTIEGEGWKKLTEIISTR-QYESGGRLYDIARVG 489
+ W + + I + GD + G+ W L +ST Q+ESG RL I
Sbjct: 334 VAWGIDEE-----SWSIHTAVLYGD--PLLGDVWDALDRYLSTTWQHESGLRL-SIQAAC 385
Query: 490 ID---ASELTDVVYEYCND-WGENVIPIR-----GRDLPIKGAQIKYFNRQMSEKGVEYL 540
+D S T Y+Y G + I+ GR + + AQ K+ R +
Sbjct: 386 LDTGGTSGYTQAAYQYLRTRTGRRLFGIKGVGGWGRPI-VDKAQRKHSGRNAPR--INLF 442
Query: 541 SVTVDLYKDRWSPTLRKEWSGQGEMPRGMFNAPSNIESKYLKELTVEYKREERDAATNKI 600
+V VD K L G G + P++ + +LT E R +
Sbjct: 443 TVGVDEAKLIVMRRLAITQPGP-----GYSHFPADRSPDWFAQLTAEKLRTR--YLKGQP 495
Query: 601 IRQVWHRPGGSRNEMWDTLIY 621
+RQ W +P + NE D +Y
Sbjct: 496 VRQ-WTKPDKTPNEALDCRVY 515
>ref|NP_461544.1| (NC_003197) Gifsy-1 prophage: similar to terminase large chain gp2
of N15 [Salmonella typhimurium LT2]
gb|AAL21503.1| (AE008818) Gifsy-1 prophage: similar to terminase large chain gp2
of N15 [Salmonella typhimurium LT2]
Length = 643
Score = 92.8 bits (229), Expect = 1e-017
Identities = 146/628 (23%), Positives = 246/628 (38%), Gaps = 90/628 (14%)
Query: 29 LAPSEWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
L +WA+ ++P++ S PG ++ P+ +++ Y + V + K A+VG T
Sbjct: 28 LTTVQWADEYYYLPKESSYTPG--KWETLPFQVAIMNAMG-YELIRVVNLIKSARVGYTK 84
Query: 89 SVLENIIAYGIDYVKTASMIFATVD---DDVTKRRLTNFI--LPMLEHSGLAHLIQANDF 143
+L + Y I++ S++F D +D K + I +P+L L A F
Sbjct: 85 MLL-GVEGYFIEHKSRNSLLFQPTDSSAEDFMKSHVEPTIRDVPVL-------LELAPWF 136
Query: 144 SKGTKRSGATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPM 203
+ + + T K + G + G + R V + DE+S + V +G P
Sbjct: 137 GRKHRDNTLTLK--RFSSGVGFWCLGGAAAKNYREKSVDVVCYDELSSFEPDVEKEGSPT 194
Query: 204 KLTETRTNSFSATRKVLDLSTPLPAGNDTITQRF-EKGDQRYWEVPCKHCGGFQRLYFRG 262
L + R S K + STP G+ I + E + VPC HCG Q L F G
Sbjct: 195 LLGDKRIEG-SVWPKSIRGSTPKVKGSCQIEKAANESAHFMRFYVPCPHCGEEQYLKF-G 252
Query: 263 NAKKGQGRLIWETEGGILVPDSVRYVCPHCG--------------------GEMINEDKV 302
+ G L WE P++V Y+C H G G +
Sbjct: 253 DGSTPFG-LKWEKSK----PETVYYLCEHNGCVIRQSELDQKAGRWICDNTGMWTRDGLA 307
Query: 303 AFMGEGHWVPTAKPSRPDFRSYHLSAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDEL 362
F G VP P ++H+ Y+P+ +W +I W+D A KD + +
Sbjct: 308 YFSASGEEVPP-----PRSITFHIWTAYSPF--TTWIQIIYDWLD-------ALKDPNGV 353
Query: 363 QVFYNNDLGEAYELKSNRVKPREVYAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQ 422
+ F N LGE YE E A + L E ++A A + +T +D Q
Sbjct: 354 KTFINTTLGEPYE---------EAVAEKLSHELLLEKVIHYA-APVPERVVYLTAGIDSQ 403
Query: 423 HTWLAVLTIGWAPSADHSGYVPYVIDYEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRL 482
+ GWAP G ++ID + + G + + + +++ +I+ + + G
Sbjct: 404 RNRYEMYVWGWAP-----GEEAFLIDKQIIMG--RHDDEDTLQRVDAVINKKYRHADGTD 456
Query: 483 YDIARV-----GIDASELTDVVYEYCNDWG-ENVIPIRGRDLPIKGAQIKYFNRQMSEKG 536
I+R+ GIDA ++VY+ G V+P++G + G + ++ ++ G
Sbjct: 457 ISISRICWDIGGIDA----EIVYKRSKKHGIFRVLPVKGAS--VYGKPVITMPKKRNQSG 510
Query: 537 VEYLSVTVDLYKDRWSPTLRKEWSGQGEMPRGMFNAPSNIESKYLKELTVEYKREERDAA 596
V + D K+ + + E P N + E E +
Sbjct: 511 VFLCEIGTDTAKEMLYARMGAVTAPADEATPYAIRFPDNPDVFTEVEAKQLVAEELVEKL 570
Query: 597 TNKIIRQVWHRPGGSRNEMWDTLIYNTA 624
N R +W G RNE D L+Y +A
Sbjct: 571 VNGKFRLLWDAK-GRRNEALDCLVYASA 597
>ref|NP_310206.1| (NC_002695) putative terminase large subunit [Escherichia coli
O157:H7]
dbj|BAB35602.1| (AP002557) putative terminase large subunit [Escherichia coli
O157:H7]
Length = 641
Score = 92.4 bits (228), Expect = 1e-017
Identities = 142/643 (22%), Positives = 251/643 (38%), Gaps = 69/643 (10%)
Query: 33 EWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATVSVLE 92
EWA+ ++P++ + G ++ P+ R +++ + V V K A+VG + +L
Sbjct: 32 EWADASYYLPKESAYQEG--RWETLPFQRAIMNAMGS-DYIREVNVVKSARVGYSKMLL- 87
Query: 93 NIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTKRSGA 152
+ AY I++ + ++I+ D D T+ + + L L A + K + +
Sbjct: 88 GVYAYFIEHKQRNTLIWLPTDGDAENFMKTHVEPTIRDIPSLLSL--APWYGKKHRDNTL 145
Query: 153 TAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTETRTNS 212
T K + G + G ++ R V DE++ + + + +G P L + R
Sbjct: 146 TMK--RFTNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDEDIEQEGSPTFLGDKRIEG 203
Query: 213 FSATRKVLDLSTPLPAGNDTITQRFEKGDQ-RYWEVPCKHCGGFQRLYFRGNAKKGQGRL 271
S K + STP G I + + + V C HCG Q L F K+ L
Sbjct: 204 -SVWPKSIRGSTPKVRGTCQIERAASESPHFMRFHVACPHCGEEQYLKF--GDKETPFGL 260
Query: 272 IWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGH--------------WVPTA--K 315
W + P SV Y+C H +I + ++ F + W ++ +
Sbjct: 261 KWTPDD----PSSVFYLCEH-NACVIRQQELDFTDARYICEKTGIWTRDGILWFSSSGEE 315
Query: 316 PSRPDFRSYHLSAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYE 375
PD ++H+ Y+P+ +W +I + W+ D + + F N LGE +E
Sbjct: 316 IEPPDSVTFHIWTAYSPF--TTWVQIVKDWM-------KTKGDTGKRKTFVNTTLGETWE 366
Query: 376 LKSNRVKPREVYAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAP 435
K EV A R++ Y+ VP+ A AG +D Q + GW P
Sbjct: 367 AKIGERPDAEVMAERKEHYS-ASVPDRVAYLTAG---------IDSQLDRYEMRVWGWGP 416
Query: 436 SADHSGYVPYVIDYEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASEL 495
G ++ID + + G + + + ++ E I+ G ++R+ D +
Sbjct: 417 -----GEESWLIDRQIIMG--RHDDEQTLLRVDEAINKTYTRRNGAEMSVSRICWDTGGI 469
Query: 496 -TDVVYEYCNDWG-ENVIPIRGRDLPIKGAQIKYFNRQMSEKGVEYLSVTVDLYKDRWSP 553
+VYE G VIPI+G + G + R+ ++ GV + D K++
Sbjct: 470 DPTIVYERSKKHGLFRVIPIKGAS--VYGKPVASMPRKRNKNGVYLTEIGTDTAKEQIYN 527
Query: 554 TLRKEWSGQGEMPRGMF--NAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGGS 611
G +P + N P + ++LT E + E+ KI+ R
Sbjct: 528 RFTLTPEGDEPLPGAVHFPNNPDIFDLTEAQQLTAEEQVEKWVDGRKKILWDSKKR---- 583
Query: 612 RNEMWDTLIYNTAIFESMVLEACEDILGLEALVWPEFWAIAGK 654
RNE D +Y A + D+ L A + E A K
Sbjct: 584 RNEALDCFVYALAALRISISRWQLDLSALLASLQEEDGAATNK 626
>ref|NP_287368.1| (NC_002655) putative DNA packaging protein of prophage CP-933X
[Escherichia coli O157:H7 EDL933]
ref|NP_309657.1| (NC_002695) terminase large subunit [Escherichia coli O157:H7]
gb|AAG55980.1|AE005330_12 (AE005330) putative DNA packaging protein of prophage CP-933X
[Escherichia coli O157:H7 EDL933]
dbj|BAB35053.1| (AP002555) terminase large subunit [Escherichia coli O157:H7]
Length = 641
Score = 92.0 bits (227), Expect = 2e-017
Identities = 142/643 (22%), Positives = 251/643 (38%), Gaps = 69/643 (10%)
Query: 33 EWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATVSVLE 92
EWA+ ++P++ + G ++ P+ R +++ + V V K A+VG + +L
Sbjct: 32 EWADASYYLPKESAYQEG--RWETLPFQRAIMNAMGS-DYIREVNVVKSARVGYSKMLL- 87
Query: 93 NIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTKRSGA 152
+ AY I++ + ++I+ D D T+ + + L L A + K + +
Sbjct: 88 GVYAYFIEHKQRNTLIWLPTDGDAENFMKTHVEPTIRDIPSLLAL--APWYGKKHRDNTL 145
Query: 153 TAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTETRTNS 212
T K + G + G ++ R V DE++ + + + +G P L + R
Sbjct: 146 TMK--RFTNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDEDIEQEGSPTFLGDKRIEG 203
Query: 213 FSATRKVLDLSTPLPAGNDTITQRFEKGDQ-RYWEVPCKHCGGFQRLYFRGNAKKGQGRL 271
S K + STP G I + + + V C HCG Q L F K+ L
Sbjct: 204 -SVWPKSIRGSTPKVRGTCQIERAASESPHFMRFHVACPHCGEEQYLKF--GDKETPFGL 260
Query: 272 IWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGH--------------WVPTA--K 315
W + P SV Y+C H +I + ++ F + W ++ +
Sbjct: 261 KWTPDD----PSSVFYLCEH-NACVIRQQELDFTDARYICEKTGIWTRDGILWFSSSGEE 315
Query: 316 PSRPDFRSYHLSAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYE 375
PD ++H+ Y+P+ +W +I + W+ D + + F N LGE +E
Sbjct: 316 IEPPDSVTFHIWTAYSPF--TTWVQIVKDWM-------KTKGDTGKRKTFVNTTLGETWE 366
Query: 376 LKSNRVKPREVYAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAP 435
K EV A R++ Y+ VP+ A AG +D Q + GW P
Sbjct: 367 AKIGERPDAEVMAERKEHYS-ASVPDRVAYLTAG---------IDSQLDRYEMRVWGWGP 416
Query: 436 SADHSGYVPYVIDYEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASEL 495
G ++ID + + G + + + ++ E I+ G ++R+ D +
Sbjct: 417 -----GEESWLIDRQIIMG--RHDDEQTLLRVDEAINKTYTRRNGAEMSVSRICWDTGGI 469
Query: 496 -TDVVYEYCNDWG-ENVIPIRGRDLPIKGAQIKYFNRQMSEKGVEYLSVTVDLYKDRWSP 553
+VYE G VIPI+G + G + R+ ++ GV + D K++
Sbjct: 470 DPTIVYERSKKHGLFRVIPIKGAS--VYGKPVASMPRKRNKNGVYLTEIGTDTAKEQIYN 527
Query: 554 TLRKEWSGQGEMPRGMF--NAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGGS 611
G +P + N P + ++LT E + E+ KI+ R
Sbjct: 528 RFTLTPEGDEPLPGAVHFPNNPDIFDLTEAQQLTAEEQVEKWVDGRKKILWDSKKR---- 583
Query: 612 RNEMWDTLIYNTAIFESMVLEACEDILGLEALVWPEFWAIAGK 654
RNE D +Y A + D+ L A + E A K
Sbjct: 584 RNEALDCFVYALAALRISISRWQLDLSALLASLQEEDGAATNK 626
>ref|NP_040581.1| (NC_001416) A (DNA packaging;641) [bacteriophage lambda]
sp|P03708|TERL_LAMBD TERMINASE LARGE SUBUNIT (DNA PACKAGING PROTEIN A) (GPA)
pir||JVBPAL DNA-packaging protein A - phage lambda
gb|AAA96534.1| (J02459) A (DNA packaging;641) [bacteriophage lambda]
Length = 641
Score = 91.3 bits (225), Expect = 3e-017
Identities = 143/643 (22%), Positives = 250/643 (38%), Gaps = 69/643 (10%)
Query: 33 EWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATVSVLE 92
EWA+ ++P++ + G ++ P+ R +++ + V V K A+VG + +L
Sbjct: 32 EWADANYYLPKESAYQEG--RWETLPFQRAIMNAMGS-DYIREVNVVKSARVGYSKMLL- 87
Query: 93 NIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTKRSGA 152
+ AY I++ + ++I+ D D T+ + + L L A + K + +
Sbjct: 88 GVYAYFIEHKQRNTLIWLPTDGDAENFMKTHVEPTIRDIPSLLAL--APWYGKKHRDNTL 145
Query: 153 TAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTETRTNS 212
T K + G + G ++ R V DE++ + + +G P L + R
Sbjct: 146 TMK--RFTNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDDDIEQEGSPTFLGDKRIEG 203
Query: 213 FSATRKVLDLSTPLPAGNDTITQRFEKGDQ-RYWEVPCKHCGGFQRLYFRGNAKKGQGRL 271
S K + STP G I + + + V C HCG Q L F K+ L
Sbjct: 204 -SVWPKSIRGSTPKVRGTCQIERAASESPHFMRFHVACPHCGEEQYLKF--GDKETPFGL 260
Query: 272 IWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGH--------------WVPTA--K 315
W + P SV Y+C H +I + ++ F + W ++ +
Sbjct: 261 KWTPDD----PSSVFYLCEH-NACVIRQQELDFTDARYICEKTGIWTRDGILWFSSSGEE 315
Query: 316 PSRPDFRSYHLSAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYE 375
PD ++H+ Y+P+ +W +I + W+ D + + F N LGE +E
Sbjct: 316 IEPPDSVTFHIWTAYSPF--TTWVQIVKDWM-------KTKGDTGKRKTFVNTTLGETWE 366
Query: 376 LKSNRVKPREVYAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAP 435
K EV A R++ Y+ VP+ A AG +D Q + GW P
Sbjct: 367 AKIGERPDAEVMAERKEHYS-APVPDRVAYLTAG---------IDSQLDRYEMRVWGWGP 416
Query: 436 SADHSGYVPYVIDYEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASEL 495
G ++ID + + G + + + ++ E I+ G I+R+ D +
Sbjct: 417 -----GEESWLIDRQIIMG--RHDDEQTLLRVDEAINKTYTRRNGAEMSISRICWDTGGI 469
Query: 496 -TDVVYEYCNDWG-ENVIPIRGRDLPIKGAQIKYFNRQMSEKGVEYLSVTVDLYKDRWSP 553
+VYE G VIPI+G + G + R+ ++ GV + D K++
Sbjct: 470 DPTIVYERSKKHGLFRVIPIKGAS--VYGKPVASMPRKRNKNGVYLTEIGTDTAKEQIYN 527
Query: 554 TLRKEWSGQGEMPRGMF--NAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGGS 611
G +P + N P + ++LT E + E+ KI+ R
Sbjct: 528 RFTLTPEGDEPLPGAVHFPNNPDIFDLTEAQQLTAEEQVEKWVDGRKKILWDSKKR---- 583
Query: 612 RNEMWDTLIYNTAIFESMVLEACEDILGLEALVWPEFWAIAGK 654
RNE D +Y A + D+ L A + E A K
Sbjct: 584 RNEALDCFVYALAALRISISRWQLDLSALLASLQEEDGAATNK 626
>sp|P36693|TERL_BPP21 TERMINASE LARGE SUBUNIT (GP2)
pir||B49849 terminase large chain gp2 - phage 21
gb|AAA32340.1| (M81255) terminase large subunit [Bacteriophage 21]
Length = 642
Score = 89.0 bits (219), Expect = 1e-016
Identities = 138/611 (22%), Positives = 238/611 (38%), Gaps = 62/611 (10%)
Query: 29 LAPSEWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
L +WA + ++P++ S PG ++ P+ +++C + V + K A+VG T
Sbjct: 28 LTTVQWANKHYYLPKESSYTPG--RWETLPFQVGIMNCMGN-DLIRTVNLIKSARVGYTK 84
Query: 89 SVLENIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTK 148
+L + AY I++ S++F D + + + + P + A L A F + +
Sbjct: 85 MLL-GVEAYFIEHKSRNSLLFQPTDS-AAEDFMKSHVEPTIRDVP-AMLELAPWFGRKHR 141
Query: 149 RSGATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTET 208
+ T K + G + G + R V + DE+S + V +G P L +
Sbjct: 142 DNTLTLK--RFSSGVGFWCLGGAAAKNYREKSVDVVCYDELSSFEPDVEKEGSPTLLGDK 199
Query: 209 RTNSFSATRKVLDLSTPLPAGNDTITQRF-EKGDQRYWEVPCKHCGGFQRLYFRGNAKKG 267
R S K + STP G+ I + E + VPC HCG Q L F +A
Sbjct: 200 RIEG-SVWPKSIRGSTPKIKGSCQIEKAANESAHFMRFYVPCPHCGEEQYLKFGDDASPF 258
Query: 268 QGRLIWETEGGILVPDSVRYVCPHCG------------GEMINEDKVAFMGEGHWVPTAK 315
L WE P+SV Y+C H G G I E+ + +G +A+
Sbjct: 259 G--LKWEKNK----PESVFYLCEHHGCVIHQSELDQSNGRWICENTGMWTRDGLMFFSAR 312
Query: 316 PSR---PDFRSYHLSAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGE 372
P ++H+ Y+P+ +W +I W+D A KD + L+ F N LGE
Sbjct: 313 GDEIPPPRSITFHIWTAYSPF--TTWVQIVYDWLD-------ALKDPNGLKTFVNTTLGE 363
Query: 373 AYELKSNRVKPREVYAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIG 432
+E +V + Y A + +T +D Q + G
Sbjct: 364 TWEEAVGEKLDHQVLMDKVVRYT----------AAVPARVVYLTAGIDSQRNRFEMYVWG 413
Query: 433 WAPSADHSGYVPYVIDYEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDA 492
WAP G +++D + G + E E ++ I+ + + G I+RV D
Sbjct: 414 WAP-----GEEAFLVDKIIIMG--RPDEEETLLRVDAAINKKYRHADGTEMTISRVCWDI 466
Query: 493 SELT-DVVYEYCNDWGE-NVIPIRGRDLPIKGAQIKYFNRQMSEKGVEYLSVTVDLYKDR 550
+ ++VY+ G V+P++G + G + + +++GV V D K+
Sbjct: 467 GGIDGEIVYQRSKKHGVFRVLPVKGAS--VYGKPVITMPKTRNQRGVYLCEVGTDTAKEI 524
Query: 551 WSPTLRKEWSGQGEMPRGMFNAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGG 610
++ + + E P + E E E + +R +W
Sbjct: 525 LYARMKADPTPVDEATSYAIRFPDDPEIFSQTEAQQLVAEELVEKWEKGKMRLLWDNK-K 583
Query: 611 SRNEMWDTLIY 621
RNE D L+Y
Sbjct: 584 RRNEALDCLVY 594
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.318 0.136 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 412,883,444
Number of Sequences: 884779
Number of extensions: 18875352
Number of successful extensions: 42466
Number of sequences better than 5.0e-02: 21
Number of HSP's better than 0.1 without gapping: 4
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 42391
Number of HSP's gapped (non-prelim): 28
length of query: 675
length of database: 277,083,050
effective HSP length: 127
effective length of query: 548
effective length of database: 164,716,117
effective search space: 90264432116
effective search space used: 90264432116
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 94 (40.8 bits)
Query= orf3 [900-993]
(92 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.320 0.136 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 45,264,236
Number of Sequences: 884779
Number of extensions: 1464462
Number of successful extensions: 3295
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3295
Number of HSP's gapped (non-prelim): 0
length of query: 92
length of database: 277,083,050
effective HSP length: 68
effective length of query: 24
effective length of database: 216,918,078
effective search space: 5206033872
effective search space used: 5206033872
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf4 [1015-1245]
(229 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.318 0.136 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 131,181,179
Number of Sequences: 884779
Number of extensions: 5289705
Number of successful extensions: 9328
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 9328
Number of HSP's gapped (non-prelim): 0
length of query: 229
length of database: 277,083,050
effective HSP length: 116
effective length of query: 113
effective length of database: 174,448,686
effective search space: 19712701518
effective search space used: 19712701518
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 89 (38.9 bits)
Query= orf5 [1245-1561]
(315 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_459888.1| (NC_003197) Fels-1 prophage [Salmonella ty... 72 9e-012
ref|NP_460007.1| (NC_003197) Gifsy-2 prophage [Salmonella t... 70 2e-011
ref|NP_518978.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED... 68 1e-010
sp|P36272|VG04_BPP21 PORTAL PROTEIN (HEAD PROTEIN GP4) >gi|... 64 2e-009
emb|CAC83156.1| (AJ304858) putative portal protein [Escheri... 63 3e-009
ref|NP_288458.1| (NC_002655) putative head-tail preconnecto... 63 3e-009
ref|NP_287794.1| (NC_002655) putative capsid protein of pro... 62 9e-009
ref|NP_461542.1| (NC_003197) Gifsy-1 prophage: similar to h... 59 5e-008
ref|NP_297800.1| (NC_002488) conserved hypothetical protein... 45 0.001
>ref|NP_459888.1| (NC_003197) Fels-1 prophage [Salmonella typhimurium LT2]
gb|AAL19847.1| (AE008738) Fels-1 prophage [Salmonella typhimurium LT2]
Length = 511
Score = 71.6 bits (174), Expect = 9e-012
Identities = 54/189 (28%), Positives = 82/189 (42%), Gaps = 24/189 (12%)
Query: 7 KRHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFIQRQIGGKTPGTKPFTSG 66
+R +ARG LL V+ + ++ + D+ + A + A + FI++Q
Sbjct: 233 RRLNQARGASLLAPVIIRLMDLKEYEDSERIAARIAASLGMFIKKQ-------------- 278
Query: 67 AQRRVQDQDFVGDGTVRQTNIAETPF--GMMVEGLAEGEEIKAFTNDTTDEKFGEFEEAI 124
D DG V ET GM+ +GL GE+I +D + F
Sbjct: 279 --------DVGTDGYVAPEKRKETQIQPGMLFDGLNPGEDIGMIKSDRPNAGLESFRMGQ 330
Query: 125 LRAVAWALGMPFSVLAMQFDSSYSASRGEIKEFESVIHEMRDRDADTLLRPIYRSWLRTM 184
LRAVA L FS +A +D +YSA R E+ E + ++D RP+YR WL
Sbjct: 331 LRAVAAGLRGSFSSIARNYDGTYSAQRQELVEAQEGYSILQDSFIAAFTRPLYRRWLAAA 390
Query: 185 VLTRRIDAP 193
V + I+ P
Sbjct: 391 VASGAIEVP 399
>ref|NP_460007.1| (NC_003197) Gifsy-2 prophage [Salmonella typhimurium LT2]
gb|AAL19966.1| (AE008744) Gifsy-2 prophage [Salmonella typhimurium LT2]
Length = 437
Score = 70.5 bits (171), Expect = 2e-011
Identities = 69/266 (25%), Positives = 110/266 (40%), Gaps = 29/266 (10%)
Query: 7 KRHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFIQRQIGGKTPGTKPFTSG 66
+R +ARG LL V+ + ++ + D+ + A + A FI+R
Sbjct: 155 RRLNQARGVTLLAPVIIRLLDLKEYEDSERLAARISAAFAMFIRR--------------- 199
Query: 67 AQRRVQDQDFVGDGTVRQTNIAETPFGMMVEGLAEGEEIKAFTNDTTDEKFGEFEEAILR 126
+ VQD D D + ++ P G +++ L GE+I +D + F LR
Sbjct: 200 SDAMVQDGD-APDYADKDRDLDIEP-GTILKDLLPGEDIGTIKSDRPNANLESFRMGQLR 257
Query: 127 AVAWALGMPFSVLAMQFDSSYSASRGEIKEFESVIHEMRDRDADTLLRPIYRSWLRTMVL 186
AVA + FS +A +D +YSA R E+ E + ++D + RP+YR WL T +
Sbjct: 258 AVAAGVRGSFSSIARNYDGTYSAQRQELVEAQEGYAILQDNFIAAVSRPVYRRWLATAIT 317
Query: 187 TRRIDAPGFLAAARNPKEFPTFAAWTRSQWFGMVKEAVDLEKETRGRGEQIKLGALTRTK 246
ID P A + + G V +D KE I+ GA T +
Sbjct: 318 AGVIDVP----------TDTDMATLFNAVYSGPVMPWIDPLKEANAWRILIRGGAATESD 367
Query: 247 AARMLTGTSFKTNTRQLAK--ENQLL 270
R G + R+ A+ EN+ L
Sbjct: 368 WVRARGGAPAEVKRRRKAEIDENRKL 393
>ref|NP_518978.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD14559.1| (AL646061) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 508
Score = 67.8 bits (164), Expect = 1e-010
Identities = 66/266 (24%), Positives = 110/266 (40%), Gaps = 33/266 (12%)
Query: 8 RHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFIQRQIGGKTPGTKPFTSGA 67
R + RGEP L+ + + E+D+ DA + A GF+ RQ
Sbjct: 232 RPGQIRGEPWLSRALVKLNELDQYDDAELVRKKTAAMFAGFVTRQ--------------- 276
Query: 68 QRRVQDQDFVGDGTVRQTNIAETPFGM-MVEGLAEGEEIKAFTNDTTDEKFGEFEEAILR 126
+ + +G+G + I+ ++ L GE+IK +GEF R
Sbjct: 277 ---SPEDNLMGEGLPDEAGISLVGLEPGTLQILEPGEDIKFSDPADVGGSYGEFLRTQFR 333
Query: 127 AVAWALGMPFSVLAMQFDS-SYSASRGEIKEF----ESVIHEMRDRDADTLLRPIYRSWL 181
AVA ALG+ + L +YS+ R + EF E V H + + RP++ +W+
Sbjct: 334 AVAAALGITYEQLTGDLTGVNYSSIRAGLLEFRRRCEMVQHSVL---VHQMCRPVWAAWM 390
Query: 182 RTMVLTRRIDAPGFLAAARNPKEFPTFAAWTRSQWFGMVKEAVDLEKETRGRGEQIKLGA 241
+ VL+ + APGF + W W + VD EKE + I+ G
Sbjct: 391 KQAVLSGALVAPGFARGGAARRRQYLQVKWIPQGW-----QWVDPEKEFKAMLLAIRAGL 445
Query: 242 LTRTKAARMLTGTSFKTNTRQLAKEN 267
++R++A G + R++A +N
Sbjct: 446 MSRSEAISTF-GYDAEDIDREIAADN 470
>sp|P36272|VG04_BPP21 PORTAL PROTEIN (HEAD PROTEIN GP4)
pir||JN0538 head protein gp4 - phage 21
gb|AAA32342.1| (M81255) head-tail preconnector gp5 [Bacteriophage 21]
Length = 530
Score = 63.9 bits (154), Expect = 2e-009
Identities = 61/263 (23%), Positives = 113/263 (42%), Gaps = 16/263 (6%)
Query: 11 EARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFIQRQIGGKTPGTKPFTSGAQRR 70
+ RG VM+ + +D ++ + A + A I+ ++ T + +GA +
Sbjct: 241 QTRGANQFYSVMERLKMLDSLQATQLQSAIVKAMYAATIESELD--TEKAFEYIAGAPQE 298
Query: 71 VQDQDFVGD----GTVRQTNIAETPFGMMVEGLAEGEEIKAFTNDTTDEKFGEFEEAILR 126
+D + + TN T G+ + L G+++K T +D F E+A+LR
Sbjct: 299 QKDNPLINILEKFSSWYDTNNV-TLGGVKIPHLFPGDDLKLQTAQDSDNGFSALEQALLR 357
Query: 127 AVAWALGMPFSVLAMQFDS-SYSASRGEIKEFESVIHEMRDRDADTLLRPIYRSWLRTMV 185
+A LG+ + L+ + SYS++R E R A L ++ WL +
Sbjct: 358 YIAAGLGVSYEQLSRDYSKVSYSSARASANESWRYFMGRRKFIAARLATQMFSCWLEEAL 417
Query: 186 LTRRIDAPGFLAAARNPKEF-PTFAAWTRSQWFGMVKEAVDLEKETRGRGEQIKLGALTR 244
L G + R +F +AW+R++W G + A+D KE + +I+ G T
Sbjct: 418 LR------GIIRPPRARFDFYQARSAWSRAEWIGAGRMAIDGLKEVQESVMRIEAGLSTY 471
Query: 245 TKAARMLTGTSFKTNTRQLAKEN 267
K ++ G ++ RQ +E+
Sbjct: 472 EKGLALM-GEDYQDIFRQQVRES 493
>emb|CAC83156.1| (AJ304858) putative portal protein [Escherichia coli]
Length = 500
Score = 63.2 bits (152), Expect = 3e-009
Identities = 65/245 (26%), Positives = 104/245 (41%), Gaps = 33/245 (13%)
Query: 7 KRHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFIQRQIGG--KTPGTKPFT 64
+R + RG +L+ V+ I+ + + DA A + A + +I++ G + PG K
Sbjct: 231 RRLHQTRGSSMLSGVLMRISALKEYEDAELTAARIAAALGLYIRKGDGQDYEDPGIKE-- 288
Query: 65 SGAQRRVQDQDFVGDGTVRQTNIAETPFGMMVEGLAEGEEIKAFTNDTTDEKFGEFEEAI 124
T R+ +I TP G++ + L +GE+I +D + F
Sbjct: 289 ----------------TEREVHI--TP-GIIYDDLRKGEDIGMVKSDRPNPNLETFRNGQ 329
Query: 125 LRAVAWALGMPFSVLAMQFDSSYSASRGEIKEFESVIHEMRDRDADTLLRPIYRSWLRTM 184
LRAVA + FS A ++ +YSA R E+ E ++D + RP+YR+WL +
Sbjct: 330 LRAVAAGSRLSFSSAARNYNGTYSAQRQELVESTDGYLILQDCFIGAVTRPVYRTWLNMV 389
Query: 185 VLTRRIDAPGFLAAARNPKEFPTFAAWTRSQWFGMVKEAVDLEKETRGRGEQIKLGALTR 244
V + P + E T T S G V +D KE QI+ GA T
Sbjct: 390 VAAGLLKIPADV-------EMKTLYNATYS---GPVMPWIDPVKEAEAWRIQIRGGAATE 439
Query: 245 TKAAR 249
+ R
Sbjct: 440 SDWVR 444
>ref|NP_288458.1| (NC_002655) putative head-tail preconnector protein of prophage
CP-933U [Escherichia coli O157:H7 EDL933]
ref|NP_310988.1| (NC_002695) putative portal protein [Escherichia coli O157:H7]
dbj|BAB19579.1| (AP000400) portal protein [Escherichia coli O157:H7]
gb|AAG57012.1|AE005420_17 (AE005420) putative head-tail preconnector protein of prophage
CP-933U [Escherichia coli O157:H7 EDL933]
dbj|BAB36384.1| (AP002560) putative portal protein [Escherichia coli O157:H7]
Length = 500
Score = 63.2 bits (152), Expect = 3e-009
Identities = 65/245 (26%), Positives = 104/245 (41%), Gaps = 33/245 (13%)
Query: 7 KRHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFIQRQIGG--KTPGTKPFT 64
+R + RG +L+ V+ I+ + + DA A + A + +I++ G + PG K
Sbjct: 231 RRLHQTRGSSMLSGVLMRISALKEYEDAELTAARIAAALGLYIRKGDGQDYEDPGIKE-- 288
Query: 65 SGAQRRVQDQDFVGDGTVRQTNIAETPFGMMVEGLAEGEEIKAFTNDTTDEKFGEFEEAI 124
T R+ +I TP G++ + L +GE+I +D + F
Sbjct: 289 ----------------TEREVHI--TP-GIIYDDLRKGEDIGMVKSDRPNPNLETFRNGQ 329
Query: 125 LRAVAWALGMPFSVLAMQFDSSYSASRGEIKEFESVIHEMRDRDADTLLRPIYRSWLRTM 184
LRAVA + FS A ++ +YSA R E+ E ++D + RP+YR+WL +
Sbjct: 330 LRAVAAGSRLSFSSAARNYNGTYSAQRQELVESTDGYLILQDCFIGAVTRPVYRTWLNMV 389
Query: 185 VLTRRIDAPGFLAAARNPKEFPTFAAWTRSQWFGMVKEAVDLEKETRGRGEQIKLGALTR 244
V + P + E T T S G V +D KE QI+ GA T
Sbjct: 390 VAAGLLKIPADV-------EMKTLYNATYS---GPVMPWIDPVKEAEAWRIQIRGGAATE 439
Query: 245 TKAAR 249
+ R
Sbjct: 440 SDWVR 444
>ref|NP_287794.1| (NC_002655) putative capsid protein of prophage CP-933R
[Escherichia coli O157:H7 EDL933]
ref|NP_309134.1| (NC_002695) putative portal protein [Escherichia coli O157:H7]
ref|NP_310760.1| (NC_002695) putative portal protein [Escherichia coli O157:H7]
gb|AAG56408.1|AE005368_14 (AE005368) putative capsid protein of prophage CP-933R [Escherichia
coli O157:H7 EDL933]
dbj|BAB34530.1| (AP002554) putative portal protein [Escherichia coli O157:H7]
dbj|BAB36156.1| (AP002559) putative portal protein [Escherichia coli O157:H7]
Length = 530
Score = 61.6 bits (148), Expect = 9e-009
Identities = 61/263 (23%), Positives = 112/263 (42%), Gaps = 16/263 (6%)
Query: 11 EARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFIQRQIGGKTPGTKPFTSGAQRR 70
+ RG VM+ + +D ++ + A + A I+ + T + +GA +
Sbjct: 241 QTRGANQFYSVMERLKMLDSLQATQLQSAIVKAMYAATIESDLD--TEKAFEYIAGAPQG 298
Query: 71 VQDQDFVGD----GTVRQTNIAETPFGMMVEGLAEGEEIKAFTNDTTDEKFGEFEEAILR 126
+D + + TN T G+ + L G+++K T +D F E+A+LR
Sbjct: 299 QKDNPLINILEKFSSWYDTNNV-TLGGVKIPHLFPGDDLKLQTAQDSDNGFSALEQALLR 357
Query: 127 AVAWALGMPFSVLAMQFDS-SYSASRGEIKEFESVIHEMRDRDADTLLRPIYRSWLRTMV 185
+A LG+ + L+ + SYS++R E R A L ++ WL +
Sbjct: 358 YIAAGLGVSYEQLSRDYSKVSYSSARASANESWRYFMGRRKFIAARLATQMFSYWLEEAL 417
Query: 186 LTRRIDAPGFLAAARNPKEF-PTFAAWTRSQWFGMVKEAVDLEKETRGRGEQIKLGALTR 244
L G + R +F +AW+R++W G + A+D KE + +I+ G T
Sbjct: 418 LR------GIIRPPRARFDFYQARSAWSRAEWIGAGRMAIDGLKEVQESVMRIEAGLSTY 471
Query: 245 TKAARMLTGTSFKTNTRQLAKEN 267
K ++ G ++ RQ +E+
Sbjct: 472 EKEPALM-GEDYQDIFRQQVRES 493
>ref|NP_461542.1| (NC_003197) Gifsy-1 prophage: similar to head-tail preconnector gp4
of phage 21 [Salmonella typhimurium LT2]
gb|AAL21501.1| (AE008818) Gifsy-1 prophage: similar to head-tail preconnector gp4
of phage 21 [Salmonella typhimurium LT2]
Length = 526
Score = 59.3 bits (142), Expect = 5e-008
Identities = 45/177 (25%), Positives = 81/177 (45%), Gaps = 7/177 (3%)
Query: 93 GMMVEGLAEGEEIKAFTNDTTDEKFGEFEEAILRAVAWALGMPFSVLAMQFDS-SYSASR 151
G+ + L G+E+ T +D F E+A+LR +A LG+ + L+ + SYS++R
Sbjct: 322 GVKIPHLYPGDELNLQTAQDSDNGFSALEQALLRYIAAGLGVSYEQLSRDYSQVSYSSAR 381
Query: 152 GEIKEFESVIHEMRDRDADTLLRPIYRSWLRTMVLTRRIDAPGFLAAARNPKEFPTFAAW 211
E R A L ++ WL ++ I AP + + ++W
Sbjct: 382 ASANESWRYFLGRRRFIAGRLATQMFSCWLEEALIRGVIRAPRARFSFWEAR-----SSW 436
Query: 212 TRSQWFGMVKEAVDLEKETRGRGEQIKLGALTRTKAARMLTGTSFKTNTRQLAKENQ 268
+RS+W G + A+D KE + +I+ G T K ++ G ++ RQ +E++
Sbjct: 437 SRSEWIGAGRMAIDGLKEVQESVMRIEAGLSTYEKELAIM-GEDYQEIFRQQVRESE 492
>ref|NP_297800.1| (NC_002488) conserved hypothetical protein [Xylella fastidiosa
9a5c]
pir||F82798 conserved hypothetical protein XF0510 [imported] - Xylella
fastidiosa (strain 9a5c)
gb|AAF83320.1|AE003899_12 (AE003899) conserved hypothetical protein [Xylella fastidiosa 9a5c]
Length = 520
Score = 44.7 bits (104), Expect = 0.001
Identities = 41/181 (22%), Positives = 70/181 (38%), Gaps = 15/181 (8%)
Query: 94 MMVEGLAEGEEIKAFTNDTTDEKFGEFEEAILRAVAWALGMPFSVLAMQFDSSYSASRGE 153
M+ +GL GE I + + + + LRA+A + + FS LA ++ +YSA R E
Sbjct: 318 MIFDGLNPGERIDPIDSTRPNPNLESYRDGQLRAIAAPMRISFSSLAKNYNGTYSAQRQE 377
Query: 154 IKEFESVIHEMRDRDADTLLRPIYRSWLRTMVLTRRIDAPGF--LAAARNPKEFPTFAAW 211
+ E + ++RP+Y + V + + P LA A P W
Sbjct: 378 LVEQYGAYGVLAHEFISQMVRPLYERLIAIAVTSGALTLPPNVPLANALGADYLPPSMPW 437
Query: 212 TRSQWFGMVKEAVDLEKETRGRGEQIKLGALTRTKAARMLTGTSFKTNTRQLAKENQLLA 271
+D KE Q++ G + + G + T QL+ + Q A
Sbjct: 438 ------------IDPLKEINALTAQVQAGVRSLSSVIAERGGRMYDT-VEQLSIDKQWAA 484
Query: 272 D 272
+
Sbjct: 485 E 485
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.318 0.134 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 166,160,910
Number of Sequences: 884779
Number of extensions: 6555490
Number of successful extensions: 14605
Number of sequences better than 5.0e-02: 15
Number of HSP's better than 0.1 without gapping: 6
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 14588
Number of HSP's gapped (non-prelim): 15
length of query: 315
length of database: 277,083,050
effective HSP length: 120
effective length of query: 195
effective length of database: 170,909,570
effective search space: 33327366150
effective search space used: 33327366150
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 91 (39.7 bits)
Query= orf6 [1555-1951]
(395 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAB65791.1| (AF009900) Z2Z3 protein [Vibrio shilonii] 93 5e-018
dbj|BAA89626.1| (AB036665) similar to GPC of lambda [bacter... 64 3e-009
ref|NP_536361.1| (NC_003309) gp5 [Bacteriophage phiE125] >g... 57 3e-007
dbj|BAA89643.1| (AB036666) similar to minor capsid protein ... 56 7e-007
ref|NP_461541.1| (NC_003197) Gifsy-1 prophage: similar to h... 52 1e-005
pir||T03571 probable peptidase - Rhodobacter capsulatus >gi... 51 2e-005
gb|AAC24140.1| (AF071201) unknown [bacteriophage Felix 01] 49 1e-004
sp|P36273|VG05_BPP21 HEAD-TAIL PRECONNECTOR PROTEIN GP5 [CO... 48 1e-004
ref|NP_287793.1| (NC_002655) putative capsid assembly prote... 47 3e-004
ref|NP_287588.1| (NC_002655) putative head-tail preconnecto... 47 3e-004
ref|NP_470963.1| (NC_003212) similar to proteases [Listeria... 43 0.005
ref|NP_540710.1| (NC_003317) PUTATIVE PROTEASE IV [Brucella... 43 0.006
ref|NP_110548.1| (NC_002689) Secreted serine protease (ClpP... 42 0.014
>gb|AAB65791.1| (AF009900) Z2Z3 protein [Vibrio shilonii]
Length = 330
Score = 92.8 bits (229), Expect = 5e-018
Identities = 67/202 (33%), Positives = 101/202 (49%), Gaps = 10/202 (4%)
Query: 55 AQINMQGVFTSAPDFFAMLFGGGNTLYGDVFAA-INAAESDDSVKNIEYYIDSPGGEAEG 113
A I++ GV + + F + GG +T +V A NAA ++ SVK + +DSPGGEA G
Sbjct: 40 AIIHVNGVISRYANLFHAVCGGVST---EVLAKEFNAALNESSVKAVILNVDSPGGEASG 96
Query: 114 AIKLGDMIRGT--KKPXXXXXXXXXXXXXXXXXXXDKIVATSRVSRAGNIGAVLS-MRRP 170
+L +MI + KKP V +R G+IG V+S ++RP
Sbjct: 97 IHELSEMIHASRGKKPVRAYVGGDGCSAAYWITTACDRVTMDATARVGSIGTVVSFVKRP 156
Query: 171 SQ--SSYIDVTSTDAPNKRPDPETEEGRRVIREQALDPLHNMFAQAVADGRGTTLADVNQ 228
+ + S+ +PNKR DPE+E+G+ I+ Q LD + +F VA G T V
Sbjct: 157 DAEGAKRFEFVSSQSPNKRLDPESEQGQTAIQTQ-LDAMAEVFISRVARNMGVTSDKVKS 215
Query: 229 NFGRGGSLFAETALKMGMIDEI 250
+FG+GG + A+ GM E+
Sbjct: 216 DFGQGGVKIGQQAVDAGMAHEL 237
>dbj|BAA89626.1| (AB036665) similar to GPC of lambda [bacteriophage WO]
Length = 353
Score = 63.5 bits (153), Expect = 3e-009
Identities = 57/208 (27%), Positives = 87/208 (41%), Gaps = 17/208 (8%)
Query: 54 TAQINMQGVFTSAPDFFAMLFGGGNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEG 113
TA I + G+ T P F G T Y + I A S+ ++ I IDSPGGE G
Sbjct: 47 TAVIAINGILTKKPGAFDDFLG--MTSYEKIQEEIEEALSNKDIETILLDIDSPGGEVNG 104
Query: 114 AIKLGDMI---RGTKKPXXXXXXXXXXXXXXXXXXXDKIVATSRVSRAGNIGAVLS---M 167
L D I R K+ +K V SR S G+IG + S
Sbjct: 105 IFDLADFIYESRAKKRIIAIANDDAYSAAYAIASSAEK-VFVSRTSGVGSIGVIASHIDQ 163
Query: 168 RRPSQSSYIDVTSTDAPNKRPD-----PETEEGRRVIREQALDPLHNMFAQAVADGRGTT 222
R + I T+ A +++ D P T E ++++ + L+ MF Q +A RG +
Sbjct: 164 SRFDEKQGIKYTTIFAGSRKNDLNPHEPMTSESLESLQKE-VGRLYEMFLQLIARNRGLS 222
Query: 223 LADVNQNFGRGGSLFAETALKMGMIDEI 250
+ + G F E A+++G+ D +
Sbjct: 223 IEKIRST--EAGLYFGEKAVEIGLADGV 248
>ref|NP_536361.1| (NC_003309) gp5 [Bacteriophage phiE125]
gb|AAL40278.1|AF447491_5 (AF447491) gp5 [Bacteriophage phiE125]
Length = 301
Score = 57.0 bits (136), Expect = 3e-007
Identities = 51/184 (27%), Positives = 79/184 (42%), Gaps = 17/184 (9%)
Query: 79 TLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIRGTK--KPXXXXXXXXX 136
T Y + +A+N A +D +V++I IDS GG A GA +L D IR KP
Sbjct: 80 TSYEGLRSAVNQAVADPAVEHIVLDIDSNGGSATGAFELADDIRAASLVKPITAIVNFSA 139
Query: 137 XXXXXXXXXXDKIVATSRVSRAGNIGAV---LSMRRPSQSSYIDVTSTDAPNKRPDPETE 193
V SR S G+IG + L + + + I VTS A + + D
Sbjct: 140 FSGGYLIAAAASKVIVSRTSGVGSIGVIANHLDVSKRDEQQGIKVTSVFAGDHKNDLTPH 199
Query: 194 EGRRVIREQALDPLHNM-------FAQAVADGRGTTLADVNQNFGRGGSLFAETALKMGM 246
E + +Q+L L +M F A+A+ RG + V + G F + ++ G+
Sbjct: 200 E---PLSDQSLTFLTSMVQNSYKQFVDAIANFRGLSTQAVKDT--QAGIFFGQKGVEAGL 254
Query: 247 IDEI 250
D +
Sbjct: 255 ADSV 258
>dbj|BAA89643.1| (AB036666) similar to minor capsid protein GPC of phage lambda
[Wolbachia sp. wKue]
Length = 350
Score = 55.8 bits (133), Expect = 7e-007
Identities = 54/208 (25%), Positives = 86/208 (40%), Gaps = 17/208 (8%)
Query: 54 TAQINMQGVFTSAPDFFAMLFGGGNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEG 113
TA I + G+ T P+ F G T Y + I A D+ V+ I IDSPGGE G
Sbjct: 49 TAIIPIHGILTKKPEAFDDFLG--MTSYEKMQEEIEEALEDEKVEIILLDIDSPGGEVNG 106
Query: 114 AIKLGDMI---RGTKKPXXXXXXXXXXXXXXXXXXXDKIVATSRVSRAGNIGAVLSMRRP 170
L D I R K+ +KI T R S G+IG + S
Sbjct: 107 IFDLADFIYNARSKKRIIAIANDDAYSAAYAIASSAEKIFLT-RTSGVGSIGVIASHIDQ 165
Query: 171 S---QSSYIDVTSTDAPNKRPD-----PETEEGRRVIREQALDPLHNMFAQAVADGRGTT 222
S + I ++ A +++ D P T E ++ + ++ L+ M + +A R +
Sbjct: 166 SGFDEKQGIKYSTIFAGSRKNDLNPHEPITSESLENLKSE-VNRLYGMLVELIARNRSLS 224
Query: 223 LADVNQNFGRGGSLFAETALKMGMIDEI 250
+ + G F E A+++G+ D +
Sbjct: 225 IEAIKST--EAGLYFGEKAVEIGLADGV 250
>ref|NP_461541.1| (NC_003197) Gifsy-1 prophage: similar to head-tail preconnector gp5
of phage 21 [Salmonella typhimurium LT2]
gb|AAL21500.1| (AE008818) Gifsy-1 prophage: similar to head-tail preconnector gp5
of phage 21 [Salmonella typhimurium LT2]
Length = 498
Score = 51.6 bits (122), Expect = 1e-005
Identities = 51/183 (27%), Positives = 70/183 (37%), Gaps = 11/183 (6%)
Query: 77 GNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMI--RGTKKPXXXXXXX 134
G T Y V A + A SD VK I IDSPGG+A GA DMI +KP
Sbjct: 99 GMTGYDGVTARLQQAVSDPEVKGILLDIDSPGGQAAGAFDCADMIYRMREQKPVWALANE 158
Query: 135 XXXXXXXXXXXXDKIVATSRVSRAGNIGAVL---SMRRPSQSSYIDVT----STDAPNKR 187
++ SR G+IG V+ S + ID+T +
Sbjct: 159 TACSAAMLLAAACSHRLVTQTSRMGSIGVVMAHTSYAEKLKQEGIDITLIYSGAHKADLT 218
Query: 188 PDPETEEGRRVIREQALDPLHNMFAQAVADGRGTTLADV--NQNFGRGGSLFAETALKMG 245
P + E +Q +D MFA+ VA G ++ V + G T L G
Sbjct: 219 PSQKLPESVYADYQQRMDEARKMFAEKVARYTGLSVDAVMATEAAVYDGQAIITTGLADG 278
Query: 246 MID 248
M++
Sbjct: 279 MVN 281
>pir||T03571 probable peptidase - Rhodobacter capsulatus
gb|AAC16224.1| (AF010496) potential peptidase [Rhodobacter capsulatus]
Length = 303
Score = 50.8 bits (120), Expect = 2e-005
Identities = 54/190 (28%), Positives = 78/190 (40%), Gaps = 18/190 (9%)
Query: 77 GNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEA---EGAIKLGDMIRGTKKPXXXXXX 133
G Y + A +DD+V + D+PGG E A+ + K
Sbjct: 77 GWATYRGIAEACADLAADDAVSAVVVEFDTPGGYTLGIEDAVTALRDLAAVKPVHALAAP 136
Query: 134 XXXXXXXXXXXXXDKIVATSRVSRAGNIGAVLSMRR---PSQSSYI---DVTSTDAPNKR 187
++V T R + G+IG ++ P +S + D TS A K
Sbjct: 137 LAASAGYWLASQARELVMTPRAA-VGSIGVAVTAAANVAPGAASGVQLFDFTSRHARAKW 195
Query: 188 PDPETEEGRRVIREQALDPLHNMFAQAVADGRGTTLAD------VNQNFGRGGSLF-AET 240
PDP TE+G+ IR + LD F AVA+GRG D V+ + GG++F A
Sbjct: 196 PDPATEDGKAEIR-RGLDRTEARFHAAVAEGRGIAPEDLAARLSVSDDPEDGGAVFEAAE 254
Query: 241 ALKMGMIDEI 250
AL G+ D I
Sbjct: 255 ALARGLADRI 264
>gb|AAC24140.1| (AF071201) unknown [bacteriophage Felix 01]
Length = 442
Score = 48.5 bits (114), Expect = 1e-004
Identities = 50/222 (22%), Positives = 90/222 (40%), Gaps = 13/222 (5%)
Query: 40 PTANADRVMVRMGDTAQINMQGVFTSAPDFFAMLFGGGNTLYGDVFAAINAAESDDSVKN 99
P A ++ D A I + G T F + GG + Y + + A +D+S+K
Sbjct: 48 PKGEARSLLKVKDDIAIIPIMGGLTHRMTFIDAMCTGGLSSYEGLRRGFDEALADESIKT 107
Query: 100 IEYYIDSPGGEAEGAIKLGDMI---RGTKKPXXXXXXXXXXXXXXXXXXXDKIVATSRVS 156
I +IDS GGEA G +L I RG KK ++I+A S +
Sbjct: 108 IVLHIDSGGGEASGCFELARHIMASRGQKKIIAYVDEFACSAAYALASSAEEIIA-SPDA 166
Query: 157 RAGNIGAVL---SMRRPSQSSYIDVTSTDAPNKR----PDPETEEGRRVIREQALDPLHN 209
G+IG ++ + + + + + + A + P E + ++ ++ +
Sbjct: 167 DVGSIGVIMVHQELTKAFEKNGVTINVIKAGEFKGMGSPFQALSEESKERLQKRINDTYA 226
Query: 210 MFAQAVADGRGTTLADVNQNFGRGGSLFAETALKMGMIDEIL 251
F VA+ R + V A+ AL++G+I+ I+
Sbjct: 227 TFTGFVAESRNLSEEAVKNT--EANVYSAQEALELGLINSIM 266
>sp|P36273|VG05_BPP21 HEAD-TAIL PRECONNECTOR PROTEIN GP5 [CONTAINS: SCAFFOLD PROTEIN
(HEAD PROTEIN GP6)]
pir||JN0539 head protein gp5 - phage 21
gb|AAA32343.1| (M81255) head-tail preconnector gp5 [Bacteriophage 21]
Length = 501
Score = 48.1 bits (113), Expect = 1e-004
Identities = 46/185 (24%), Positives = 77/185 (40%), Gaps = 13/185 (7%)
Query: 77 GNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIRGTK--KPXXXXXXX 134
G T Y + A + A +D V+ + IDSPGG+A GA DMI + KP
Sbjct: 99 GMTGYDGIVACLQQAMADSQVRGVLLDIDSPGGQAAGAFDCADMIYRLRQQKPVWALCND 158
Query: 135 XXXXXXXXXXXXDKIVATSRVSRAGNIGAVL---SMRRPSQSSYIDVTSTDAPNKRPDPE 191
++ SR G+IG ++ S + +D+T + + D
Sbjct: 159 TACSAAMLLASACSRRLVTQTSRIGSIGVMMSHVSYAGHLAQAGVDITLIYSGAHKVDGN 218
Query: 192 TEEGR----RVIREQALDPLHNMFAQAVADGRGTTLADVNQNFGRGGSLF-AETALKMGM 246
E R +Q +D MFA+ VA G ++ V G ++F ++ ++ G+
Sbjct: 219 QFEALPAEVRQDMQQRIDAARRMFAEKVAMYTGLSVDAVT---GTEAAVFEGQSGIEAGL 275
Query: 247 IDEIL 251
DE++
Sbjct: 276 ADELI 280
>ref|NP_287793.1| (NC_002655) putative capsid assembly protein of prophage CP-933R
[Escherichia coli O157:H7 EDL933]
ref|NP_309135.1| (NC_002695) putative head-tail preconnector protein [Escherichia
coli O157:H7]
ref|NP_310759.1| (NC_002695) putative head-tail preconnector protein [Escherichia
coli O157:H7]
gb|AAG56407.1|AE005368_13 (AE005368) putative capsid assembly protein of prophage CP-933R
[Escherichia coli O157:H7 EDL933]
dbj|BAB34531.1| (AP002554) putative head-tail preconnector protein [Escherichia
coli O157:H7]
dbj|BAB36155.1| (AP002559) putative head-tail preconnector protein [Escherichia
coli O157:H7]
Length = 501
Score = 47.0 bits (110), Expect = 3e-004
Identities = 46/185 (24%), Positives = 76/185 (40%), Gaps = 13/185 (7%)
Query: 77 GNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIRGTK--KPXXXXXXX 134
G T Y + A + A +D V+ + IDSPGG+A GA DMI + KP
Sbjct: 99 GMTGYDGIVACLQQAMADSQVRGVLLDIDSPGGQAAGAFDCADMIYRLRQQKPVWALCND 158
Query: 135 XXXXXXXXXXXXDKIVATSRVSRAGNIGAVL---SMRRPSQSSYIDVTSTDAPNKRPDPE 191
++ SR G+IG ++ S + +D+T + + D
Sbjct: 159 TACSAAMLLASACSRRLVTQTSRIGSIGVMMSHVSYAGHLAQAGVDITLIYSGAHKVDGN 218
Query: 192 TEEGR----RVIREQALDPLHNMFAQAVADGRGTTLADVNQNFGRGGSLF-AETALKMGM 246
E R +Q +D MFA+ VA G ++ V G ++F ++ + G+
Sbjct: 219 QFEALPAEVRQNMQQRIDAARRMFAEKVAMFTGLSVDAVT---GTEAAVFEGQSGIDAGL 275
Query: 247 IDEIL 251
DE++
Sbjct: 276 ADELV 280
>ref|NP_287588.1| (NC_002655) putative head-tail preconnector protein of prophage
CP-933O [Escherichia coli O157:H7 EDL933]
gb|AAG56200.1|AE005349_4 (AE005349) putative head-tail preconnector protein of prophage
CP-933O [Escherichia coli O157:H7 EDL933]
Length = 501
Score = 47.0 bits (110), Expect = 3e-004
Identities = 46/185 (24%), Positives = 76/185 (40%), Gaps = 13/185 (7%)
Query: 77 GNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIRGTK--KPXXXXXXX 134
G T Y + A + A +D V+ + IDSPGG+A GA DMI + KP
Sbjct: 99 GMTGYDGIVACLQQAMADSQVRGVLLDIDSPGGQAAGAFDCADMIYRLRQQKPVWALCND 158
Query: 135 XXXXXXXXXXXXDKIVATSRVSRAGNIGAVL---SMRRPSQSSYIDVTSTDAPNKRPDPE 191
++ SR G+IG ++ S + +D+T + + D
Sbjct: 159 TACSAAMLLASACSRRLVTQTSRIGSIGVMMSHVSYAGHLAQAGVDITLIYSGAHKVDGN 218
Query: 192 TEEGR----RVIREQALDPLHNMFAQAVADGRGTTLADVNQNFGRGGSLF-AETALKMGM 246
E R +Q +D MFA+ VA G ++ V G ++F ++ + G+
Sbjct: 219 QFEALPAEVRQNMQQRIDAARRMFAEKVAMFTGLSVDAVT---GTEAAVFEGQSGIDAGL 275
Query: 247 IDEIL 251
DE++
Sbjct: 276 ADELV 280
>ref|NP_470963.1| (NC_003212) similar to proteases [Listeria innocua]
emb|CAC96858.1| (AL596169) similar to proteases [Listeria innocua]
Length = 337
Score = 43.1 bits (100), Expect = 0.005
Identities = 57/263 (21%), Positives = 96/263 (35%), Gaps = 33/263 (12%)
Query: 10 LRLMADAPNLTQAQIDQCMASSPLF-QGPGGPTANADRVMVRMG--DT-AQINMQGVFTS 65
L +++ T +QI SSP F + T ++ G DT A +++ G
Sbjct: 15 LLIVSALAKFTSSQIASTEESSPTFVESLFADTGELTETVIEEGGDDTIAVLSVDGTIQD 74
Query: 66 APDFFAMLFGGGNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIRGTK 125
D LFG + + +DD ++ + Y++SPGG + ++ D I +
Sbjct: 75 TGDS-GSLFGDAGYDHSFFMQQLEQVRNDDYIQGVLLYVNSPGGGVMESAQIRDKILQIQ 133
Query: 126 KP-----XXXXXXXXXXXXXXXXXXXDKIVATSRVSRAGNIGAVL--------------S 166
K DKI A S+ + G++G ++ S
Sbjct: 134 KERNIPFYVSMGSMAASGGYYISAPADKIFA-SKETLTGSLGVIMQGYDYSELMKKLGVS 192
Query: 167 MRRPSQSSYIDVTSTDAPNKRPDPETEEGRRVIREQALDPLHNMFAQAVADGRGTTLADV 226
Y D+ S P E + I + +D +N F + VA GRG + A+
Sbjct: 193 DNTIKSGEYKDIMSGTRP-------MTEDEKKIMQSMIDDSYNEFVKVVAKGRGMS-AEK 244
Query: 227 NQNFGRGGSLFAETALKMGMIDE 249
+ G A + G+IDE
Sbjct: 245 VRKIADGRIYDGRQAKENGLIDE 267
>ref|NP_540710.1| (NC_003317) PUTATIVE PROTEASE IV [Brucella melitensis]
gb|AAL52974.1| (AE009612) PUTATIVE PROTEASE IV [Brucella melitensis]
Length = 301
Score = 42.7 bits (99), Expect = 0.006
Identities = 36/152 (23%), Positives = 59/152 (38%), Gaps = 15/152 (9%)
Query: 83 DVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIR--GTKKPXXXXXXXXXXXXX 140
++ ++ DD+VK + +DSPGG G + D +R TKKP
Sbjct: 39 ELLKRLDKIAGDDAVKGVILLLDSPGGTTVGGEAIYDAVRKIATKKPVVTQVGTLAASAG 98
Query: 141 XXXXXXDKIVATSRVSRAGNIGAVLSMRRPSQSSYIDV------TSTDAPNKRP----DP 190
+ + S G+IG + + P S +D T +P K P
Sbjct: 99 YMIASASDHIVARQTSIVGSIGVL--FQYPDLSKLLDTLGVKVETIKSSPLKAEPNYFSP 156
Query: 191 ETEEGRRVIREQALDPLHNMFAQAVADGRGTT 222
+EE + +IR +D ++ F V + R T
Sbjct: 157 ASEEAKNMIRSMIMDS-YDWFVGIVQERRAFT 187
>ref|NP_110548.1| (NC_002689) Secreted serine protease (ClpP class) [Thermoplasma
volcanium]
dbj|BAB59171.1| (AP000991) proteinase IV [Thermoplasma volcanium]
Length = 230
Score = 41.6 bits (96), Expect = 0.014
Identities = 41/182 (22%), Positives = 73/182 (39%), Gaps = 14/182 (7%)
Query: 80 LYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIR--GTKKPXXXXXXXXXX 137
LY ++ A+ AAE S+ + +S GG+A + + D+++ KKP
Sbjct: 16 LYKYLYPALKAAEGKKSIAGLILVFNSGGGDAASSQLIHDLVKKIRKKKPVYSLALGICA 75
Query: 138 XXXXXXXXXDKIVATSRVSRAGNIGAVLSMRRPSQSSYIDVTSTDA---------PNKRP 188
+ S G+IG V+S+ RP+ ++ D P
Sbjct: 76 SGAYWIASASTKIYAIDTSLIGSIG-VISI-RPNVKKLMEKIGVDVMVYKSGKYKDMTSP 133
Query: 189 DPETEEGRRVIREQALDPLHNMFAQAVADGRGTTLADVNQNFGRGGSLFAETALKMGMID 248
E E + + ++ LD + F ++VA+ RG +++ G A+ A G+ID
Sbjct: 134 FSEPNEEEKSVYQRLLDDIFEKFKRSVAEDRGIPSEKIDE-IANGMVYSAKMAADNGLID 192
Query: 249 EI 250
I
Sbjct: 193 RI 194
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.315 0.131 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 176,519,459
Number of Sequences: 884779
Number of extensions: 6309375
Number of successful extensions: 15521
Number of sequences better than 5.0e-02: 17
Number of HSP's better than 0.1 without gapping: 2
Number of HSP's successfully gapped in prelim test: 15
Number of HSP's that attempted gapping in prelim test: 15513
Number of HSP's gapped (non-prelim): 17
length of query: 395
length of database: 277,083,050
effective HSP length: 122
effective length of query: 273
effective length of database: 169,140,012
effective search space: 46175223276
effective search space used: 46175223276
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 92 (40.0 bits)
Query= orf7 [1956-2091]
(134 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_518980.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED... 37 0.025
>ref|NP_518980.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD14561.1| (AL646061) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 125
Score = 37.4 bits (85), Expect = 0.025
Identities = 18/41 (43%), Positives = 27/41 (64%)
Query: 33 AAETWPAGAVLGRVTATGRYVRYNASASDGSQVPSAILTEA 73
A ++ G VLG VTATG+ + + SA+DGSQ + +L +A
Sbjct: 31 AGQSLVLGTVLGMVTATGKVKQLDPSATDGSQYAAGVLMQA 71
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.319 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 59,690,860
Number of Sequences: 884779
Number of extensions: 1917181
Number of successful extensions: 4161
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4160
Number of HSP's gapped (non-prelim): 1
length of query: 134
length of database: 277,083,050
effective HSP length: 110
effective length of query: 24
effective length of database: 179,757,360
effective search space: 4314176640
effective search space used: 4314176640
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 83 (36.6 bits)
Query= orf8 [2096-2461]
(364 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_056680.1| (NC_002214) gp348 [Streptococcus thermophi... 46 5e-004
pir||T13320 hypothetical protein 31 - Streptococcus phage p... 44 0.002
>ref|NP_056680.1| (NC_002214) gp348 [Streptococcus thermophilus bacteriophage Sfi11]
pir||T13625 hypothetical protein gp348 - Streptococcus phage phi-Sfi11
gb|AAC34402.1| (AF158600) gp348 [Streptococcus thermophilus bacteriophage Sfi11]
Length = 348
Score = 46.2 bits (108), Expect = 5e-004
Identities = 42/167 (25%), Positives = 68/167 (40%), Gaps = 16/167 (9%)
Query: 37 GKTVEIDVKRFAENVAVVVTKLSGPNFNDASLISTKEFEPPEYGEAFATDVDDLLQRLLG 96
G++V + F NV V D + + P + EA +D Q L
Sbjct: 51 GQSVALKAAAFDTNVTV----------RDRVSAEMHDEQMPFFKEAMLVKENDRQQLNL- 99
Query: 97 VNPYDDANIAYSSKLVGRLMDYFMEANDMIMRGIELQASQILQTGRLSLIDRAGEVAYEI 156
D N A + +V + + + + +E Q+L TG+++ V +I
Sbjct: 100 --VKDSGNAALVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDG--VNKDI 155
Query: 157 DYSPKATHFPTTATAWSDDGADPLADLESLFGVIRADGKVNPDMIIM 203
DY K H + +W++ GA PLADLE R G +NP+ +M
Sbjct: 156 DYGVKPDHKKQVSKSWAEPGATPLADLEDAIETARELG-LNPERAVM 201
>pir||T13320 hypothetical protein 31 - Streptococcus phage phi-O1205
gb|AAC79546.1| (U88974) ORF31 [Streptococcus thermophilus temperate bacteriophage
O1205]
Length = 348
Score = 44.3 bits (103), Expect = 0.002
Identities = 25/74 (33%), Positives = 38/74 (50%), Gaps = 3/74 (4%)
Query: 130 IELQASQILQTGRLSLIDRAGEVAYEIDYSPKATHFPTTATAWSDDGADPLADLESLFGV 189
+E Q+L TG+++ V +IDY K H + +W++ GA PLADLE
Sbjct: 131 LEAMRMQVLATGKIAFTSDG--VNKDIDYGVKPDHKKQVSKSWAEPGATPLADLEDAIET 188
Query: 190 IRADGKVNPDMIIM 203
R G +NP+ +M
Sbjct: 189 ARELG-LNPERAVM 201
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.318 0.137 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 205,071,628
Number of Sequences: 884779
Number of extensions: 8701858
Number of successful extensions: 24210
Number of sequences better than 5.0e-02: 2
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 24209
Number of HSP's gapped (non-prelim): 3
length of query: 364
length of database: 277,083,050
effective HSP length: 121
effective length of query: 243
effective length of database: 170,024,791
effective search space: 41316024213
effective search space used: 41316024213
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 91 (39.7 bits)
Query= orf9 [2477-2607]
(129 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.309 0.137 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 63,642,766
Number of Sequences: 884779
Number of extensions: 2055492
Number of successful extensions: 2316
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2316
Number of HSP's gapped (non-prelim): 0
length of query: 129
length of database: 277,083,050
effective HSP length: 105
effective length of query: 24
effective length of database: 184,181,255
effective search space: 4420350120
effective search space used: 4420350120
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 83 (36.6 bits)
Query= orf10 [2609-2738]
(128 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.314 0.133 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 67,790,511
Number of Sequences: 884779
Number of extensions: 2607104
Number of successful extensions: 4764
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4764
Number of HSP's gapped (non-prelim): 0
length of query: 128
length of database: 277,083,050
effective HSP length: 104
effective length of query: 24
effective length of database: 185,066,034
effective search space: 4441584816
effective search space used: 4441584816
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 83 (36.6 bits)
Query= orf11 [2736-2965]
(228 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.318 0.132 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 112,919,182
Number of Sequences: 884779
Number of extensions: 4236100
Number of successful extensions: 9074
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 9074
Number of HSP's gapped (non-prelim): 0
length of query: 228
length of database: 277,083,050
effective HSP length: 116
effective length of query: 112
effective length of database: 174,448,686
effective search space: 19538252832
effective search space used: 19538252832
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 89 (38.9 bits)
Query= orf12 [2969-3470]
(500 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_050643.1| (NC_000929) L [Enterobacteria phage Mu] >g... 96 9e-019
ref|NP_404841.1| (NC_003143) putative bacteriophage tail sh... 85 2e-015
ref|NP_284552.1| (NC_003116) hypothetical protein [Neisseri... 77 5e-013
ref|NP_439661.1| (NC_000907) sheath protein gpL (muL) [Haem... 66 7e-010
ref|NP_543096.1| (NC_003356) putative sheath protein [Bacte... 65 2e-009
ref|NP_274135.1| (NC_003112) phage sheath protein [Neisseri... 63 8e-009
ref|NP_313006.1| (NC_002695) putative tail sheath protein [... 61 3e-008
>ref|NP_050643.1| (NC_000929) L [Enterobacteria phage Mu]
sp|P79678|VPL_BPMU Tail sheath protein (gpL)
dbj|BAA19195.1| (AB000833) sheath protein gpL [Enterobacteria phage Mu]
gb|AAF01117.1|AF083977_36 (AF083977) L [Enterobacteria phage Mu]
Length = 495
Score = 95.9 bits (237), Expect = 9e-019
Identities = 128/483 (26%), Positives = 193/483 (39%), Gaps = 29/483 (6%)
Query: 5 NAIDNSVRARVLGIKTEFRNFNTGRTFFLP-QHVALLGQ-GNTAATYALTPFRATSAAQV 62
NAI + VR + I EF N N P Q V + GQ G+ A+ P R S +Q
Sbjct: 7 NAIPSDVRVPLTYI--EFDNSNAVSGTPAPRQRVLMFGQSGSKASAAPNVPVRIRSGSQA 64
Query: 63 GQRFGFGSPLHLAALQLLPDNNDGLRSIPLTIFPMGDNDAGVAAEGVLDLSGTASATATV 122
FG GS L L A L N R L P G N G AA G + LSGTA ++
Sbjct: 65 SAAFGQGSMLALMADAFLNAN----RVAELWCIPQG-NGTGNAAVGEISLSGTAGENGSL 119
Query: 123 RVRIGSQRSSLVTIPTGTTAEQAAALLVAGIQGNPFMPMTAAVDATNDNE-----VNVAA 177
I QR + V++ G T A LLVA I+G P +P+TA V A + ++ V ++A
Sbjct: 120 VTYIAGQRLA-VSVAAGATGAALADLLVARIKGQPDLPVTAEVRADSGDDDTHADVVLSA 178
Query: 178 KWKG-LSGNDL---VVSIEGAVAGLTVAITQXXXXXXXXXXXXXXXXFGENVWYTQIVSC 233
K+ G LS D+ + E G+ A G+ + Y IV
Sbjct: 179 KFTGALSAVDVRWNYYAGETTPYGIITAFKAASGKNGNPDISASIAGMGD-LQYKYIVMP 237
Query: 234 FNTANTDALNAFETFGEGRWNPILRRPLAVFTGTNETNANTLAAIGDARRAQRTNLITPV 293
+ + LN T + RW P+ + T + T + ++ G +R + + +
Sbjct: 238 YT--DEPNLNLLRTELQERWGPVNQADGFAVTVLSGTYGD-ISTFGVSRNDHLISCMG-I 293
Query: 294 PGAQNLPCEIXXXXXXXXXXXXNNDPASDFARLTLPGLTPGTDAQQWSHTQRDLLVKAGI 353
GA + DPA LTLPG P +++ ++R+ L+ GI
Sbjct: 294 AGAPEPSYLYAATLCAVASQALSIDPARPLQTLTLPGRMPPAVGDRFTWSERNALLFDGI 353
Query: 354 STSIVRD-GVAEISDTVTTYHPT--GEVNPGYLFYKDTVKASNVLYNLDLIFNTREWDGA 410
ST V D G +I +T Y G+ +P YL S + Y+L T+++
Sbjct: 354 STFNVNDGGEMQIERMITMYRTNKYGDSDPSYLNVNTIATLSYLRYSLRTRI-TQKFPNY 412
Query: 411 PLIPD-DQPTTNARAKRPKXXXXXXXXXXXXXXXXXIISDVPFTLENIRASINDQNPNRL 469
L D + T P ++ D E + + N + +RL
Sbjct: 413 KLASDGTRFATGQAVVTPSVIKTELLALFEEWENAGLVEDFDTFKEELYVARNKDDKDRL 472
Query: 470 DII 472
D++
Sbjct: 473 DVL 475
>ref|NP_404841.1| (NC_003143) putative bacteriophage tail sheath protein [Yersinia
pestis]
emb|CAC90076.1| (AJ414147) putative bacteriophage tail sheath protein [Yersinia
pestis]
Length = 502
Score = 84.7 bits (208), Expect = 2e-015
Identities = 95/371 (25%), Positives = 150/371 (39%), Gaps = 33/371 (8%)
Query: 21 EFRNFNTGRTFFLPQHVALLGQGNTAATY-ALTPFRATSAAQVGQRFGFGSPLHLAALQL 79
EF N + T Q ++GQ +T A P +S A V G GS LH
Sbjct: 20 EFDN-SQANTASTTQRTLIIGQTRAGSTLPANIPLLVSSTATVAGLSGAGSMLHGQMAAY 78
Query: 80 LPDNNDGLRSIPLTIFPMGDNDAGVAAEGVLDLSGTASATATVRVRIGSQRSSLVTIPTG 139
L ++ G + + P+ D D+ VAA G + ++ AS T + + IG R + T
Sbjct: 79 LANDTAG----EIYLLPLSDADSMVAAIGKITINTAASETGVISLYIGGIRVQTTVVATD 134
Query: 140 TTAEQAAALLVAGIQGNPFMPMTAAVDATNDNE---VNVAAKWKGLSGN--DLVVSIEGA 194
+ + AAAL A I+ P +P+T +E V + AK KG GN DL ++ G
Sbjct: 135 SVSTIAAAL-AAAIENQPELPVTVVHTGDMVSEGAVVVLEAKNKGAHGNNIDLRLNYLGN 193
Query: 195 VAG------LTVAITQXXXXXXXXXXXXXXXXFGENVWYTQIVSCFNTANTDALNAFETF 248
G L + IT + + I +TA+ DA+ F +
Sbjct: 194 AGGEITPESLVLMITPMAGGAGAPELADGLANLQDRTFDFIINPYTDTASLDAIKDFLSD 253
Query: 249 GEGRWN---PILRRPLAVFTGTNETNANTLAAIGDARRAQRTNLITPVPGAQNLPCEI-- 303
GRW+ + A +GT L A G+ R Q +L+ G N P
Sbjct: 254 STGRWSYRQQLYGHSFAAQSGTY----GQLTAAGELRNDQHASLL----GIHNSPTPAHI 305
Query: 304 -XXXXXXXXXXXXNNDPASDFARLTLPGLTPGTDAQQWSHTQRDLLVKAGIST-SIVRDG 361
NDP L + G+ + +++ T+R+ L+ +GIST ++ D
Sbjct: 306 WSAAYVGAIAQSLRNDPGRPLQTLAIRGVLAPPLSSRFTLTERNNLLHSGISTVTVADDS 365
Query: 362 VAEISDTVTTY 372
++ + +TTY
Sbjct: 366 TVQVENIITTY 376
>ref|NP_284552.1| (NC_003116) hypothetical protein [Neisseria meningitidis Z2491]
pir||E81810 hypothetical protein NMA1841 [imported] - Neisseria meningitidis
(group A strain Z2491)
emb|CAB71993.1| (AJ391256) hypothetical protein [Neisseria meningitidis]
emb|CAB85065.1| (AL162757) hypothetical protein [Neisseria meningitidis Z2491]
Length = 475
Score = 76.6 bits (187), Expect = 5e-013
Identities = 59/186 (31%), Positives = 84/186 (44%), Gaps = 13/186 (6%)
Query: 19 KTEFRNFNTGRTFFLPQHVALLGQGNTAATYALTPFRAT-SAAQVGQRFGFGSPLHLAAL 77
K RN T + Q V ++ Q N A LT SAA ++G GS HL
Sbjct: 27 KLAVRNLPTNK-----QRVLIVAQHNNPALGELTELENVFSAADAAAKYGAGSMAHLMVT 81
Query: 78 QLLPDNNDGLRSIPLTIFPMGDNDAGVAAEGVLDLSGTASATATVRVRIGSQRSSLVTIP 137
+ L++ + DN AGVAA G + LSGTA+ +RV I + + + I
Sbjct: 82 AAIK----AYAYADLSLITVADNKAGVAAGGKITLSGTANTQGVLRVSIANADTLTIGIG 137
Query: 138 TGTTAEQAAALLVAGIQGNPFMPMTAAVDATNDNEVNVAAKWKGLSGNDLVVSIEGAVAG 197
TA AA + A I P +P+TA V + V + AK KG +GN + + G
Sbjct: 138 AEDTAATVAAAVKAAIDAVPDLPVTATV---AEAVVTLTAKNKGTAGNAIRIKTSNTAEG 194
Query: 198 LTVAIT 203
+T A+T
Sbjct: 195 ITAAVT 200
>ref|NP_439661.1| (NC_000907) sheath protein gpL (muL) [Haemophilus influenzae Rd]
sp|P44233|VPL_HAEIN Mu-like prophage FluMu tail sheath protein
pir||I64033 hypothetical protein HI1511 - Haemophilus influenzae (strain Rd
KW20)
gb|AAC23158.1| (U32827) sheath protein gpL (muL) [Haemophilus influenzae Rd]
Length = 487
Score = 66.2 bits (160), Expect = 7e-010
Identities = 81/364 (22%), Positives = 141/364 (38%), Gaps = 30/364 (8%)
Query: 45 TAATYALTPFRATSAAQVGQRFGFGSPLHLAALQLLPDNNDGLRSIPLTIFPMGDNDAGV 104
T + A R ++ AQ FG GS L N+ ++ L + P+ + +G
Sbjct: 46 TGSAKAGEAVRVSAYAQAKTLFGRGSQLAEMVKTFKAHNS----TLDLWVLPLDEAPSGA 101
Query: 105 AAEGVLDLSGTASATATVRVRI-GSQRSSLVTIPTGTTAEQAAALLVAGIQGNPFMPMTA 163
A G + ++GTA+ T + I G+ + VT +G TA+ A L I + +P+ A
Sbjct: 102 KATGSVQITGTATQAGTFSLMIAGNNYKTAVT--SGDTADVVAGKLQKLIAADQDVPVVA 159
Query: 164 AVDATNDNEVNVAAKWKGLSGNDLVV-----SIEGAVAGLTVAITQXXXXXXXXXXXXXX 218
V N + + ++KG +GN++ + S E GL IT
Sbjct: 160 TVAG---NTITLTCRFKGETGNEIDLRCNYYSGEAFPEGLKATITDMKNGAVNPDMSVAI 216
Query: 219 XXFGENVWYTQIVSCFNTANTDALNAFETFGEGRWNPILRRPLAVFTGTNETNANTLAAI 278
G W+ I++ F +T++LN RW P+ + F T+A
Sbjct: 217 TGLGAE-WWNYIINPFT--DTESLNLLRADLVKRWGPLKQIDGICFMAKRGTHAEVTTFA 273
Query: 279 GDARRAQRTNLITPVPGAQNLPCEIXXXXXXXXXXXXNN---DPASDFARLTLPGLTPGT 335
QR + + + P + DPA L + L P +
Sbjct: 274 -----EQRNDYLFSLLATHKAPQPAYLWASAYAAVVAGSLAIDPARPVQTLVMD-LLPPS 327
Query: 336 DAQQWSHTQRDLLVKAGISTSIVRDG-VAEISDTVTTYHPT--GEVNPGYLFYKDTVKAS 392
+ +W +R+ L+ +G+ST V G ++ +T Y G+ + YL+ + S
Sbjct: 328 MSDRWDLPERNTLLYSGVSTYTVNAGSQPQVEAAITMYRKNAFGDNDESYLYVETIATLS 387
Query: 393 NVLY 396
+ Y
Sbjct: 388 YLRY 391
>ref|NP_543096.1| (NC_003356) putative sheath protein [Bacteriophage P27]
emb|CAC83562.1| (AJ298298) putative sheath protein [Bacteriophage P27]
Length = 498
Score = 65.1 bits (157), Expect = 2e-009
Identities = 85/389 (21%), Positives = 142/389 (35%), Gaps = 29/389 (7%)
Query: 103 GVAAEGVLDLSGTASATATVRVRIGSQRSSL-VTIPTGTTAEQAAALLVAGIQGNPFMPM 161
G AA + ++G A + T+ + +G RSS+ V + G A A + + G +P
Sbjct: 97 GAAATVRVTVTGEAEESGTLSLYVG--RSSVQVPVVNGDDATAVATAIKEAVNGVITLPF 154
Query: 162 TAAVDATNDNEVNVAAKWKGLSGNDLVVSI--------EGAVAGLTVAITQXXXXXXXXX 213
A+ DA V + A+ KGL GN+L V + E AGL V
Sbjct: 155 AASSDA---GVVTLTARHKGLYGNELPVCLNYYGSGGGEILPAGLQVVTEAGTAGSGAPD 211
Query: 214 XXXXXXXFGENVWYTQIVSCFNTANTDALNAFETFGEGRWNPILRRPLAVFTGTNETNAN 273
G+ + + + A+ + + GRW+ + V+T T +
Sbjct: 212 LTAAVAAMGDEAFDFIGLPFNDAASINMMMTEMNDSSGRWSYARQLYGHVYTAKLGTLSE 271
Query: 274 TLAAIGDARRAQRTNLITPVPGAQNLPCEIXXXXXXXXXXXXNNDPASDFARLTLPGLTP 333
+ A GD Q L Q+ E+ NDPA L G+ P
Sbjct: 272 LVNA-GDMHNQQHITLAGYEKETQSPVDELVASRLAREAVFIRNDPARPTQTGELVGMLP 330
Query: 334 GTDAQQWSHTQRDLLVKAGISTSIVRDGVAEISDTVTTY--HPTGEVNPGYLFYKDTVKA 391
+++ T++ L+ G++T+ V G I +VTTY + G + YL + +
Sbjct: 331 APKGKRFIMTEQQTLLSHGVATAYVEGGTLRIQRSVTTYKKNAYGVADNSYLDSETLHTS 390
Query: 392 SNVLYNLDLIFNTREW------DGAPLIPDDQPTTNARAKRPKXXXXXXXXXXXXXXXXX 445
+ VL L + ++ DG P T P
Sbjct: 391 AYVLRKLKSVITSKYGRHKLANDGTRFGPGQAIVT------PAVIKGELLATYRQMERAG 444
Query: 446 IISDVPFTLENIRASINDQNPNRLDIIFP 474
I+ + + + + NPNRL+ +FP
Sbjct: 445 IVENYDLFKQYLIVERDADNPNRLNTLFP 473
>ref|NP_274135.1| (NC_003112) phage sheath protein [Neisseria meningitidis MC58]
pir||E81121 phage sheath protein NMB1104 [imported] - Neisseria meningitidis
(group B strain MD58)
gb|AAF41495.1| (AE002460) phage sheath protein [Neisseria meningitidis MC58]
Length = 469
Score = 62.8 bits (151), Expect = 8e-009
Identities = 53/167 (31%), Positives = 74/167 (43%), Gaps = 10/167 (5%)
Query: 34 PQHVALLGQGNTAATY-ALTPFRATSAAQVGQRFGFGSPLHLAALQLLPDNNDGLRSIPL 92
PQ V ++ TA AL P + S A+ FG GS HL Q +N + L
Sbjct: 35 PQKVLMVAPMLTAGIQPALEPVQLFSDAEAADLFGQGSLAHLMVRQAFANNP----YLDL 90
Query: 93 TIFPMGDNDAGVAAEGVLDLSGTASATATVRVRIGSQRSSLVTIPTGTTAEQAAALLVAG 152
T+ + D+ AGV A + LSGTA+A V + IG ++ S + TG TA A L
Sbjct: 91 TVIGIADHSAGVQATATVTLSGTATAPGVVEITIGGKQVS-TAVNTGETAATVADRLKTA 149
Query: 153 IQGNPFMPMTAAVDATNDNEVNVAAKWKGLSGNDLVVSIEGAVAGLT 199
I + A V + AK KG GN+ +++ GLT
Sbjct: 150 ITAADVTVTASGSGAA----VTLTAKHKGEIGNESGLTVSTGNTGLT 192
>ref|NP_313006.1| (NC_002695) putative tail sheath protein [Escherichia coli O157:H7]
dbj|BAB38402.1| (AP002567) putative tail sheath protein [Escherichia coli O157:H7]
Length = 500
Score = 60.8 bits (146), Expect = 3e-008
Identities = 96/435 (22%), Positives = 156/435 (35%), Gaps = 27/435 (6%)
Query: 53 PFRATSAAQVGQRFGFGSPLHLAALQLLPDNNDGLRSIPLTIFPMGDNDAGVAAEGVLDL 112
P R T A+Q + +G GS + L + + N D L G N G A ++L
Sbjct: 58 PVRITRASQATELWGRGSMIALMVAEFIAINPDA----ELYAIAQG-NGTGQATAAAMNL 112
Query: 113 SGTASATATVRVRIGSQRSSLVTIPTGTTAEQAAALLVAGIQGNPFMPMTAAVDA-TNDN 171
+GT + V +G +R +L P G ++ L A I +P P TA+ A + DN
Sbjct: 113 TGTVTRDGIVYACVGGRRYTLPA-PKGKKGKELTDELAALINADPDAPFTASSGAGSGDN 171
Query: 172 ------EVNVAAKWKG-LSGNDLVVSI---EGAVAGLTVAITQXXXXXXXXXXXXXXXXF 221
+ + A++ G S +D+ ++ E G+ VAI
Sbjct: 172 GAGLKGSLGITARFTGECSVHDVRLNYHDGEATPEGIQVAIAYPKQKAANPDITRSVAGM 231
Query: 222 GENVWYTQIVSCFNTANTDALNAFETFGEGRWNPILRRPLAVFTGTNETNANTLAAIGDA 281
G+ + ++ + AN L RW P ++ T A A
Sbjct: 232 GDRQYNYVVMPYKDDAN---LKIISDELLKRWGPAKMSDGVLWLAHTGTFGEVQAF--GA 286
Query: 282 RRAQRTNLITPVPGAQNLPCEIXXXXXXXXXXXXNNDPASDFARLTLPGLTPGTDAQQWS 341
+R T +P A DPA L LP A + +
Sbjct: 287 KRNDFLATCTSIPKAPEPDYLWAAAVCATCAPSLGADPARPLQTLALPSRMAPAPADRLT 346
Query: 342 HTQRDLLVKAGIST-SIVRDGVAEISDTVTTYHPT--GEVNPGYLFYKDTVKASNVLYNL 398
+R+ L+ GI+T S+ V +I VT Y GE +P YL + S + Y+L
Sbjct: 347 REERNSLLWGGIATVSVAAGDVVQIERQVTMYRQNVYGESDPSYLDVETIYTLSWLRYSL 406
Query: 399 DLIFNTREWDGAPLIPDDQPT-TNARAKRPKXXXXXXXXXXXXXXXXXIISDVPFTLENI 457
F T+ + L D P P+ ++ ++ +N+
Sbjct: 407 -RTFITQRFPRHKLADDGTPVRAGQNIVTPEIMKLQLISLGEEWVELGLVENLDTFKKNL 465
Query: 458 RASINDQNPNRLDII 472
N + NRLD++
Sbjct: 466 LVERNASDRNRLDVL 480
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.317 0.134 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 248,521,712
Number of Sequences: 884779
Number of extensions: 9678405
Number of successful extensions: 20487
Number of sequences better than 5.0e-02: 7
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 20478
Number of HSP's gapped (non-prelim): 7
length of query: 500
length of database: 277,083,050
effective HSP length: 124
effective length of query: 376
effective length of database: 167,370,454
effective search space: 62931290704
effective search space used: 62931290704
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 93 (40.4 bits)
Query= orf13 [3474-3602]
(127 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.310 0.128 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 65,088,242
Number of Sequences: 884779
Number of extensions: 2494057
Number of successful extensions: 7057
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 7057
Number of HSP's gapped (non-prelim): 0
length of query: 127
length of database: 277,083,050
effective HSP length: 103
effective length of query: 24
effective length of database: 185,950,813
effective search space: 4462819512
effective search space used: 4462819512
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 83 (36.6 bits)
Query= orf14 [3627-3761]
(133 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.318 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 70,441,945
Number of Sequences: 884779
Number of extensions: 2796944
Number of successful extensions: 7351
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 7351
Number of HSP's gapped (non-prelim): 0
length of query: 133
length of database: 277,083,050
effective HSP length: 109
effective length of query: 24
effective length of database: 180,642,139
effective search space: 4335411336
effective search space used: 4335411336
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 83 (36.6 bits)
Query= orf15 [3793-3828]
(34 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.326 0.144 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,999,558
Number of Sequences: 884779
Number of extensions: 270277
Number of successful extensions: 549
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 549
Number of HSP's gapped (non-prelim): 0
length of query: 34
length of database: 277,083,050
effective HSP length: 10
effective length of query: 24
effective length of database: 268,235,260
effective search space: 6437646240
effective search space used: 6437646240
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 84 (37.0 bits)
Query= orf16 [3866-4448]
(581 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_058453.1| (NC_002321) orf 15 [Staphylococcus aureus ... 172 9e-042
ref|NP_061640.1| (NC_002486) phi PVL ORF 15 and 16 homologu... 172 9e-042
ref|NP_310668.1| (NC_002695) putative tail protein [Escheri... 163 4e-039
ref|NP_288365.1| (NC_002655) putative tail fiber component ... 161 2e-038
pir||T13525 hypothetical protein 36 - Bacillus phage phi-10... 153 4e-036
ref|NP_108725.1| (NC_002703) unknown [Bacteriophage Tuc2009... 153 5e-036
ref|NP_112708.1| (NC_002747) TMP [Lactococcus lactis bacter... 152 9e-036
ref|NP_075512.1| (NC_002661) orf1374 [Staphylococcus aureus... 145 1e-033
ref|NP_490625.1| (NC_003278) orf25; similar to T gene of P2... 132 1e-029
ref|NP_046782.1| (NC_001895) gpT [Enterobacteria phage P2] ... 120 4e-026
ref|NP_052273.1| (NC_001317) G protein [Enterobacteria phag... 112 8e-024
ref|NP_471714.1| (NC_003212) similar to hypothetical protei... 111 2e-023
gb|AAK85310.1| (AY043263) phage-related tail protein [Wolba... 48 2e-004
>ref|NP_058453.1| (NC_002321) orf 15 [Staphylococcus aureus bacteriophage PVL]
pir||T00148 hypothetical protein 15 - Staphylococcus aureus phage phi PVL
dbj|BAA31888.1| (AB009866) orf 15 [Staphylococcus aureus bacteriophage PVL]
Length = 694
Score = 172 bits (436), Expect = 9e-042
Identities = 115/384 (29%), Positives = 191/384 (48%), Gaps = 25/384 (6%)
Query: 37 SRAGIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMA 96
++ GIEF+ + + G + ++ L A+++G + ++A+A
Sbjct: 179 AKKGIEFDDSMRKVKATSGAT----GEEFEALKKKAREMGATTKFSASDSAEALNYMALA 234
Query: 97 GFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 156
G+D++Q + L ++ LA A+G +L +DI TD L AFGL KDS LA DVL
Sbjct: 235 GWDSKQMMEGLSGVMDLAAASGEELGAVSDIVTDGLTAFGLKAKDSGHLA-------DVL 287
Query: 157 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFF 216
A+TS+ ANT + + E + AP A A G +IE S IG+M+N GIK E AGTA++ F
Sbjct: 288 AQTSSKANTDVRGLGEAFKYVAPVAGALGYTIEDTSIAIGLMSNAGIKGEKAGTALRTMF 347
Query: 217 LRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGA 276
L++P + RLGI + DS G M ++ L + + Q+ + +FG
Sbjct: 348 TNLSSPTRAMGNEMERLGISITDSNGKMIPMRKLLDQLREKFKHLSKDQQASSAATIFGK 407
Query: 277 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 336
E ++G L +INA + + ++ S+ GA+ RMA + + G LRTLRS +E +A+
Sbjct: 408 EAMSGALAIINASDEDYQKLTKSIDSSTGASKRMADTMESGLGGKLRTLRSQLEELALTI 467
Query: 337 FELSGGSMRDVVDQATAWLRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATL-W 395
++ +++ +V + V ++ + +I V G L V L +
Sbjct: 468 YDRIEPALKIIVSAFS------------KVVTWVTKLPTSIQLAVVGFGLFVAVLGPLVF 515
Query: 396 AFNTIVSTITGAITLLN-LVIAAN 418
F +S + A+T+L L+I N
Sbjct: 516 MFGLFISVMGNAMTVLGPLLINVN 539
>ref|NP_061640.1| (NC_002486) phi PVL ORF 15 and 16 homologue [Staphylococcus aureus
prophage phiPV83]
dbj|BAA97857.1| (AB044554) phi PVL ORF 15 and 16 homologue [Staphylococcus aureus
prophage phiPV83]
Length = 1313
Score = 172 bits (436), Expect = 9e-042
Identities = 115/384 (29%), Positives = 191/384 (48%), Gaps = 25/384 (6%)
Query: 37 SRAGIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMA 96
++ GIEF+ + + G + ++ L A+++G + ++A+A
Sbjct: 179 AKKGIEFDDSMRKVKATSGAT----GEEFEALKKKAREMGATTKFSASDSAEALNYMALA 234
Query: 97 GFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 156
G+D++Q + L ++ LA A+G +L +DI TD L AFGL KDS A DVL
Sbjct: 235 GWDSKQMMEGLSGVMDLAAASGEELGAVSDIVTDGLTAFGLKAKDSGHFA-------DVL 287
Query: 157 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFF 216
A+TS+ ANT + + E + AP A A G +IE S IG+M+N GIK E AGTA++ F
Sbjct: 288 AQTSSKANTDVRGLGEAFKYVAPVAGALGYTIEDTSIAIGLMSNAGIKGEKAGTALRTMF 347
Query: 217 LRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGA 276
L++P + RLGI + DS G M ++ L + + Q+ + +FG
Sbjct: 348 TNLSSPTRAMGNEMERLGISITDSNGKMIPMRKLLDQLREKFKHLSKDQQASSAATIFGK 407
Query: 277 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 336
E ++G L +INA + + ++ S+ GA+ RMA + + G LRTLRS +E +A+
Sbjct: 408 EAMSGALAIINASDEDYQKLTKSIDSSTGASKRMADTMESGLGGKLRTLRSQLEELALTI 467
Query: 337 FELSGGSMRDVVDQATAWLRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATL-W 395
++ +++ +V + V ++ + +I V G L A V L +
Sbjct: 468 YDRIEPALKIIVSAFS------------KVVTWVTKLPTSIQLAVIGFGLFAAVLGPLVF 515
Query: 396 AFNTIVSTITGAITLLN-LVIAAN 418
F +S + A+T+L L+I N
Sbjct: 516 MFGLFISVMGNAMTVLGPLLINVN 539
>ref|NP_310668.1| (NC_002695) putative tail protein [Escherichia coli O157:H7]
dbj|BAB36064.1| (AP002559) putative tail protein [Escherichia coli O157:H7]
Length = 696
Score = 163 bits (413), Expect = 4e-039
Identities = 122/399 (30%), Positives = 182/399 (45%), Gaps = 47/399 (11%)
Query: 89 GIEFLAMAGFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALN 148
G FLAMAGF Q AALP ++++A A G++L DI ++ L F L A
Sbjct: 247 GQSFLAMAGFTPQAIQAALPGVLNMALAGGVELGETADIGSNILTQFNLT-------ADQ 299
Query: 149 LARVNDVLAKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIA 208
+ RV D L T NT + + ET++ P A G S+E +AM G++ANNG++ A
Sbjct: 300 MDRVGDTLTAAFTRTNTDLRALGETMKYTGPVAAKLGISLEEAAAMAGMLANNGLRGSDA 359
Query: 209 GTAVQNFFLRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLA 268
GTA++ RLA+P A L+ LG+ VAD+ G MR DV+ DL A + G+ +++
Sbjct: 360 GTAMRASLSRLASPPKAAADALKELGVSVADARGKMRPMEDVLLDLYKATQKYGQVDQVS 419
Query: 269 VIQKVFGAEGLAG-NLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRS 327
+ + G E G V AG L + L A G ADR+AK + D++ G L+ L S
Sbjct: 420 FFKDIAGEEAFVGLQTLVAAAGSGELQKLTRELQGARGEADRVAKVMADNLDGDLKNLDS 479
Query: 328 TVESVAIRFFELSGGSMRDVVDQATAWLR-----------ANRELIAQNVA--------- 367
E + IR +L G +R V T L R+L+ A
Sbjct: 480 AWEGLRIRISDLVDGPLRSVTQWLTRVLEKITSLAQAHPVLTRQLLIAGGALLAMTATIG 539
Query: 368 ----------GFLNTIVENIDSIVRGIKLIATVFATLWAFNTIVSTITGAITLLNLVIAA 417
G L T+ D + R + ++ V LW +TG+++LL VI A
Sbjct: 540 SLSLVIGVLYGKLATLRLGFDILTRSMNVV-RVLPALWGM------VTGSVSLLGGVIGA 592
Query: 418 NPXXXXXXXXXXXXXXXXXXXYMHWEPIKSFFVNLWDGI 456
+ +W+PI++FF ++ GI
Sbjct: 593 --LFSPVGLIVAALAGAAVLIWKYWDPIRAFFAGVFSGI 629
>ref|NP_288365.1| (NC_002655) putative tail fiber component of prophage CP-933T
[Escherichia coli O157:H7 EDL933]
gb|AAG56919.1|AE005413_13 (AE005413) putative tail fiber component of prophage CP-933T
[Escherichia coli O157:H7 EDL933]
Length = 696
Score = 161 bits (407), Expect = 2e-038
Identities = 121/399 (30%), Positives = 181/399 (45%), Gaps = 47/399 (11%)
Query: 89 GIEFLAMAGFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALN 148
G FLAMAGF Q AALP ++++A A G++L DI ++ L F L A
Sbjct: 247 GQSFLAMAGFTPQAIQAALPGVLNMALAGGVELGETADIGSNILTQFNLT-------ADQ 299
Query: 149 LARVNDVLAKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIA 208
+ RV D L T NT + + ET++ P A G S+E +AM G++ANNG++ A
Sbjct: 300 MDRVGDTLTAAFTRTNTDLRALGETMKYTGPVAAKLGISLEEAAAMAGMLANNGLRGSDA 359
Query: 209 GTAVQNFFLRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLA 268
GTA++ RLA+P A L+ LG+ VAD+ G MR DV+ DL A + G+ +++
Sbjct: 360 GTAMRASLSRLASPPKAAADALKELGVSVADARGKMRPMEDVLLDLYKATQKYGQVDQVS 419
Query: 269 VIQKVFGAEGLAG-NLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRS 327
+ + G G V AG L + L A G ADR+AK + D++ G L+ L S
Sbjct: 420 FFKDIAGEXAFVGLQTLVAAAGSGELQKLTRELQGARGEADRVAKVMADNLDGDLKNLDS 479
Query: 328 TVESVAIRFFELSGGSMRDVVDQATAWLR-----------ANRELIAQNVA--------- 367
E + IR +L G +R V T L R+L+ A
Sbjct: 480 AWEGLRIRISDLVDGPLRSVTQWLTRVLEKITSLAQAHPVLTRQLLIAGGALLAMTATIG 539
Query: 368 ----------GFLNTIVENIDSIVRGIKLIATVFATLWAFNTIVSTITGAITLLNLVIAA 417
G L T+ D + R + ++ V LW +TG+++LL VI A
Sbjct: 540 SLSLVIGVLYGKLATLRLGFDILTRSMNVV-RVLPALWGM------VTGSVSLLGGVIGA 592
Query: 418 NPXXXXXXXXXXXXXXXXXXXYMHWEPIKSFFVNLWDGI 456
+ +W+PI++FF ++ GI
Sbjct: 593 --LFSPVGLIVAALAGAAVLIWKYWDPIRAFFAGVFSGI 629
>pir||T13525 hypothetical protein 36 - Bacillus phage phi-105
dbj|BAA36642.1| (AB016282) ORF36 [bacteriophage phi-105]
Length = 1018
Score = 153 bits (387), Expect = 4e-036
Identities = 105/373 (28%), Positives = 179/373 (47%), Gaps = 16/373 (4%)
Query: 38 RAGIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMAG 97
+ GI+FE +++ G + + K L + AK++G FLAMAG
Sbjct: 173 QVGIDFEKQMSKVQAISGGSAAEIAK----LREQAKELGATTVFTASQAADAQGFLAMAG 228
Query: 98 FDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLA 157
F A+P ++SLA A ++L A DI ++ + AF L K+S +DV+A
Sbjct: 229 FKVNDIYDAMPGMLSLAAAGQLELGMAADITSNIMSAFALKAKES-------GHASDVIA 281
Query: 158 KTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFFL 217
+ +ANT ++ M E ++ AP A + G +E A I + G++ IAG A +
Sbjct: 282 YAAANANTNVEQMGEAMKFLAPNANSLGWGMEESPAAIMAFGDAGLQGSIAGQAFGTSLI 341
Query: 218 RLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGAE 277
RLA+P G+A K++++LG D+AGNM+ +VV ++ L M + Q+ A ++ + GAE
Sbjct: 342 RLASPTGKASKLVKKLGFNFFDAAGNMKSMPEVVEEMEKGLKGMTKEQQAAALKTIVGAE 401
Query: 278 GLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRFF 337
++ G AL + L ++GAA +MA + D+ GS+ +S +E I+
Sbjct: 402 AYKHWAVLLQKGSKALGDNTKALEKSDGAAKKMADTMLDNAHGSIVAFQSALEGAKIKLT 461
Query: 338 ELSGGSMRDVVDQATAWLRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATLWAF 397
E ++ D+ ++ + + L + V T V + G+ + T ATL A
Sbjct: 462 ESLLPALGDLANKGSDLIMMFNNLDSGTVQTIAKTAV-----LATGVLGVTTAVATLTAG 516
Query: 398 NTIVSTITGAITL 410
+ TG + L
Sbjct: 517 IGALLAFTGPVGL 529
>ref|NP_108725.1| (NC_002703) unknown [Bacteriophage Tuc2009]
gb|AAK19873.1|AF109874_46 (AF109874) unknown [Bacteriophage Tuc2009]
Length = 1025
Score = 153 bits (386), Expect = 5e-036
Identities = 114/441 (25%), Positives = 203/441 (45%), Gaps = 23/441 (5%)
Query: 37 SRAGIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMA 96
++ G +FE ++R G + ++++L A +G G+E LA A
Sbjct: 69 AKIGGDFESQMSRVKAISGATGS----SFEELRQQAIDLGAKTAFSAKESASGMENLASA 124
Query: 97 GFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 156
GF+A++ + A+P ++ LA +G D+ A++ A +L F L S +A +V
Sbjct: 125 GFNAKEIMEAMPGLLDLAAVSGGDVGLASENAATALRGFNLDASQSGHVA-------NVF 177
Query: 157 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFF 216
AK + + N + M E ++ AP A + G SIE VSA IG+M++ GIK AGT+++
Sbjct: 178 AKAAANTNAEVGDMGEAMKYIAPVANSMGLSIEEVSAAIGIMSDAGIKGSQAGTSLRGAL 237
Query: 217 LRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGA 276
RLA P + + LG+ DS G M+ D +G L A + Q+ + ++G
Sbjct: 238 SRLAKPTDAMQAKMDELGLSFYDSEGKMKPLKDQIGMLKDAFKGLTPEQQQNALVTLYGQ 297
Query: 277 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 336
E L+G + +I+ G D L + +L +++GAAD+MAK + D+M SL + +ES AI
Sbjct: 298 ESLSGMIALIDKGPDKLGKLTESLKNSDGAADKMAKTMQDNMNSSLEQMMGALESAAIVV 357
Query: 337 FELSGGSMRDVVDQATAWLR--ANRELIAQNVAGFLNTIVENIDSIV----RGIKLIATV 390
++ ++R V D + + + Q + + IV I ++ + + ++ V
Sbjct: 358 QKILAPAVRKVADSISGLVDKFVSAPEPVQKMIVTIGLIVAAIGPLLVIFGQAVLVLQRV 417
Query: 391 FATLWAFNTIVSTITGAITLLNLVIAANPXXXXXXXXXXXXXXXXXXXYMHWEPIKSFFV 450
A + ++ I G+ T ++L + Y +W+ I F
Sbjct: 418 KVGFLALRSGLALIGGSFTAISLPVLG------IIAAIAAVIAIGILVYKNWDKISKFGK 471
Query: 451 NLWDGIGIAFDTFAMNLANKW 471
+W + A + KW
Sbjct: 472 EVWANVKKFASDAAEVIKEKW 492
>ref|NP_112708.1| (NC_002747) TMP [Lactococcus lactis bacteriophage TP901-1]
gb|AAG32164.1|AF252967_1 (AF252967) tape measure protein [Lactococcus lactis bacteriophage
TP901-1]
gb|AAK38062.1| (AF304433) TMP [Lactococcus lactis bacteriophage TP901-1]
Length = 937
Score = 152 bits (384), Expect = 9e-036
Identities = 115/441 (26%), Positives = 201/441 (45%), Gaps = 23/441 (5%)
Query: 37 SRAGIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMA 96
++ G +FE ++R G + ++++L A +G G+E LA A
Sbjct: 69 AKIGGDFESQMSRVKAISGATGS----SFEELRQQAIDLGAKTAFSAKESASGMENLASA 124
Query: 97 GFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 156
GF+A++ + A+P ++ LA +G D+A A++ A +L F L S +A +V
Sbjct: 125 GFNAKEIMEAMPGLLDLAAVSGGDVALASENAATALRGFNLDASQSGHVA-------NVF 177
Query: 157 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFF 216
AK + N + M E ++ AP A + G SIE VSA IG+M++ GIK AGT+++
Sbjct: 178 AKAAADTNAEVGDMGEAMKYIAPVANSMGLSIEEVSAAIGIMSDAGIKGSQAGTSLRGAL 237
Query: 217 LRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGA 276
RLA P + + LG+ DS G M+ D +G L A + Q+ + ++G
Sbjct: 238 SRLADPTDAMQAKMDELGLSFYDSEGKMKPLKDQIGMLKDAFKGLTPEQQQNALVTLYGQ 297
Query: 277 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 336
E L+G + +I+ G D L + +L +++GAAD+MAK + D+M SL + ES AI
Sbjct: 298 ESLSGMMALIDKGPDKLGKLTESLKNSDGAADKMAKTMQDNMNSSLEQMMGAFESAAIVV 357
Query: 337 FELSGGSMRDVVDQATAWLR--ANRELIAQNVAGFLNTIVENIDSIV----RGIKLIATV 390
++ ++R V D + + + Q + + IV I ++ + + + V
Sbjct: 358 QKILAPAVRKVADSISGLVDKFVSAPEPVQKMIVTIGLIVAAIGPLLVIFGQAVVTLQRV 417
Query: 391 FATLWAFNTIVSTITGAITLLNLVIAANPXXXXXXXXXXXXXXXXXXXYMHWEPIKSFFV 450
A + ++ I G+ T ++L + Y +W+ I F
Sbjct: 418 KVGFLALRSGLALIGGSFTAISLPVLG------IIAAIAAVIAIGILVYKNWDKISKFGK 471
Query: 451 NLWDGIGIAFDTFAMNLANKW 471
+W + A + KW
Sbjct: 472 EVWANVKKFASDAAEVIKEKW 492
>ref|NP_075512.1| (NC_002661) orf1374 [Staphylococcus aureus temperate phage phiSLT]
dbj|BAB21743.1| (AB045978) orf1374 [Staphylococcus aureus temperate phage phiSLT]
Length = 1374
Score = 145 bits (365), Expect = 1e-033
Identities = 89/317 (28%), Positives = 160/317 (50%), Gaps = 18/317 (5%)
Query: 53 KMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMAGFDAQQSVAALPKIVS 112
++G +K + + + A +G G+E LA GF+A+Q++ A+P ++S
Sbjct: 345 RVGAIAQASSKDLKSMSNQAVDLGAKTSKSANEVAKGMEELAALGFNAKQTMEAMPGVIS 404
Query: 113 LATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLAKTSTSANTTIDMMFE 172
A A+G ++A + ++ +FGL D+ +A D+LA+++ + I M +
Sbjct: 405 AAEASGAEMATTATVMASAINSFGLKASDANHVA-------DLLARSANDSAADIQYMGD 457
Query: 173 TIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFFLRLAAPAGEARKILRR 232
++ A A A G SIE SA I V++N+G++ AGTA++ F+RLA P+ K +++
Sbjct: 458 ALKYAGTPAKALGVSIEDTSAAIEVLSNSGLEGSQAGTALRASFIRLANPSKNTAKEMKK 517
Query: 233 LGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGAEGLAGNLGVINAGKDA 292
LGI ++D+ G +++ + M Q+LA + + G E +G L +I AG D
Sbjct: 518 LGIHLSDAKGQFVGMGELIRQFQDNMKGMTREQKLATVATIVGTEAASGFLALIEAGPDK 577
Query: 293 LVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRFFELSGGSMRDVVDQAT 352
+ Y +L ++ G + + A + D++ G+L L ES+AI +V T
Sbjct: 578 INSYSKSLKNSNGESKKAADLMKDNLKGALEQLGGAFESLAI-----------EVGKDLT 626
Query: 353 AWLRANRELIAQNVAGF 369
+RA E + + V GF
Sbjct: 627 PMIRAGAEGLTKLVDGF 643
>ref|NP_490625.1| (NC_003278) orf25; similar to T gene of P2 [bacteriophage phi CTX]
dbj|BAA36253.1| (AB008550) orf25; similar to T gene of P2 [bacteriophage phi CTX]
Length = 904
Score = 132 bits (332), Expect = 1e-029
Identities = 107/457 (23%), Positives = 205/457 (44%), Gaps = 54/457 (11%)
Query: 40 GIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMAGFD 99
G EF+ ++ AT + K Q L A+ + G FL G++
Sbjct: 212 GYEFDATMS-ATQAVTRIERKDDPQMQALRQQARTLPLSSKFTDKEVAQGQYFLGRTGYN 270
Query: 100 AQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLAKT 159
A+Q + A+P +++LA A +DL DIA++ A G+ + Q+A DVL
Sbjct: 271 AKQILGAMPGMLNLAAAGDMDLGDTADIASNIQTAMGIPAEKMDQVA-------DVLTAA 323
Query: 160 STSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFFLRL 219
T N I M+ ++++ +A GQS+ETV+ ++ N G++ +AGT++++ RL
Sbjct: 324 FTRNNVDIRMLGDSLKYSAGVGREYGQSLETVTTATALLGNAGVQGSMAGTSMRSVLTRL 383
Query: 220 AAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGAEGL 279
K + +LG++ D+ GNMRD D++ D+N + MG QR A+ + + G +
Sbjct: 384 G-----TSKAVAKLGVQTKDANGNMRDMLDILKDINKKTAGMGNVQRGAIFKDIAGQYAV 438
Query: 280 AGNLGVINAGKDALVE-YRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRFFE 338
++ A + + R +L ++EG A R+A D++ G + L + +E++++ F+
Sbjct: 439 TSFGTLMRAVEGGQFQTMRESLNNSEGEAARVAATQLDNLKGDMTMLHAALENISVELFD 498
Query: 339 LSGGSMRDVVDQAT-------AWLRANREL---IAQNVAGFLNTIVENIDSIVRGIK-LI 387
+ +R++ + +L+AN ++ I VA F + ++ + S+ + ++
Sbjct: 499 KNSPWLRELAADLSHLLHNIGEFLKANPQVSKGIVITVAAF-SALMATVGSLAITLAGIL 557
Query: 388 ATVFATLWAFNTIVSTITGAITLLNLV----------------------------IAANP 419
+ A + TI + G I LL L+ +AANP
Sbjct: 558 GPMIAVRFMLGTIGIRLPGLIGLLKLLFAPIRMLAGLLIGPLVTALRVVSIALWGLAANP 617
Query: 420 XXXXXXXXXXXXXXXXXXXYMHWEPIKSFFVNLWDGI 456
Y +W+ +K++ + +W+ I
Sbjct: 618 VVLAIAAVVAVLAGAAYLIYRNWDAVKAYLLGMWEEI 654
>ref|NP_046782.1| (NC_001895) gpT [Enterobacteria phage P2]
gb|AAD03293.1| (AF063097) gpT [Enterobacteria phage P2]
Length = 815
Score = 120 bits (301), Expect = 4e-026
Identities = 108/472 (22%), Positives = 186/472 (38%), Gaps = 46/472 (9%)
Query: 40 GIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMAGFD 99
G EF + +G V K + L A+Q+G +A AG D
Sbjct: 212 GYEFAQKNSELQAVIG--VAKDSAEMAALRKQARQLGDNTAASADDAAGAQIIIAKAGGD 269
Query: 100 AQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLAKT 159
AA P +++A A + + AF L +A + DVL+ T
Sbjct: 270 VDAIQAATPVTLNMALANRRTMEENAALLMGMKSAFQLSND-------KVAHIGDVLSMT 322
Query: 160 STSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFFLRL 219
D M + + AAP A AG SIE +AM+G + + I +AGT + RL
Sbjct: 323 MNKTAADFDGMSDALTYAAPVAKNAGVSIEETAAMVGALHDAKITGSMAGTGSRAVLSRL 382
Query: 220 AAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVAL--SQMGERQRLAVIQKVFGAE 277
AP G+A L+ LG++ +DS GN R F ++ ++ + +++G Q+ ++ +FG E
Sbjct: 383 QAPTGKAWDALKELGVKTSDSKGNTRPIFTILKEMQASFEKNRLGTAQQAEYMKTIFGEE 442
Query: 278 GLAGNLGVINAGKDALVEYRNTLLSA-EGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 336
+ ++ A ++ A +G + + D++ G + +S E+V
Sbjct: 443 ASSAAAVLMTAASTGKLDKLTAAFKASDGKTAELVNIMQDNLGGDFKAFQSAYEAVGTDL 502
Query: 337 FELSGGSMRDVVDQAT-------AWLRANRELI---------AQNVAGFLNTIVENIDSI 380
F+ G++R + AT W++ N+ L A + G + I +
Sbjct: 503 FDQQEGALRKLTQTATKYVLKLDGWIQKNKSLASTIGIIAGGALALTGIIGAIGLVAWPV 562
Query: 381 VRGIKLIATVFATLWA-FNTIVSTITGAITLLNLVIAANPXXXXXXXXXXXXXXXXXXXY 439
+ GI I + A F T+ S + AI ++ + A
Sbjct: 563 ITGINAIIAAAGAMGAVFTTVGSAVMTAIGAISWPVVA---------VVAAIVAGALLIR 613
Query: 440 MHWEPIKSFFVNLWDGIGIAFDTFAMNLANKWEALIAPIRSSIQWLLEQADA 491
+WEP+ +FF + +G+ AF L P++ WL E+ A
Sbjct: 614 KYWEPVSAFFGGVVEGLKAAFAPVG--------ELFTPLKPVFDWLGEKLQA 657
>ref|NP_052273.1| (NC_001317) G protein [Enterobacteria phage 186]
gb|AAC34170.1| (U32222) G protein [Enterobacteria phage 186]
Length = 812
Score = 112 bits (281), Expect = 8e-024
Identities = 105/449 (23%), Positives = 182/449 (40%), Gaps = 37/449 (8%)
Query: 58 VTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMAGFDAQQSVAALPKIVSLATAA 117
V K + Q L A+ VG +A +G DA A +++A +
Sbjct: 227 VDKQSPEMQALRKQARHVGDNTAASADDAASAQIIIAKSGGDAAAIQARRRVTLNMALSN 286
Query: 118 GIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLAKTSTSANTTIDMMFETIRKA 177
+ + T AF L +A + DVL+ T D + + + A
Sbjct: 287 RRTMEENAALLTGMKSAFQLSND-------KIAHIGDVLSMTMNKTAADFDGLSDALTYA 339
Query: 178 APTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFFLRLAAPAGEARKILRRLGIEV 237
AP A AG SIE +AM+G + + I +AGT + RL P +A + ++ LG++
Sbjct: 340 APVAKNAGVSIEQTAAMVGALHDAKITGSMAGTGSRAILSRLQLPPEKAFEAIKELGVKT 399
Query: 238 ADSAGNMRDAFDVVGDLNVAL--SQMGERQRLAVIQKVFGAEGLAGNLGVINAGKDALVE 295
+DS GN R F ++ ++ + + +G QR ++ +FG E + ++ A ++
Sbjct: 400 SDSKGNTRPIFSILKEMQRSFEKNNLGTSQRGEYMKTIFGEEASSAAAVLMEAASSGKLD 459
Query: 296 YRNTLLSA-EGAADRMAKRIGDDMLGSLRTLRSTVESVAIRFFELSGGSMRDVVDQAT-- 352
A +G + + K + D++ G + +S E+V F+ GS+R + AT
Sbjct: 460 RLTAAFKASDGKTEELVKVMQDNLGGDFKEFQSAYEAVGTDLFDQQEGSLRKLTQTATQY 519
Query: 353 -----AWLRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATLWAFNTIVS----- 402
W++ N+ L G + + I+ GI L+A + + N I++
Sbjct: 520 VLKLDGWIQKNKGL--ATTIGIIAGGALALIGIIGGIGLVA--WPVVMGINAIIAAAGVL 575
Query: 403 ----TITGAITLLNLVIAANPXXXXXXXXXXXXXXXXXXXYMHWEPIKSFFVNLWDGIGI 458
T+TG+ +V A +WEPI +FF + +GI
Sbjct: 576 GTVFTVTGSA----IVTALGAITWPIVAVGAAIVAGALLIRKYWEPISAFFSGVIEGIMS 631
Query: 459 AFDTFA---MNLANKWEALIAPIRSSIQW 484
AF A LA ++ L +R QW
Sbjct: 632 AFAPVAEMFAPLAPIFDGLGEKLRGVWQW 660
>ref|NP_471714.1| (NC_003212) similar to hypothetical protein [Lactobacillus casei
bacteriophage A2] [Listeria innocua]
emb|CAC97610.1| (AL596172) similar to hypothetical protein [Lactobacillus casei
bacteriophage A2] [Listeria innocua]
Length = 1601
Score = 111 bits (278), Expect = 2e-023
Identities = 74/340 (21%), Positives = 154/340 (44%), Gaps = 25/340 (7%)
Query: 37 SRAGIEFEHVI---NRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFL 93
++A ++F++ + N + G++ + Y + D A++ G++ +
Sbjct: 100 TKAYLDFDNEVTEVNSLLRESGESAKEFGDRYTQVFDYAQKASVKYGVASEQTMLGMKEM 159
Query: 94 AMAGFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVN 153
G+D Q++A++P I + A A+G D + T +L FG+++KD+ + +V
Sbjct: 160 VKKGYDINQTMASMPAIFNAARASGDDFETVMSVTTSTLEQFGMISKDTNKQMEYTNKVA 219
Query: 154 DVLAKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQ 213
DVL + M + P + + G S+ +A +G+++N GI+ + AGT ++
Sbjct: 220 DVLTYVADKTAAGFSDMGTAMNYVGPISHSLGYSLTDTAAAVGLLSNRGIEGQKAGTGLR 279
Query: 214 NFFLRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKV 273
L P+ A + + +G+ + D+ GNM+ ++ D+N +M + Q+ + + +
Sbjct: 280 GMLTSLLKPSKSAAEAMSAVGLTIEDNNGNMKTLPTLLDDINDKTKKMTKTQKNSFLTMI 339
Query: 274 FGAEGLAGNLGVINAGKDALVEYRNTLLSAEG----AADRMAK----------------- 312
FG E L+ ++ AG D+L +Y A G AD M K
Sbjct: 340 FGREPLSAVNTLLEAGGDSLRKYSKGADEANGYTKQVADNMRKAGKFGVDQFKASLEVLE 399
Query: 313 -RIGDDMLGSLRTLRSTVESVAIRFFELSGGSMRDVVDQA 351
+G ++ +L + + +F +LSG ++++ A
Sbjct: 400 QNVGQKLMPALTPIIEWANKMIDKFNDLSGEQQQNIIKWA 439
>gb|AAK85310.1| (AY043263) phage-related tail protein [Wolbachia endosymbiont of
Drosophila melanogaster]
Length = 680
Score = 48.1 bits (113), Expect = 2e-004
Identities = 83/476 (17%), Positives = 170/476 (35%), Gaps = 62/476 (13%)
Query: 38 RAGIEFEHVINRAT--VKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAM 95
+ I+FE + T V + TK + L ++++ + G
Sbjct: 110 KVAIDFESAMADVTKVVDFKKGTDEATKFAKKLKEMSRTIPLSAAELAQIAASG----GQ 165
Query: 96 AGFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDV 155
G + + ++TA + +A D +G+ + V +V
Sbjct: 166 LGIKKEDLFMFTETVAKMSTAFDMSAEQAGDSIAKLSNVYGIDVS-------KMEYVGNV 218
Query: 156 LAKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNF 215
+ S + M E + TA G I+ S+++ + G + A TA+
Sbjct: 219 INHLSDNTAAKAKDMVEALAIVGGTAKQFGLDIKETSSLVNAFVSLGKQPAKAATAINAL 278
Query: 216 FLRLAAP---AGEARKILRRLGIEVADSAGNMRD-AFDVVGDLNVALSQMGERQRLAVIQ 271
+L G+ + L ++GI + + + + + AL +M ++R ++
Sbjct: 279 LSKLQTAEEQGGDFKAALEQMGITAEEIVQRISENGEEALLYFFQALKKMDNQERSTILM 338
Query: 272 KVFGAE---GLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRST 328
K+FG E +A G N +DA+ +T R+ D LR LR+
Sbjct: 339 KLFGQEYQDDIALLAGSFNKYEDAIRLLSDTEEYKSSLQKEFQNRV-DTTASKLRLLRNA 397
Query: 329 VESVAIRFFELSGGSMRDVVDQATAWLRANRELIAQNVAGFLNTIVENIDSIVRGIKLIA 388
+ V + + +++ + + R+ L A+ I+ + +++ +K++A
Sbjct: 398 IAEVGMNLGSVMLPTLKSIAEFLQEKTRSIA-LFAEKYPTLTKAIMGTVAALI-SLKVVA 455
Query: 389 ------------TVFAT----LWAFNTIVSTITGAI-TLLNLVIAANPXXXXXXXXXXXX 431
T+F+ L F+ + +T+ A+ T L V ANP
Sbjct: 456 VGLGYGFTLLGSTIFSLKANLLGVFSFLSATVFPAVVTGLRAVTLANPIGLLITGLVTGA 515
Query: 432 XXXXXXXYMHWEPIKSFFVNLWDGIGIAFDTFAMNLANKWEALIAPIRSSIQWLLE 487
+W+ +K FF ++ W+++I PI + W+ E
Sbjct: 516 ALVIT----NWQKVKDFF------------------SSFWKSIIKPIEEAFSWIGE 549
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.319 0.133 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 223,816,781
Number of Sequences: 884779
Number of extensions: 7353514
Number of successful extensions: 20760
Number of sequences better than 5.0e-02: 26
Number of HSP's better than 0.1 without gapping: 17
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 20708
Number of HSP's gapped (non-prelim): 39
length of query: 581
length of database: 277,083,050
effective HSP length: 126
effective length of query: 455
effective length of database: 165,600,896
effective search space: 75348407680
effective search space used: 75348407680
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 94 (40.8 bits)
Query= orf17 [4448-4866]
(417 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_274140.1| (NC_003112) phage virion protein, putative... 40 0.033
>ref|NP_274140.1| (NC_003112) phage virion protein, putative [Neisseria meningitidis
MC58]
pir||B81122 phage virion protein, probable NMB1109 [imported] - Neisseria
meningitidis (group B strain MD58)
gb|AAF41500.1| (AE002460) phage virion protein, putative [Neisseria meningitidis
MC58]
Length = 443
Score = 40.4 bits (93), Expect = 0.033
Identities = 32/129 (24%), Positives = 59/129 (44%), Gaps = 10/129 (7%)
Query: 3 WQTRLREAAYTPPSGNRLTFIYTSVSEEFDQKGGPFDFAGADGTFVQFLGVTGRRYPMTI 62
W T L++A+Y G + + S + + PF G ++ +G+TGR+ +
Sbjct: 4 WHTLLQDASYKGV-GFDIEVVDESNGKALAEHARPF----VQGIDLEDMGMTGRQVQINA 58
Query: 63 IVSGDDYDLDAAAWMQALAEQGEAILEHPAYGRL--TVAPVGTVKRSENFVNGAGQATIE 120
+ G Y + AL + G +L HP +GR+ +A + + ++V+ AG I+
Sbjct: 59 VFWGKGYAGRLKKLLDALEQPGGGVLVHPVWGRMHNMIAASWSYRHEADYVDYAG---ID 115
Query: 121 VTLFETTGA 129
+T E A
Sbjct: 116 ITFREAAEA 124
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.317 0.134 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 175,870,520
Number of Sequences: 884779
Number of extensions: 6030334
Number of successful extensions: 14372
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 14372
Number of HSP's gapped (non-prelim): 1
length of query: 417
length of database: 277,083,050
effective HSP length: 123
effective length of query: 294
effective length of database: 168,255,233
effective search space: 49467038502
effective search space used: 49467038502
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 92 (40.0 bits)
Query= orf18 [4863-5281]
(417 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_313011.1| (NC_002695) putative tail protein [Escheri... 67 3e-010
ref|NP_439666.1| (NC_000907) 43 kD tail protein [Haemophilu... 66 6e-010
ref|NP_050648.1| (NC_000929) P [Enterobacteria phage Mu] >g... 65 2e-009
ref|NP_284542.1| (NC_003116) hypothetical protein [Neisseri... 62 8e-009
ref|NP_543101.1| (NC_003356) putative tail protein [Bacteri... 52 8e-006
>ref|NP_313011.1| (NC_002695) putative tail protein [Escherichia coli O157:H7]
dbj|BAB38407.1| (AP002567) putative tail protein [Escherichia coli O157:H7]
Length = 374
Score = 67.0 bits (162), Expect = 3e-010
Identities = 75/290 (25%), Positives = 114/290 (38%), Gaps = 20/290 (6%)
Query: 124 QPVSVDVGGERLFSGTLVNPQPRTAANERTVRASCYSTPGVLDDCTP--PASAFPLQWDE 181
Q + + + G+ + +G L + R +A + S G L DC+ P S QW
Sbjct: 52 QSLELRIDGQPVITGWLDQVRQRISATRFQITLSGRDKTGDLVDCSAIHPGS----QWRN 107
Query: 182 ATLQTIAADLCRPFGIQV--LAPDGT-GQTFERIAVEPAEKVMAVIARLAAQRNLVVRSD 238
TL+ IA+DLC PFG+ V D T + F +E +E V + R A R ++V S+
Sbjct: 108 RTLEHIASDLCAPFGVTVRWQVNDATAARPFSTFTLENSETVADALTRAARHRGVLVTSN 167
Query: 239 EQGRLVLLRPDT-RGEPVAEFIEGQQPPISVTPTFGNQDYY-------SHVTGITPTIVG 290
G LV + + RG+ + +V + +Y G T
Sbjct: 168 AAGELVFTQAGSQRGDTLTLGENLLDLDHNVDHRLRHSEYRVRGHGRGGGHAGDALTAGT 227
Query: 291 LEGPQATVRNPHLEGVLRPYVYNADDMAEAD-LVQAVQSKAGRMFAQSATYDVPVPTWRN 349
L P TV + + RP + AD +AD Q + R A+S V W
Sbjct: 228 LAAPVGTVTDSAIHR-YRPKIVLADHAVDADGARQRAVREMRRAVARSVRLTATVRHWFR 286
Query: 350 ANGDLWRVGDFVILEAPGAQVYRRTLMQIKTVRFSATTTERSAVLELIIP 399
NG LW + + AP V R L+ + V FS ++ P
Sbjct: 287 ENGQLWDINLLTAVTAPRTGVEERDLL-VCQVEFSLDANHGETTRLILAP 335
>ref|NP_439666.1| (NC_000907) 43 kD tail protein [Haemophilus influenzae Rd]
Length = 420
Score = 66.2 bits (160), Expect = 6e-010
Identities = 48/188 (25%), Positives = 92/188 (48%), Gaps = 10/188 (5%)
Query: 68 TQDPDEVTLRINGLNFRFWRAVTITQHLDAVS-TVSLHAPFDPNDEQSRDAFRPYSYQPV 126
T+ +++ L +NG W+++ + + L+++S L P D+ S A + P+
Sbjct: 7 TETNNKIELYLNGKILSGWKSLNLQRSLESMSGRFDLGIAVRPEDDISVLA----AGSPL 62
Query: 127 SVDVGGERLFSGTLVNPQPRTAANERTVRASCYSTPGVLDDCTPPASAFPLQWDEATLQT 186
+ +GG+ + +G L + R + N++T+ S L DC +++ Q+ T +
Sbjct: 63 VLKMGGQTVITGYLDEIKQRVSGNDKTISVSGRDKTCDLVDCAIIHNSY--QFKNQTAKQ 120
Query: 187 IAADLCRPFGIQVLAPDGTGQTFERI---AVEPAEKVMAVIARLAAQRNLVVRSDEQGRL 243
IA +C+PFGI V+ + ERI VEP E ++++A + ++V SD G L
Sbjct: 121 IAEAICKPFGISVVWQVQAPEANERIPVWQVEPGETAFDNLSKIARHKGVLVTSDVDGNL 180
Query: 244 VLLRPDTR 251
+ P +
Sbjct: 181 LFTEPSNK 188
>ref|NP_050648.1| (NC_000929) P [Enterobacteria phage Mu]
sp|P08558|VPP_BPMU 43 kDa tail protein (gpP)
pir||ZPBPMU gene P protein - phage Mu
emb|CAA29956.1| (X06796) 43 kDa tail protein [Enterobacteria phage Mu]
gb|AAF01122.1|AF083977_41 (AF083977) P [Enterobacteria phage Mu]
Length = 379
Score = 64.7 bits (156), Expect = 2e-009
Identities = 72/310 (23%), Positives = 129/310 (41%), Gaps = 30/310 (9%)
Query: 74 VTLRINGLNFRFWRAVTITQHLDAVSTV-SLHAPFDPNDEQSRDAFRPYSYQPVSVDVGG 132
VTLR +G F W +V++T+ +++V+ L P + S A + ++++GG
Sbjct: 5 VTLRADGRLFTGWTSVSVTRSIESVAGYFELGVNVPPGTDLSGLA----PGKKFTLEIGG 60
Query: 133 ERLFSGTLVNPQPRTAANERTVRASCYSTPGVLDDCTPPASAFPLQWDEATLQTIAADLC 192
+ + +G + + + + A+ + + L DC S QW TL+ IA DLC
Sbjct: 61 QIVCTGYIDSRRRQMTADSMKITVAGRDKTADLIDCAAVYSGG--QWKNRTLEQIARDLC 118
Query: 193 RPFGIQV---LAPDGTGQTFERIAVEPAEKVMAVIARLAAQRNLVVRSDEQGRLVLLRPD 249
P+G+ V L+ + F ++ +E V + R + R +++ S+ G LV R
Sbjct: 119 APYGVTVRWELSDKESSAAFPGFTLDHSETVYEALVRASRARGVLMTSNAAGELVFSR-- 176
Query: 250 TRGEPVAEFIEGQQPPISVTPTFGNQDYYSHVTGIT----PTIVGLEGPQATVRN-PHLE 304
E + G+ +T F +D+ + T G EG ++ +
Sbjct: 177 AASTATDELVLGEN---LLTLDF-EEDFRDRFSEYTVKGYARANGAEGDDIDAKSIVSRK 232
Query: 305 GV--------LRPYVYNADDMAEADLVQA-VQSKAGRMFAQSATYDVPVPTWRNANGDLW 355
G RP + AD A QA + R A+S T++ + W +G LW
Sbjct: 233 GTATDSDVTRYRPMIIIADSKITAKDAQARALREQRRRLAKSITFEAEIDGWTRKDGQLW 292
Query: 356 RVGDFVILEA 365
V ++A
Sbjct: 293 MPNLLVTIDA 302
>ref|NP_284542.1| (NC_003116) hypothetical protein [Neisseria meningitidis Z2491]
pir||C81809 hypothetical protein NMA1830 [imported] - Neisseria meningitidis
(group A strain Z2491)
emb|CAB72001.1| (AJ391256) hypothetical protein [Neisseria meningitidis]
emb|CAB85055.1| (AL162757) hypothetical protein [Neisseria meningitidis Z2491]
Length = 379
Score = 62.4 bits (150), Expect = 8e-009
Identities = 82/313 (26%), Positives = 122/313 (38%), Gaps = 34/313 (10%)
Query: 71 PDE-VTLRINGLNFRFWRAVTITQHLDAVS---TVSLHAPFDPNDEQSRDAFRPYSYQPV 126
PD VTL ING W I L + +V L P D + R + V
Sbjct: 4 PDNTVTLLINGKTHGQWTNYDIVSDLLTPADDFSVMLGRPVDAKPDAVRAGDK------V 57
Query: 127 SVDVGGERLFSGTLVNPQPRTAANERTVRASCYSTPGVLDDCTPPASAFPLQWDEATLQT 186
V VGG+ + SG + Q T +T+ GVL DC+ P + L
Sbjct: 58 EVRVGGDTVLSGRIDRVQTVTEKGGKTLTIQGRDDAGVLLDCSAPL----FNAQDMDLNQ 113
Query: 187 IAADLCRPFGIQVLAPDG--TGQTFERIAVEPAEKVMAVIARLAAQRNLVVRSDEQGRLV 244
I + +P G+ + D T +T ++ +EP + + A L + G LV
Sbjct: 114 IIEKIVKPLGLAKIRIDAAKTDKT-HKVQIEPGSRAWDALLEYAEANGLWPWLEPDGTLV 172
Query: 245 LLRPDTRGEPVAEFI---EGQQPPISVTPTFGNQDY---YSHVTGITPTIVGLEGPQATV 298
+ PD PVAE + GQ I N+D YS VT + + G +AT
Sbjct: 173 VGGPDYTAAPVAELVLRTNGQNNNIKRLEV--NRDMAARYSEVTVLAQSHSGKNNIKATA 230
Query: 299 RNPHLEGVLRPYVYNADDMAEADLVQAVQSKAGRMFAQS----ATYDVPVPTWRNANGDL 354
++ ++ + RP + D+ D Q KA + A S T V R +G L
Sbjct: 231 KDESVK-LHRPLIVTEPDI---DSQAQAQRKAKKRLADSRLEGLTITATVQGHRTDDGTL 286
Query: 355 WRVGDFV-ILEAP 366
W+ G + +L P
Sbjct: 287 WQPGQRINVLSEP 299
>ref|NP_543101.1| (NC_003356) putative tail protein [Bacteriophage P27]
emb|CAC83567.1| (AJ298298) putative tail protein [Bacteriophage P27]
Length = 351
Score = 52.4 bits (124), Expect = 8e-006
Identities = 75/315 (23%), Positives = 117/315 (36%), Gaps = 42/315 (13%)
Query: 74 VTLRINGLNFRFWRAVTITQHLDAVSTVSLHAPFDPNDEQSRDAFRPYSYQP-------V 126
V LR++G + W +V I+ ++ ++ + SRD YQP V
Sbjct: 5 VLLRVSGREWGGWTSVRISAGINRIAR-DFNVAITTRWPGSRD------YQPRIKNGELV 57
Query: 127 SVDVGGERLFSGTLVNPQPRTAANERTVRASCYSTPGVLDDCTPPASAFPLQWDEATLQT 186
V +G E + +G + R A+ ++ S L DC SA PLQ L
Sbjct: 58 EVLIGDEPVLTGYVEALPLRYDASSVSMGIVGRSKTADLVDC----SALPLQQSGKNLLR 113
Query: 187 IAADLCRPFGIQVLAPDGTGQTFERIAVEPAEKVMAVIARLAAQRNLVVRSDEQGRLVLL 246
I ++L PFGI V+ E E V + RL Q + DE GRLVL
Sbjct: 114 IVSELAAPFGITVIDAGVPQTAVIDAQPEHGETVADCLNRLLGQVQTLAYDDECGRLVLG 173
Query: 247 RPDTRGEPVAEFIEGQ-----QPPISVTPTF----------GNQDYYSHVTGITPTIVGL 291
+P T G+ + G+ S+ F GN D + T +
Sbjct: 174 KPGT-GKAATALVLGENILSCDTERSIRERFSEYQVSGQRPGNDDDFGEAT--------I 224
Query: 292 EGPQATVRNPHLEGVLRPYVYNADDMAEADLVQAVQSKAGRMFAQSATYDVPVPTWRNAN 351
+ T+++ + + + A + +A + A + V WR +
Sbjct: 225 AAIRQTIQDSGVTRYRPLLIQQSGTATTATCKARCEFEARQRAALTRETTYTVQGWRQGS 284
Query: 352 GDLWRVGDFVILEAP 366
G LWR G VI+ P
Sbjct: 285 GALWRPGLSVIVFDP 299
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.317 0.134 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 234,132,124
Number of Sequences: 884779
Number of extensions: 9822749
Number of successful extensions: 20636
Number of sequences better than 5.0e-02: 5
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 20629
Number of HSP's gapped (non-prelim): 7
length of query: 417
length of database: 277,083,050
effective HSP length: 123
effective length of query: 294
effective length of database: 168,255,233
effective search space: 49467038502
effective search space used: 49467038502
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 92 (40.0 bits)
Query= orf19 [5293-5489]
(195 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_267614.1| (NC_002662) UNKNOWN PROTEIN [Lactococcus l... 41 0.006
emb|CAC02002.2| (AL390114) possible l6202.3 [Leishmania major] 40 0.013
gb|AAD39531.2|AF149108_1 (AF149108) outer membrane protein ... 39 0.021
ref|NP_523104.1| (NC_003296) PUTATIVE HEMAGGLUTININ-RELATED... 39 0.021
ref|NP_519008.1| (NC_003295) PROBABLE HEMAGGLUTININ-RELATED... 39 0.037
ref|NP_522101.1| (NC_003296) PROBABLE HEMAGGLUTININ-RELATED... 39 0.037
gb|AAF50719.1| (AE003564) CG13292 gene product [Drosophila ... 38 0.048
>ref|NP_267614.1| (NC_002662) UNKNOWN PROTEIN [Lactococcus lactis subsp. lactis]
gb|AAK05556.1|AE006376_2 (AE006376) UNKNOWN PROTEIN [Lactococcus lactis subsp. lactis]
Length = 1441
Score = 41.2 bits (95), Expect = 0.006
Identities = 63/224 (28%), Positives = 81/224 (36%), Gaps = 41/224 (18%)
Query: 4 NPGGGATVDAYHAQPAGDDCHPLPTDTAVL--------------VETPRSNNYAAVG-VV 48
NPGG TV + DD L ++ V+ TP N G VV
Sbjct: 1064 NPGGSVTVPGPDGKTGTDDDTTLNPNSPVVPGDNGSVTLPGGGTASTPNGNITLPGGTVV 1123
Query: 49 DPNNRQTAGPGERRVYSRNADGE-----QVAEVFLHDDGQVR-----VSNDEGFVDLAAD 98
DP+ G+ N DG Q + DDG+++ + D G V L
Sbjct: 1124 DPDGTIHLPGGD----IVNPDGTITLPGQDGKTGTGDDGKIKPNGPIIPGDNGSVTLPGG 1179
Query: 99 GTIQASNGQASATLQGTSITLDDGA----GGSISINGGVITLTG----TAIRLIGPVDAN 150
GT+ G + + G S+ DG GG I G ITL G T G V N
Sbjct: 1180 GTVTTPGG--TINVPGGSVVDPDGTVHLPGGDIVNPDGTITLPGQDGKTGTGDDGKVKPN 1237
Query: 151 GATISEA-GQITDADGLSVHGHNHTQANDSDNDVQQPTSTAQIP 193
G +IS G IT G +V T N V P T +P
Sbjct: 1238 GPSISNPDGSITLPGGGTVTTPGGT-INVPGGSVVDPDGTVHLP 1280
>emb|CAC02002.2| (AL390114) possible l6202.3 [Leishmania major]
Length = 1376
Score = 40.0 bits (92), Expect = 0.013
Identities = 48/194 (24%), Positives = 72/194 (36%), Gaps = 22/194 (11%)
Query: 4 NPGGGATVDAYHAQPAGDDCHPLPTDTAVLVETPRSNNYAAVGVVDPNNRQTAGPGERRV 63
+P G + HA AG+ P P A P N + + + +RQ A G R
Sbjct: 832 HPSGSGVRTSSHALAAGEHSQP-PAQRAAAAAAPPLANTSGASLQELQDRQRASSGSRSS 890
Query: 64 YSRNADGEQVAEVFLHDDGQVRVSN-------------DEGFVDLAADGTIQA--SNGQA 108
S+ E+ A + L VR SN D D T++A + +A
Sbjct: 891 SSKEGSEERPA-LHLSGVSSVRPSNGQEERLHSLVEAGDASTASADGDNTVEAFTARPEA 949
Query: 109 SATLQGTSITLDDGAGGSISINGGVITLTGTAIRLIGPVDANGATISEAGQITDADGLSV 168
A L TS DG GS S+ T T T +L P + S G ++ +
Sbjct: 950 VARLPRTS----DGTNGSASVPLFATTRTETVEQLSAPAH-DSTQASAVGNVSHCTAIRE 1004
Query: 169 HGHNHTQANDSDND 182
+ ++ SD+D
Sbjct: 1005 FTTITSSSSLSDSD 1018
>gb|AAD39531.2|AF149108_1 (AF149108) outer membrane protein A [Rickettsia australis]
Length = 2106
Score = 39.3 bits (90), Expect = 0.021
Identities = 30/82 (36%), Positives = 40/82 (48%), Gaps = 7/82 (8%)
Query: 91 GFVDLAADGTIQASNGQASA--TLQGTSITLDDGAG---GSISINGGVI--TLTGTAIRL 143
G V+ AADG + A+NG A A T + TL GAG G+I +NGG I +
Sbjct: 265 GAVNFAADGALTANNGIAGAVTTANNNTGTLTVGAGDVTGAIGVNGGNILKQVLFNGASN 324
Query: 144 IGPVDANGATISEAGQITDADG 165
+ +DA TI+ A A G
Sbjct: 325 VATIDATNVTINNAAANVTAAG 346
>ref|NP_523104.1| (NC_003296) PUTATIVE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD18696.1| (AL646085) PUTATIVE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 2691
Score = 39.3 bits (90), Expect = 0.021
Identities = 35/114 (30%), Positives = 49/114 (42%), Gaps = 18/114 (15%)
Query: 95 LAADGTIQA-----SNGQASATLQGT-----SITLDDGAG-----GSISINGGVITLTGT 139
LA G + A +NGQA TL GT +T AG GS+ + G + LTGT
Sbjct: 611 LATSGAVLAGGNVTANGQAGTTLSGTVYAAKGVTAQSSAGATSATGSVVAHSGKVALTGT 670
Query: 140 AIRLIGPVDANGATISEAGQ-ITDADGLSVHGHNHTQANDSDNDVQQPTSTAQI 192
++ + G + G T A Q DG + T + D+ P STA +
Sbjct: 671 SVAVSGSTQSGGDTALRATQGDATIDGQAAALGKLTVT--ASQDITGPGSTASV 722
>ref|NP_519008.1| (NC_003295) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD14589.1| (AL646061) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 3501
Score = 38.5 bits (88), Expect = 0.037
Identities = 28/77 (36%), Positives = 37/77 (47%), Gaps = 3/77 (3%)
Query: 91 GFVDLAADGTIQASNGQASATLQGTS--ITLDDGAGGSISINGGVITLTGTAIRLIGPVD 148
G +D+ A G + G+ SA Q T +LD+ AGG + G IT G G V
Sbjct: 742 GALDVQA-GAVSNQGGKLSAQSQATLNVASLDNSAGGYVGAQGVAITDQGALNNAGGTVA 800
Query: 149 ANGATISEAGQITDADG 165
A+GA AG I +A G
Sbjct: 801 ASGALTVSAGAIANAGG 817
>ref|NP_522101.1| (NC_003296) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD17691.1| (AL646079) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 3552
Score = 38.5 bits (88), Expect = 0.037
Identities = 28/77 (36%), Positives = 37/77 (47%), Gaps = 3/77 (3%)
Query: 91 GFVDLAADGTIQASNGQASATLQGTS--ITLDDGAGGSISINGGVITLTGTAIRLIGPVD 148
G +D+ A G + G+ SA Q T +LD+ AGG + G IT G G V
Sbjct: 741 GALDVQA-GAVSNQGGKLSAQSQATLNVASLDNSAGGYVGAQGVAITDQGALNNAGGTVA 799
Query: 149 ANGATISEAGQITDADG 165
A+GA AG I +A G
Sbjct: 800 ASGALTVSAGAIANAGG 816
>gb|AAF50719.1| (AE003564) CG13292 gene product [Drosophila melanogaster]
Length = 174
Score = 38.1 bits (87), Expect = 0.048
Identities = 21/84 (25%), Positives = 39/84 (46%)
Query: 107 QASATLQGTSITLDDGAGGSISINGGVITLTGTAIRLIGPVDANGATISEAGQITDADGL 166
+A+ + ++ D + S I G T GTA + + NG+T S + AD
Sbjct: 5 EAATMITNNNLDYDQASASSTIIAGATSTTAGTAATSVAMSNRNGSTSSSSNSSPTADNS 64
Query: 167 SVHGHNHTQANDSDNDVQQPTSTA 190
+ + ++++ AN + N Q P S +
Sbjct: 65 NSNSNSNSNANINRNLNQNPNSNS 88
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.310 0.130 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 116,963,488
Number of Sequences: 884779
Number of extensions: 4932970
Number of successful extensions: 11861
Number of sequences better than 5.0e-02: 7
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 11827
Number of HSP's gapped (non-prelim): 35
length of query: 195
length of database: 277,083,050
effective HSP length: 114
effective length of query: 81
effective length of database: 176,218,244
effective search space: 14273677764
effective search space used: 14273677764
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 87 (38.1 bits)
Query= orf20 [5492-5628]
(135 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_518999.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstoni... 67 4e-011
gb|AAB05794.1| (U62615) ORF 3 [Synechococcus sp. PCC 7942] 37 0.032
>ref|NP_518999.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
emb|CAD14580.1| (AL646061) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
Length = 132
Score = 66.6 bits (161), Expect = 4e-011
Identities = 38/125 (30%), Positives = 59/125 (46%), Gaps = 3/125 (2%)
Query: 3 LAFSQNIRNQRLSVVSTAADAGDGAALIRIYDGTRPPTGGQVTN--LLAQLEMSDPAFNA 60
+ S +RN RL V+ A DAG L+R+Y RP G + LLA++ + P +
Sbjct: 2 ITLSVPVRNSRLVVIGQALDAGTAGGLLRLYSAPRPDVGQALAEQVLLAEVRLPQPCMGS 61
Query: 61 PANGSMTARAITPEGSTPIGGTATWFRIVDSEGNFVLDGDVGLNGSNSEIELSDVNILSN 120
G + I + G W R+ DS+G +V D D GL S +E+EL + + +
Sbjct: 62 LEGGRLVFAPIG-QALCRRSGIVAWARLCDSDGRWVADLDAGLPDSGAEVELPKLQVFAG 120
Query: 121 QEVRI 125
V +
Sbjct: 121 GAVNV 125
>gb|AAB05794.1| (U62615) ORF 3 [Synechococcus sp. PCC 7942]
Length = 136
Score = 37.0 bits (84), Expect = 0.032
Identities = 36/135 (26%), Positives = 61/135 (44%), Gaps = 14/135 (10%)
Query: 1 MTLAFSQNIRNQRLSVVSTAADAGDGAALIRIYDGTRPPTGGQVTNLLAQLEMSDPAFNA 60
MT+A +RN ++ A G ALI +Y G P + Q+T + + ++ F
Sbjct: 1 MTVALGLELRNSLCELIRDRASNG---ALI-LYAGYDPHSD-QLTEEIVRFPLAAQPFRP 55
Query: 61 PANGSMTARAITPEGSTPIGGTATWFRIVDS------EGNFVLDGDVG--LNGSNSEIEL 112
+G+M A I P + GG ++ I S +L G +G N ++++ L
Sbjct: 56 VESGTMAAAPIAPARCSR-GGALRYWEIRSSPLHDTHPDQMLLRGSIGDITNPGDADLVL 114
Query: 113 SDVNILSNQEVRIAT 127
V +L NQ + IA+
Sbjct: 115 DTVAVLQNQAIAIAS 129
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.312 0.131 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 74,832,581
Number of Sequences: 884779
Number of extensions: 3151938
Number of successful extensions: 5395
Number of sequences better than 5.0e-02: 2
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 5393
Number of HSP's gapped (non-prelim): 2
length of query: 135
length of database: 277,083,050
effective HSP length: 111
effective length of query: 24
effective length of database: 178,872,581
effective search space: 4292941944
effective search space used: 4292941944
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 83 (36.6 bits)
Query= orf21 [5650-5851]
(200 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_404848.1| (NC_003143) putative bacteriophage protein... 40 0.013
>ref|NP_404848.1| (NC_003143) putative bacteriophage protein GP46 [Yersinia pestis]
emb|CAC90083.1| (AJ414147) putative bacteriophage protein GP46 [Yersinia pestis]
Length = 151
Score = 40.0 bits (92), Expect = 0.013
Identities = 28/85 (32%), Positives = 39/85 (44%), Gaps = 7/85 (8%)
Query: 87 NGDLYRTDGLETAVYLSLYG---GNPEDNGQDANRLGWWGNANQEEPARQMVSRFQHLVE 143
NG L D L+TA+ LSL+ +D +R GWWG++ A + L+
Sbjct: 22 NGGLLEGDDLQTAILLSLFTDRLARADDAIDSPDRRGWWGDSG----AASAIGSRLWLLR 77
Query: 144 GIPLTSGNLQRLEDAATADLEWLSE 168
LT+ + ED AT L WL E
Sbjct: 78 REKLTTQVAIKAEDYATEALAWLIE 102
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.315 0.134 5.49e-269
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 111,035,066
Number of Sequences: 884779
Number of extensions: 4889814
Number of successful extensions: 8247
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 8247
Number of HSP's gapped (non-prelim): 1
length of query: 200
length of database: 277,083,050
effective HSP length: 114
effective length of query: 86
effective length of database: 176,218,244
effective search space: 15154768984
effective search space used: 15154768984
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 88 (38.5 bits)
Query= orf22 [5854-6047]
(192 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.321 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 96,602,847
Number of Sequences: 884779
Number of extensions: 3693502
Number of successful extensions: 7308
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 7308
Number of HSP's gapped (non-prelim): 0
length of query: 192
length of database: 277,083,050
effective HSP length: 113
effective length of query: 79
effective length of database: 177,103,023
effective search space: 13991138817
effective search space used: 13991138817
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 87 (38.1 bits)
Query= orf23 [6108-6251]
(142 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.316 0.134 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 71,856,967
Number of Sequences: 884779
Number of extensions: 2632971
Number of successful extensions: 13430
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 13430
Number of HSP's gapped (non-prelim): 0
length of query: 142
length of database: 277,083,050
effective HSP length: 118
effective length of query: 24
effective length of database: 172,679,128
effective search space: 4144299072
effective search space used: 4144299072
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 83 (36.6 bits)
Query= orf24 [6249-6490]
(240 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
dbj|BAB13413.1| (AB046807) KIAA1587 protein [Homo sapiens] 40 0.014
ref|XP_034014.1| (XM_034014) KIAA1587 protein [Homo sapiens] 40 0.014
>dbj|BAB13413.1| (AB046807) KIAA1587 protein [Homo sapiens]
Length = 991
Score = 40.4 bits (93), Expect = 0.014
Identities = 22/65 (33%), Positives = 34/65 (51%), Gaps = 2/65 (3%)
Query: 145 EPLAQCGEPTAQAGNSLQTPPTPQGRLLVNIIQTDTGQVEYAIPTDPAEFP--HIIYVGG 202
E L+ +PTA G+S PPTP G L ++ T T ++ ++P P E P ++ + G
Sbjct: 336 EGLSTSVQPTAGEGSSTSVPPTPGGGLSTSVPPTATEELSTSVPPTPGEGPSTSVLPIPG 395
Query: 203 EEFGT 207
E T
Sbjct: 396 EGLST 400
>ref|XP_034014.1| (XM_034014) KIAA1587 protein [Homo sapiens]
Length = 957
Score = 40.4 bits (93), Expect = 0.014
Identities = 22/65 (33%), Positives = 34/65 (51%), Gaps = 2/65 (3%)
Query: 145 EPLAQCGEPTAQAGNSLQTPPTPQGRLLVNIIQTDTGQVEYAIPTDPAEFP--HIIYVGG 202
E L+ +PTA G+S PPTP G L ++ T T ++ ++P P E P ++ + G
Sbjct: 302 EGLSTSVQPTAGEGSSTSVPPTPGGGLSTSVPPTATEELSTSVPPTPGEGPSTSVLPIPG 361
Query: 203 EEFGT 207
E T
Sbjct: 362 EGLST 366
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.320 0.140 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 150,099,548
Number of Sequences: 884779
Number of extensions: 7068811
Number of successful extensions: 14409
Number of sequences better than 5.0e-02: 2
Number of HSP's better than 0.1 without gapping: 2
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 14395
Number of HSP's gapped (non-prelim): 14
length of query: 240
length of database: 277,083,050
effective HSP length: 117
effective length of query: 123
effective length of database: 173,563,907
effective search space: 21348360561
effective search space used: 21348360561
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 89 (38.9 bits)
Query= orf25 [6494-6939]
(444 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_519813.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstoni... 67 3e-010
ref|NP_520042.1| (NC_003295) PUTATIVE TAIL FIBER-RELATED PR... 65 2e-009
ref|NP_518238.1| (NC_003295) CONSERVED HYPOTHETICAL PROTEIN... 50 6e-005
ref|NP_298995.1| (NC_002488) phage-related tail fiber prote... 47 4e-004
ref|NP_488746.1| (NC_003272) unknown protein [Nostoc sp. PC... 43 0.006
ref|NP_105312.1| (NC_002678) microcystin dependent protein ... 41 0.021
>ref|NP_519813.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
emb|CAD15394.1| (AL646065) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
Length = 225
Score = 67.4 bits (163), Expect = 3e-010
Identities = 33/46 (71%), Positives = 35/46 (75%)
Query: 104 WLLANGQAVSRTTYASLFALLGTTFGAGDGSTTFNLPADGAGRPVG 149
WL NG AVSRTTYA LF ++GTTFGAGDGSTTFNLP A P G
Sbjct: 80 WLKGNGAAVSRTTYARLFGVIGTTFGAGDGSTTFNLPELRAEFPRG 125
>ref|NP_520042.1| (NC_003295) PUTATIVE TAIL FIBER-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD15623.1| (AL646067) PUTATIVE TAIL FIBER-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 554
Score = 64.7 bits (156), Expect = 2e-009
Identities = 29/37 (78%), Positives = 33/37 (88%)
Query: 104 WLLANGQAVSRTTYASLFALLGTTFGAGDGSTTFNLP 140
WL ANG AVSRTTYA+L+A +GTTFGAGDG+ TFNLP
Sbjct: 415 WLKANGAAVSRTTYAALYAEIGTTFGAGDGAATFNLP 451
>ref|NP_518238.1| (NC_003295) CONSERVED HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
emb|CAD13645.1| (AL646057) CONSERVED HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
Length = 177
Score = 49.7 bits (117), Expect = 6e-005
Identities = 32/64 (50%), Positives = 37/64 (57%), Gaps = 7/64 (10%)
Query: 101 APQ-WLLANGQAVSRTTYASLFALLGTTFGAGDGSTTFNLPADGAGRPVGQQFVNLGTAT 159
APQ WLL GQ + ++ L+ALLGTT+G GDG TTF LP PVGQ G
Sbjct: 16 APQGWLLCQGQLIDISSNEILYALLGTTYG-GDGRTTFALPNLQGALPVGQ-----GQGP 69
Query: 160 GLPA 163
GL A
Sbjct: 70 GLTA 73
>ref|NP_298995.1| (NC_002488) phage-related tail fiber protein [Xylella fastidiosa
9a5c]
pir||C82649 phage-related tail fiber protein XF1706 [imported] - Xylella
fastidiosa (strain 9a5c)
gb|AAF84515.1|AE003994_14 (AE003994) phage-related tail fiber protein [Xylella fastidiosa
9a5c]
Length = 387
Score = 47.0 bits (110), Expect = 4e-004
Identities = 21/39 (53%), Positives = 28/39 (70%)
Query: 102 PQWLLANGQAVSRTTYASLFALLGTTFGAGDGSTTFNLP 140
P LL +G+AVSR Y LF + T++GAGDG +TFN+P
Sbjct: 151 PGTLLCDGRAVSRAMYPRLFEEINTSYGAGDGVSTFNIP 189
>ref|NP_488746.1| (NC_003272) unknown protein [Nostoc sp. PCC 7120]
dbj|BAB76405.1| (AP003597) ORF_ID:alr4706~unknown protein [Nostoc sp. PCC 7120]
Length = 101
Score = 43.1 bits (100), Expect = 0.006
Identities = 20/37 (54%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 104 WLLANGQAVSRTTYASLFALLGTTFGAGDGSTTFNLP 140
W L NGQ +S Y +++++LGTTFG GDG TF LP
Sbjct: 38 WALCNGQLLSIQEYTAVYSILGTTFG-GDGVQTFALP 73
>ref|NP_105312.1| (NC_002678) microcystin dependent protein MdpB [Mesorhizobium loti]
dbj|BAB51098.1| (AP003004) microcystin dependent protein; MdpB [Mesorhizobium loti]
Length = 169
Score = 41.2 bits (95), Expect = 0.021
Identities = 23/48 (47%), Positives = 31/48 (63%), Gaps = 3/48 (6%)
Query: 104 WLLANGQAVSRTTYASLFALLGTTFGAGDGSTTFNLPADGAGR-PVGQ 150
W+ GQ + + Y +LF L+GTT+G GDG +TF LP D GR P+ Q
Sbjct: 21 WMFCEGQLLPISEYETLFQLIGTTYG-GDGQSTFALP-DLRGRVPIHQ 66
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.315 0.136 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 269,562,679
Number of Sequences: 884779
Number of extensions: 13034613
Number of successful extensions: 19678
Number of sequences better than 5.0e-02: 6
Number of HSP's better than 0.1 without gapping: 3
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 19674
Number of HSP's gapped (non-prelim): 6
length of query: 444
length of database: 277,083,050
effective HSP length: 123
effective length of query: 321
effective length of database: 168,255,233
effective search space: 54009929793
effective search space used: 54009929793
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 92 (40.0 bits)
Query= orf26 [6939-7384]
(444 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_519813.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstoni... 67 5e-010
ref|NP_251770.1| (NC_002516) hypothetical protein [Pseudomo... 60 3e-008
ref|NP_520042.1| (NC_003295) PUTATIVE TAIL FIBER-RELATED PR... 60 4e-008
ref|NP_298995.1| (NC_002488) phage-related tail fiber prote... 55 2e-006
ref|NP_347555.1| (NC_003030) Probably secreted sialidase; s... 53 5e-006
emb|CAC13065.1| (AL445503) conserved hypothetical protein [... 50 5e-005
gb|AAK06388.1| (AF078042) glycosyl hydrolase 5 [Caldicellul... 49 1e-004
pir||T35237 probable secreted cellulase - Streptomyces coel... 47 5e-004
ref|NP_105312.1| (NC_002678) microcystin dependent protein ... 47 5e-004
ref|NP_487884.1| (NC_003272) hypothetical protein [Nostoc s... 45 0.002
ref|NP_197703.1| (NM_122218) photosystem II stability/assem... 42 0.010
pir||UMMS period clock protein - mouse (fragment) >gi|13341... 40 0.048
ref|NP_032862.1| (NM_008836) per-hexamer repeat gene 5 [Mus... 40 0.048
ref|NP_189520.1| (NM_113799) histone-H4-like protein [Arabi... 40 0.048
>ref|NP_519813.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
emb|CAD15394.1| (AL646065) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
Length = 225
Score = 66.6 bits (161), Expect = 5e-010
Identities = 42/96 (43%), Positives = 53/96 (54%), Gaps = 9/96 (9%)
Query: 104 YLRTDGSAVSRTTYADLFAVIGTTYGAGDGSTTFNLPQEGGQVPT-------PTPGETFV 156
+L+ +G+AVSRTTYA LF VIGTT+GAGDGSTTFNLP+ + P G F
Sbjct: 80 WLKGNGAAVSRTTYARLFGVIGTTFGAGDGSTTFNLPELRAEFPRGWDDGRGVDAGRVFG 139
Query: 157 EKQSDGYPRDVRSTAIAGLPDGTILAAFGRFSGTTI 192
Q+ TAI DG+ L +G + T I
Sbjct: 140 SVQAQALSSHQHKTAIGF--DGSNLFGWGDGNATPI 173
>ref|NP_251770.1| (NC_002516) hypothetical protein [Pseudomonas aeruginosa]
pir||D83260 hypothetical protein PA3080 [imported] - Pseudomonas aeruginosa
(strain PAO1)
gb|AAG06468.1|AE004732_2 (AE004732) hypothetical protein [Pseudomonas aeruginosa]
Length = 365
Score = 60.5 bits (145), Expect = 3e-008
Identities = 60/236 (25%), Positives = 97/236 (40%), Gaps = 17/236 (7%)
Query: 195 SGDGGDTWLSTGLGNGLSSYNIPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWSDTG- 253
S D G TW G + + I A++ + + L ++DGG++W D
Sbjct: 138 SDDAGSTWARQFEDLGREAPLLDIWFADEQHGLAV----GAYGALLETRDGGQHWEDVSE 193
Query: 254 -LGNGLTINRGDIAAAPNGDFYYVNNGGAIFRSTDNGANWSQVSGQLPDNTFASIAVDST 312
L N + IAA + V G++FRS D GA W ++ G + F +I
Sbjct: 194 RLDNEDQFHLNAIAAVKDNGLLVVGEAGSLFRSKDWGATWEKLEGPYEGSLFGAIGTADA 253
Query: 313 GAVYVFAEFVTGTRVYRSANNGANWAIAAEEGAESGLPDTLNPVAFGAGRNGDLYIISPD 372
G V V+ ++RSA+ G +W + A L L+ A A +G + ++
Sbjct: 254 GGVLVYG---LRGHLFRSADFGDSWEEIPLKAASGDLEFGLSDGALLA--DGRIVVVG-H 307
Query: 373 NNQGYRSVDNGASWSPVSGLIADFSFAYVTSTPNGSIYAVAGTFSAGGIFASEGSG 428
S D G S+S V S A V++T +G + V GG+ + +G
Sbjct: 308 GGSVLESTDGGRSFS-VFNRPDRLSLAGVSATGDGHLILV----GQGGVHLAAANG 358
>ref|NP_520042.1| (NC_003295) PUTATIVE TAIL FIBER-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD15623.1| (AL646067) PUTATIVE TAIL FIBER-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 554
Score = 60.1 bits (144), Expect = 4e-008
Identities = 27/42 (64%), Positives = 35/42 (83%)
Query: 104 YLRTDGSAVSRTTYADLFAVIGTTYGAGDGSTTFNLPQEGGQ 145
+L+ +G+AVSRTTYA L+A IGTT+GAGDG+ TFNLP G+
Sbjct: 415 WLKANGAAVSRTTYAALYAEIGTTFGAGDGAATFNLPDLRGE 456
>ref|NP_298995.1| (NC_002488) phage-related tail fiber protein [Xylella fastidiosa
9a5c]
pir||C82649 phage-related tail fiber protein XF1706 [imported] - Xylella
fastidiosa (strain 9a5c)
gb|AAF84515.1|AE003994_14 (AE003994) phage-related tail fiber protein [Xylella fastidiosa
9a5c]
Length = 387
Score = 54.7 bits (130), Expect = 2e-006
Identities = 29/59 (49%), Positives = 36/59 (60%)
Query: 82 TAIAAPRVGERLRTYGTAAEPQYLRTDGSAVSRTTYADLFAVIGTTYGAGDGSTTFNLP 140
TA++ G+ + T G A P L DG AVSR Y LF I T+YGAGDG +TFN+P
Sbjct: 131 TALSRYEPGQIVYTAGKRALPGTLLCDGRAVSRAMYPRLFEEINTSYGAGDGVSTFNIP 189
>ref|NP_347555.1| (NC_003030) Probably secreted sialidase; several ASP-boxes and
dockerin domain [Clostridium acetobutylicum]
gb|AAK78895.1|AE007608_1 (AE007608) Probably secreted sialidase; several ASP-boxes and
dockerin domain [Clostridium acetobutylicum]
Length = 839
Score = 53.1 bits (126), Expect = 5e-006
Identities = 59/204 (28%), Positives = 88/204 (42%), Gaps = 18/204 (8%)
Query: 232 RNNNSRPLFRSQDGGENWSDTGLGNGLTINRGDIAAAPNGD--FYYVNNGGAIFRSTDNG 289
+N + S DGG+NW G G +AA + + G STDNG
Sbjct: 526 KNQQDKDCGISYDGGKNWFSAGSNISGVYKAGTVAAGADAKTIVWSPEEGANAAYSTDNG 585
Query: 290 ANWSQVSGQLPDNTFASIAVDSTGAVYVFAEFVTGTRVYRSANNGANWAIAAEEGAESGL 349
W+ SG LP A + D F F+ G + Y S + GA + + +++GL
Sbjct: 586 NKWTPCSG-LPQG--AKVRSDRVNP-KKFYGFLNG-KFYISTDAGATFT----QSSQTGL 636
Query: 350 PDTLNPVAFGA--GRNGDLYIISPDNNQGYRSVDNGASWSPVSGLIADFSFAYVTS-TPN 406
P T F G GD++I + + S D+GA+++ VSG+ A + S T +
Sbjct: 637 P-TKGKGIFKTVIGHEGDIWIAGGKDGL-WHSTDSGATFTKVSGVDASDTVGLGKSKTDD 694
Query: 407 G--SIYAVAGTFSAGGIFASEGSG 428
G +IY A GIF S+ G
Sbjct: 695 GYPAIYMDATIDGTAGIFRSDDEG 718
Score = 44.3 bits (103), Expect = 0.003
Identities = 53/221 (23%), Positives = 78/221 (34%), Gaps = 53/221 (23%)
Query: 195 SGDGGDTWLSTGLGNGLSSYNIPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWSD-TG 253
S DGG W S G N Y +AA ++ + + S D G W+ +G
Sbjct: 536 SYDGGKNWFSAG-SNISGVYKAGTVAAGADAKTIVWSPEEGANAAY-STDNGNKWTPCSG 593
Query: 254 LGNGLTINRGDIAAAP-----NGDFYYVNNGGAIF------------------------- 283
L G + + NG FY + GA F
Sbjct: 594 LPQGAKVRSDRVNPKKFYGFLNGKFYISTDAGATFTQSSQTGLPTKGKGIFKTVIGHEGD 653
Query: 284 -----------RSTDNGANWSQVSGQLPDNTFA---SIAVDSTGAVYVFAEFVTGTRVYR 329
STD+GA +++VSG +T S D A+Y+ A ++R
Sbjct: 654 IWIAGGKDGLWHSTDSGATFTKVSGVDASDTVGLGKSKTDDGYPAIYMDATIDGTAGIFR 713
Query: 330 SANNGANWAIAAEEGAESGLPD---TLNPVAFG---AGRNG 364
S + GA W ++ + G PD T +P +G G NG
Sbjct: 714 SDDEGATWVRINDDAHQYGSPDYCITGDPNKYGRVFVGTNG 754
>emb|CAC13065.1| (AL445503) conserved hypothetical protein [Streptomyces coelicolor]
Length = 384
Score = 50.1 bits (118), Expect = 5e-005
Identities = 60/223 (26%), Positives = 94/223 (41%), Gaps = 42/223 (18%)
Query: 202 WLSTGLGNGLSSYNIPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWSDTGLGN----- 256
W+ G G GL + + A D G++ + ++ +FR+ DGG +W+ + G
Sbjct: 165 WVPGGGGEGLHTV---LTDARDPGSVTVAV---STAGVFRTADGGASWAPSNSGVSAVFL 218
Query: 257 -------GLTINR-GDIAAAPNGDFYYVNNGGAIFRSTDNGANWSQVSGQLPDN-TFASI 307
G +++ AA P D Y+ N ++RS D GA+W+ + LP FA
Sbjct: 219 PDPDPEFGQCVHKVARDAATP--DRLYLQNHWGVYRSDDAGAHWTDIGEGLPSTFGFAVA 276
Query: 308 AVDSTG-AVYVF-----AEFVTG---TRVYRSANNGANWAIAAEEGAESGLP--DTLNPV 356
A G YVF A+ V RV+R+A+ G +W E +GLP D V
Sbjct: 277 AHPHRGDTAYVFPINADADRVPADHRCRVFRTADAGKSW-----EPLSAGLPREDHYGTV 331
Query: 357 AFGAGRNGDL----YIISPDNNQGYRSVDNGASWSPVSGLIAD 395
A D N + + S D+G SW ++ + D
Sbjct: 332 LRDALSTDDAEQAGVYFGNRNGEVFASADDGDSWRQLASHLPD 374
Score = 46.2 bits (108), Expect = 7e-004
Identities = 33/127 (25%), Positives = 56/127 (43%), Gaps = 19/127 (14%)
Query: 192 IRKSGDGGDTWLSTGLGNGL-SSYNIPIMA---ANDSGTILLIDRNNNSRP------LFR 241
+ +S D G W T +G GL S++ + A D+ + I+ + + P +FR
Sbjct: 250 VYRSDDAGAHW--TDIGEGLPSTFGFAVAAHPHRGDTAYVFPINADADRVPADHRCRVFR 307
Query: 242 SQDGGENWSDTGLGNGLTINRGDIAA-------APNGDFYYVNNGGAIFRSTDNGANWSQ 294
+ D G++W G + G + A Y+ N G +F S D+G +W Q
Sbjct: 308 TADAGKSWEPLSAGLPREDHYGTVLRDALSTDDAEQAGVYFGNRNGEVFASADDGDSWRQ 367
Query: 295 VSGQLPD 301
++ LPD
Sbjct: 368 LASHLPD 374
>gb|AAK06388.1| (AF078042) glycosyl hydrolase 5 [Caldicellulosiruptor sp. Tok7B.1]
Length = 996
Score = 48.9 bits (115), Expect = 1e-004
Identities = 53/197 (26%), Positives = 81/197 (40%), Gaps = 16/197 (8%)
Query: 233 NNNSRPLFRSQDGGENWSDTGLGNGLTINRGDIAAAPNGD-FYYVNNGGAIFRSTDNGAN 291
N N+ + S DGG++W T G +AAA +G + G + STDNG
Sbjct: 529 NPNTNRIGFSYDGGKSWFQGNTEPQGTSEGGTVAAAADGSAVVWAPKGAKVCYSTDNGNK 588
Query: 292 WSQVSGQLPDNTFASIAVDSTGAVYVFAEFVTGTRVYRSANNGANWAIAAEEGAESGLPD 351
W + + + S V+ F F G + Y SA+ G + E +GLP
Sbjct: 589 WVECANVPSEAIVYSDRVNPN----KFYAFKNG-KFYISADKGKTFI----ESPAAGLPI 639
Query: 352 TLNPVAFGAGRNGDLYIISPDNNQGYRSVDNGASWSPVSGLIADFSFAY---VTSTPNGS 408
+ N G GD++++ NN + S D G S+ +SG+ S + +
Sbjct: 640 SGN-FKTVPGIEGDIWLVG--NNGMWHSTDGGYSFVKISGVEDAASIGFGKPAEGETYPA 696
Query: 409 IYAVAGTFSAGGIFASE 425
IY A GIF S+
Sbjct: 697 IYTYAKINGVRGIFRSD 713
>pir||T35237 probable secreted cellulase - Streptomyces coelicolor
emb|CAA20642.1| (AL031515) putative secreted cellulase [Streptomyces coelicolor
A3(2)]
Length = 890
Score = 46.6 bits (109), Expect = 5e-004
Identities = 56/223 (25%), Positives = 84/223 (37%), Gaps = 35/223 (15%)
Query: 198 GGDTWLSTGLGNGLSSYNIPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWSDTGL--- 254
G D W TG+ S P GT D + + + RS D G +W L
Sbjct: 94 GWDDWGHTGVVALASDAVDPDRVYAAVGTYTN-DWDPTNGAVLRSADRGASWEKADLPFK 152
Query: 255 --GNGLTINRGD-IAAAPN-GDFYYVN--NGGAIFRSTDNGANWSQVSGQLPDNTFASIA 308
GN G+ +A P+ D Y+ +G ++RSTD G WS+V+ +A
Sbjct: 153 LGGNMPGRGMGERLAVDPHDNDVLYLGAPSGHGLWRSTDAGVTWSEVTAFPNPGNYAQDP 212
Query: 309 VDSTGAVYVFAEFVTGTRVYRSANNGANWAIAAEEGAESGLPDTLNPVAFGAGRNGDLYI 368
D++G Y S N G W + +E G + G
Sbjct: 213 NDTSG--------------YASDNQGITW-VTFDESTGGGAGTATRTLYVG--------- 248
Query: 369 ISPDNNQGYRSVDNGASWSPVSGLIADF-SFAYVTSTPNGSIY 410
++ N YRS D GA+W ++G + + V NG +Y
Sbjct: 249 VADKENAVYRSTDAGATWERLAGQPTGYLAHKGVLDAENGYLY 291
Score = 40.4 bits (93), Expect = 0.036
Identities = 39/167 (23%), Positives = 67/167 (39%), Gaps = 34/167 (20%)
Query: 195 SGDGGDTWL----STGLGNGLSSYNIPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWS 250
S + G TW+ STG G G ++ + + A+ + +RS D G W
Sbjct: 220 SDNQGITWVTFDESTGGGAGTATRTLYVGVADKENAV------------YRSTDAGATWE 267
Query: 251 D-TGLGNGLTINRGDIAAAPNGDFY---------YVNNGGAIFRSTDNGANWSQVSGQLP 300
G G ++G + A NG Y Y G ++R W+ +S
Sbjct: 268 RLAGQPTGYLAHKG-VLDAENGYLYLAYSDTGGPYDGGKGRLYRYATATGTWTDISPAAE 326
Query: 301 DNT---FASIAVD----STGAVYVFAEFVTGTRVYRSANNGANWAIA 340
+T F+ + VD T ++ + T+++RS ++GA W+ A
Sbjct: 327 ADTYYGFSGLTVDRQRPGTVMATAYSSWWPDTQIFRSTDSGATWSQA 373
>ref|NP_105312.1| (NC_002678) microcystin dependent protein MdpB [Mesorhizobium loti]
dbj|BAB51098.1| (AP003004) microcystin dependent protein; MdpB [Mesorhizobium loti]
Length = 169
Score = 46.6 bits (109), Expect = 5e-004
Identities = 26/64 (40%), Positives = 36/64 (55%), Gaps = 1/64 (1%)
Query: 84 IAAPRVGERLRTYGTAAEPQYLRTDGSAVSRTTYADLFAVIGTTYGAGDGSTTFNLPQEG 143
+A P +GE G A ++ +G + + Y LF +IGTTYG GDG +TF LP
Sbjct: 1 MAQPYIGEIRMFAGNFAPAGWMFCEGQLLPISEYETLFQLIGTTYG-GDGQSTFALPDLR 59
Query: 144 GQVP 147
G+VP
Sbjct: 60 GRVP 63
>ref|NP_487884.1| (NC_003272) hypothetical protein [Nostoc sp. PCC 7120]
dbj|BAB75543.1| (AP003594) ORF_ID:alr3844~hypothetical protein [Nostoc sp. PCC
7120]
Length = 339
Score = 44.7 bits (104), Expect = 0.002
Identities = 40/161 (24%), Positives = 62/161 (37%), Gaps = 18/161 (11%)
Query: 235 NSRPLFRSQDGGENWSDTGLGNGLTINRGDIAAAPNGDFYYVNNGGAIFRSTDNGANWSQ 294
+S L + DGG NW L R D + + + V + +TD G +WS+
Sbjct: 62 SSATLLETNDGGNNWQPLNLALDDDRYRFDSVSFAGKEGWIVGEPSLLLHTTDEGRSWSR 121
Query: 295 V--SGQLPDNTFASIAVDSTGAVYVFAEFVTGT-RVYRSANNGANW--AIAAEEGAESGL 349
+ S +LP N IA+ + G AE T +Y++ + G NW + A G L
Sbjct: 122 IPLSEKLPGN---PIAIQALGT--DIAEMATDVGAIYKTTDGGKNWKAQVEAAVGVVRNL 176
Query: 350 PDTLNPVAFGAGRNGDLYII--------SPDNNQGYRSVDN 382
+++ G Y P N R V+N
Sbjct: 177 ERSVDGKYVAVSAKGSFYSTWEAGQNAWVPHNRNSSRRVEN 217
>ref|NP_197703.1| (NM_122218) photosystem II stability/assembly factor HCF136
(sp|O82660) [Arabidopsis thaliana]
sp|O82660|H136_ARATH Photosystem II stability/assembly factor HCF136, chloroplast
precursor
pir||T51828 probable photosystem II stability protein HCF136 [imported] -
Arabidopsis thaliana
emb|CAA75723.1| (Y15628) HCF136 protein [Arabidopsis thaliana]
dbj|BAB09829.1| (AB006708) photosystem II stability/assembly factor HCF136
[Arabidopsis thaliana]
gb|AAK74049.1| (AY045691) AT5g23120/MYJ24_11 [Arabidopsis thaliana]
Length = 403
Score = 42.4 bits (98), Expect = 0.010
Identities = 52/224 (23%), Positives = 83/224 (36%), Gaps = 43/224 (19%)
Query: 239 LFRSQDGGENWSDTGLGNGLTINRGDIAAAPNGDFYY--------------VNNGGAIFR 284
L ++DGG W N I +A DF Y + +
Sbjct: 120 LLETKDGGSTW-----------NPRSIPSAEEEDFNYRFNSISFKGKEGWIIGKPAILLY 168
Query: 285 STDNGANWSQV--SGQLP-DNTFASIAVDSTGAVYVFAEFVTGT-RVYRSANNGANWAIA 340
+ D G NW ++ S QLP D F D + AE VT +Y ++N G NW A
Sbjct: 169 TADAGENWDRIPLSSQLPGDMVFIKATEDKS------AEMVTDEGAIYVTSNRGYNWKAA 222
Query: 341 AEEGAESGLPDTLNPVAFGAG-RNGDLYIISPDNNQGYRSVDNGA----SWSPVSGLIAD 395
+E + L T++ GA G ++ + Y +V + +W P
Sbjct: 223 IQETVSATLNRTVSSGISGASYYTGTFSAVNRSPDGRYVAVSSRGNFFLTWEPGQPYWQP 282
Query: 396 FSFAYVTSTPNGSIYAVAGTF---SAGGIFASEGSGVMPDPTDV 436
+ A N A G + GG++ S+G+G+ + +V
Sbjct: 283 HNRAVARRIQNMGWRADGGLWLLVRGGGLYLSKGTGITEEFEEV 326
>pir||UMMS period clock protein - mouse (fragment)
emb|CAA26710.1| (X02966) unidentified reading frame (first ATG at pos. 210) [Mus
musculus]
prf||1111288A gene cp2.2 [Mus musculus]
Length = 713
Score = 40.0 bits (92), Expect = 0.048
Identities = 74/338 (21%), Positives = 108/338 (31%), Gaps = 33/338 (9%)
Query: 97 GTAAEPQYLRTDGSAVSRTTYADLFAVIGTTYGAGDGSTTFNLPQEGGQVPTPTPGETFV 156
GT + + G+A T D GT G G GS + G T T ET
Sbjct: 391 GTGSGSGTAKVTGTATGTGTGTDTGTGTGTGTGTGTGSGSGTAKVTGTATTTATVTETGT 450
Query: 157 EKQSDGYPRDVRSTAIAGLPDGTILAAFGRFSGT-TIRKSGDGGDTWLSTGLGNGLSSYN 215
K + D + + G GT G +GT T +G G T TG G G +
Sbjct: 451 AKVTG---TDTGTAKVTGTGTGT-----GTGTGTGTGTGTGTGTGTGTGTGTGTGTGT-G 501
Query: 216 IPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWSDTGLGNGLTINRGDIAAAPNGDFYY 275
+ SGT + + + + + G + TG G G G + + +G
Sbjct: 502 TGTGTGSGSGTAKVTGTDTGTAKVTGTGTGTGTGTGTGTGTGTGTGTGTGSGSGSGS--- 558
Query: 276 VNNGGAIFRSTDNGANWSQVSGQLPDNTFASIAVDSTGAVYVFAEFVTGTRVYRSANNGA 335
+G T G SG + V TG TGT + +G+
Sbjct: 559 -GSGSGSGTGTGTGLGSGSGSGTAKVTGTGTAKVTGTG---------TGTGTGTGSGSGS 608
Query: 336 NWAIAAEEGAESGLPDTLNPVAFGAGRNGDLYIISPDNNQGYRSVDNGASWSPVSGLIAD 395
+ G+ SG G+G + S + G +G
Sbjct: 609 GSGSGSGSGSGSG---------SGSGTGTGTGLGSGSGSGSGTGTGTGTGTGTGTGTGTG 659
Query: 396 FSFAYVTSTPNGSIYAV-AGTFSAGGIFASEGSGVMPD 432
S V T G+ A GT + G G+G D
Sbjct: 660 TSTVTVRGTGTGTATATGTGTGTGTGTGTGTGTGTGTD 697
>ref|NP_032862.1| (NM_008836) per-hexamer repeat gene 5 [Mus musculus]
sp|P08399|PHX5_MOUSE PER-HEXAMER REPEAT PROTEIN 5
gb|AAA88320.1| (M12039) ORF starts at 87, first start codon is found at 210.;
putative [Mus musculus]
Length = 672
Score = 40.0 bits (92), Expect = 0.048
Identities = 74/338 (21%), Positives = 108/338 (31%), Gaps = 33/338 (9%)
Query: 97 GTAAEPQYLRTDGSAVSRTTYADLFAVIGTTYGAGDGSTTFNLPQEGGQVPTPTPGETFV 156
GT + + G+A T D GT G G GS + G T T ET
Sbjct: 350 GTGSGSGTAKVTGTATGTGTGTDTGTGTGTGTGTGTGSGSGTAKVTGTATTTATVTETGT 409
Query: 157 EKQSDGYPRDVRSTAIAGLPDGTILAAFGRFSGT-TIRKSGDGGDTWLSTGLGNGLSSYN 215
K + D + + G GT G +GT T +G G T TG G G +
Sbjct: 410 AKVTG---TDTGTAKVTGTGTGT-----GTGTGTGTGTGTGTGTGTGTGTGTGTGTGT-G 460
Query: 216 IPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWSDTGLGNGLTINRGDIAAAPNGDFYY 275
+ SGT + + + + + G + TG G G G + + +G
Sbjct: 461 TGTGTGSGSGTAKVTGTDTGTAKVTGTGTGTGTGTGTGTGTGTGTGTGTGSGSGSGS--- 517
Query: 276 VNNGGAIFRSTDNGANWSQVSGQLPDNTFASIAVDSTGAVYVFAEFVTGTRVYRSANNGA 335
+G T G SG + V TG TGT + +G+
Sbjct: 518 -GSGSGSGTGTGTGLGSGSGSGTAKVTGTGTAKVTGTG---------TGTGTGTGSGSGS 567
Query: 336 NWAIAAEEGAESGLPDTLNPVAFGAGRNGDLYIISPDNNQGYRSVDNGASWSPVSGLIAD 395
+ G+ SG G+G + S + G +G
Sbjct: 568 GSGSGSGSGSGSG---------SGSGTGTGTGLGSGSGSGSGTGTGTGTGTGTGTGTGTG 618
Query: 396 FSFAYVTSTPNGSIYAV-AGTFSAGGIFASEGSGVMPD 432
S V T G+ A GT + G G+G D
Sbjct: 619 TSTVTVRGTGTGTATATGTGTGTGTGTGTGTGTGTGTD 656
>ref|NP_189520.1| (NM_113799) histone-H4-like protein [Arabidopsis thaliana]
dbj|BAB03175.1| (AP002057) gene_id:T19N8.7~unknown protein [Arabidopsis thaliana]
Length = 614
Score = 40.0 bits (92), Expect = 0.048
Identities = 75/298 (25%), Positives = 109/298 (36%), Gaps = 53/298 (17%)
Query: 130 AGDGSTTFNLPQEGGQVPTPTPGETFVEKQSDGYPRDVRSTAIAGLPDGTILAAFGRFSG 189
+G S P +GG T G++ S YP D S + AG P G+ G
Sbjct: 147 SGSPSADTGSPTDGGSYGDTT-GDSGSSAGSPSYPSDDGSGSTAGGPSGSTT------DG 199
Query: 190 TTIRKSGDGGDTWLSTGLGNGLSSYNIPIMAANDSGTILLIDRNNNSRPLFRSQDGGENW 249
+ +S GGD S+ G AA +SG+ D + + + GG
Sbjct: 200 SAGGESSMGGD---SSSAG-----------AAGESGSAATADSGDAA----GADSGGAAG 241
Query: 250 SDTGLGNGLTINRGDIAAAPNGDFYYVNNGGAIFRSTDNGANWSQVSGQLPDNTFASIAV 309
+D+ G + + G AA +GGA T G+ ++ G+ AS A
Sbjct: 242 ADS--GGAASADSGGAAAGETA------SGGAAAADTSGGS--AETGGESASGGAASGAG 291
Query: 310 DSTGAVYVFAEFVTGTRVYRSANNGANWAIAAEEGAESGLPDTLNPVAFGAGRNGDLYII 369
++G A TG +A+ G +AE G ES + GA G
Sbjct: 292 AASG-----ASAKTGGESGEAASGG-----SAETGGES--------ASAGAASGGSAETG 333
Query: 370 SPDNNQGYRSVDNGASWSPVSGLIADFSFAYVTSTPNGSIYAVAGTFSAGGIFASEGS 427
+ G S AS SG + + T + S A +G SA G AS GS
Sbjct: 334 GESGSGGAASGGESASGGATSGGSPETGGSAETGGESASGGAASGGESASGGAASSGS 391
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.314 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 290,306,667
Number of Sequences: 884779
Number of extensions: 14619616
Number of successful extensions: 24257
Number of sequences better than 5.0e-02: 22
Number of HSP's better than 0.1 without gapping: 3
Number of HSP's successfully gapped in prelim test: 19
Number of HSP's that attempted gapping in prelim test: 24149
Number of HSP's gapped (non-prelim): 70
length of query: 444
length of database: 277,083,050
effective HSP length: 123
effective length of query: 321
effective length of database: 168,255,233
effective search space: 54009929793
effective search space used: 54009929793
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 92 (40.0 bits)
Query= orf27 [7391-7566]
(174 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_274047.1| (NC_003112) hypothetical protein [Neisseri... 125 2e-028
ref|NP_108221.1| (NC_002678) hypothetical protein [Mesorhiz... 84 4e-016
ref|NP_539912.1| (NC_003317) SECRETION ACTIVATOR PROTEIN [B... 82 3e-015
ref|NP_108621.1| (NC_002678) hypothetical protein [Mesorhiz... 79 1e-014
ref|NP_540027.1| (NC_003317) SECRETION ACTIVATOR PROTEIN [B... 72 2e-012
ref|NP_103651.1| (NC_002678) hypothetical protein [Mesorhiz... 64 5e-010
ref|NP_437108.1| (NC_003078) putative protein [Sinorhizobiu... 59 2e-008
>ref|NP_274047.1| (NC_003112) hypothetical protein [Neisseria meningitidis MC58]
ref|NP_283994.1| (NC_003116) hypothetical protein NMA1230 [Neisseria meningitidis
Z2491]
pir||E81131 hypothetical protein NMB1012 [imported] - Neisseria meningitidis
(group B strain MD58, group A strain Z2491)
gb|AAF41413.1| (AE002452) hypothetical protein [Neisseria meningitidis MC58]
emb|CAB84490.1| (AL162755) hypothetical protein NMA1230 [Neisseria meningitidis
Z2491]
Length = 167
Score = 125 bits (313), Expect = 2e-028
Identities = 68/171 (39%), Positives = 101/171 (58%), Gaps = 8/171 (4%)
Query: 2 DTFDKAFDRVIGHEGGFQNDQKDRGNWTGGKVGVGENKGTKFGLAAMTYPDLDIKNLTVD 61
D F++ +RV+ HEGG+ N KD GG+ G K T A + ++ +T +
Sbjct: 3 DKFNQFINRVLSHEGGYANHPKD----PGGETNWGITKRT----AQANGYNGSMRAMTRE 54
Query: 62 EAKAIYRRDWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQRASGAHDDGIIGPKTI 121
+A +IYR+ +W+ A+ +Q FDA +NHG NA +MLQRA+G DDG+IG ++
Sbjct: 55 QAISIYRKAFWERYRADQMPEAVAFQFFDACVNHGYGNAARMLQRAAGVPDDGVIGAVSL 114
Query: 122 GAARATDLDDLLMLFLAERLEFFTNIKTFDHYGRGWSRRIAGNLRLAAEDN 172
A + +DLL+ F AERL F+T + TF +G+GW RR+A NL A+ DN
Sbjct: 115 KAINSLPENDLLLRFNAERLVFYTKLGTFTSFGKGWVRRVAQNLIHASADN 165
>ref|NP_108221.1| (NC_002678) hypothetical protein [Mesorhizobium loti]
dbj|BAB53682.1| (AP003013) hypothetical protein [Mesorhizobium loti]
Length = 283
Score = 84.3 bits (207), Expect = 4e-016
Identities = 52/158 (32%), Positives = 78/158 (48%), Gaps = 8/158 (5%)
Query: 10 RVIGHEGGFQNDQKDRGNWTGGKVGVGENKGTKFGLAAMTYPDLDIKNLTVDEAKAIYRR 69
RV+ HEGG+ N D G T V G + G T +K +T DE AIY R
Sbjct: 12 RVLAHEGGYSNHPNDPGGPTMKGVTQRVYDGYRKGKGLATR---SVKGITTDELNAIYDR 68
Query: 70 DWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQR----ASGAHDDGIIGPKTIGAAR 125
+WDA+ + Y +FD A+N G + LQ+ A DG++G T+ A +
Sbjct: 69 QYWDAVRGDDLPAGVDYVVFDGAVNSGPSRSIMWLQQALRPAYTGRIDGVLGMGTLAALK 128
Query: 126 ATDLDDLLMLFLAE-RLEFFTNIKTFDHYGRGWSRRIA 162
A +D L+ + R+ F ++ TF ++GRGW+ R+A
Sbjct: 129 ADKNNDALIDRICNARMAFLKHLSTFGNFGRGWTARVA 166
>ref|NP_539912.1| (NC_003317) SECRETION ACTIVATOR PROTEIN [Brucella melitensis]
gb|AAL52176.1| (AE009540) SECRETION ACTIVATOR PROTEIN [Brucella melitensis]
Length = 252
Score = 81.6 bits (200), Expect = 3e-015
Identities = 47/156 (30%), Positives = 71/156 (45%), Gaps = 4/156 (2%)
Query: 3 TFDKAFDRVIGHEGGFQNDQKDRGNWTGGKVGVGENKGTKFGLAAMTYPDLDIKNLTVDE 62
TF KA V EGG+ + KD GG +G T ++K LT +
Sbjct: 5 TFAKAMPHVFSEEGGYVDHPKD----PGGATNMGITLATLSAWEGRKVSKAEVKALTKTK 60
Query: 63 AKAIYRRDWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQRASGAHDDGIIGPKTIG 122
A IYR ++W+ + + + D AI+ G A KMLQ+ G DG+IG KT+
Sbjct: 61 ATDIYRENYWNKVAGDDLPAGVDHATLDFAIHSGPARALKMLQKVVGVDQDGVIGAKTLA 120
Query: 123 AARATDLDDLLMLFLAERLEFFTNIKTFDHYGRGWS 158
A R + ++ RL + + TF +G+GW+
Sbjct: 121 AVRKMAANRIINELCDARLAWLKGLGTFSTFGKGWT 156
>ref|NP_108621.1| (NC_002678) hypothetical protein [Mesorhizobium loti]
dbj|BAB54407.1| (AP003014) hypothetical protein [Mesorhizobium loti]
Length = 283
Score = 79.3 bits (194), Expect = 1e-014
Identities = 50/158 (31%), Positives = 76/158 (47%), Gaps = 8/158 (5%)
Query: 10 RVIGHEGGFQNDQKDRGNWTGGKVGVGENKGTKFGLAAMTYPDLDIKNLTVDEAKAIYRR 69
RV+ HEGG+ N D G T V G + G T +K++T E IY R
Sbjct: 12 RVLAHEGGYSNHPNDPGGPTMKGVTQRVYDGFRKGKGLATR---SVKSITTAELNEIYDR 68
Query: 70 DWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQR----ASGAHDDGIIGPKTIGAAR 125
+WDA+ + Y +FD A+N G + LQ+ A DG++G T+ A +
Sbjct: 69 QYWDAVQGDALPAGVDYVVFDGAVNSGPGRSIMWLQQALRPAYTGRIDGVLGMGTLAALK 128
Query: 126 ATDLDDLLM-LFLAERLEFFTNIKTFDHYGRGWSRRIA 162
A +D L+ + R+ F ++ TF +GRGW+ R+A
Sbjct: 129 ADKNNDALIDRICSARMAFLKHLSTFGTFGRGWTARVA 166
>ref|NP_540027.1| (NC_003317) SECRETION ACTIVATOR PROTEIN [Brucella melitensis]
gb|AAL52291.1| (AE009550) SECRETION ACTIVATOR PROTEIN [Brucella melitensis]
Length = 253
Score = 72.4 bits (176), Expect = 2e-012
Identities = 42/158 (26%), Positives = 67/158 (41%), Gaps = 4/158 (2%)
Query: 4 FDKAFDRVIGHEGGFQNDQKDRGNWTGGKVGVGENKGTKFGLAAMTYPDLDIKNLTVDEA 63
F + EGG+ ++ D GG +G T D+K LT A
Sbjct: 5 FQTVMPYIFSEEGGYADNPAD----PGGATNMGITINTLSAWEGRQVSPQDVKELTQATA 60
Query: 64 KAIYRRDWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQRASGAHDDGIIGPKTIGA 123
IY+ ++W+ + + Y +FD A+N G A K LQ+ +DGIIG +T+ A
Sbjct: 61 TQIYQVEFWNKIDGNDLPSGVDYALFDFAVNSGPGRAAKTLQKILAMPEDGIIGAQTVAA 120
Query: 124 ARATDLDDLLMLFLAERLEFFTNIKTFDHYGRGWSRRI 161
A A + ++ R + + T +G GW R+
Sbjct: 121 AAARSPEGIINALCDARAAWLRGLSTAATFGNGWLARV 158
>ref|NP_103651.1| (NC_002678) hypothetical protein [Mesorhizobium loti]
dbj|BAB49437.1| (AP002999) hypothetical protein [Mesorhizobium loti]
Length = 409
Score = 64.3 bits (155), Expect = 5e-010
Identities = 46/172 (26%), Positives = 78/172 (44%), Gaps = 10/172 (5%)
Query: 4 FDKAFDRVIGHEGGFQNDQKDRGNWTGGKVGVGENKGTKFGLAAMTYPDLDIKNLTVDEA 63
F++A V EGG+ +D KD GG G + T D+ N++ EA
Sbjct: 49 FERAQGIVREFEGGYSDDPKD----PGGATNFGIARKTLAAWRGKPVSKADVANMSYQEA 104
Query: 64 KAIYRRDWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQRASGAHD-----DGIIGP 118
K I+ ++W G + +++ A++ G+R+A LQ+A + DG IG
Sbjct: 105 KEIFFSEYWSKSSCGAMPGPLALAVYNVAVHAGVRDAATFLQKALNQNGAAVAVDGGIGG 164
Query: 119 KTIGAARATDLDDLLMLFLAERLEFFTNIKTFDHYGRGWSRRIAGNLRLAAE 170
+T+GA L D++ + + ++HY G++RR+ LRL E
Sbjct: 165 ETLGAIPKAPLPDVIGDIIDLYDAKLRAHRDYEHYKNGFNRRV-NKLRLETE 215
>ref|NP_437108.1| (NC_003078) putative protein [Sinorhizobium meliloti]
emb|CAC48968.1| (AL603644) putative protein [Sinorhizobium meliloti]
Length = 91
Score = 58.9 bits (141), Expect = 2e-008
Identities = 25/71 (35%), Positives = 40/71 (56%)
Query: 55 IKNLTVDEAKAIYRRDWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQRASGAHDDG 114
++ ++ +EA IYRR +W G+ P + Y +FD +N G A K LQ+ G +DG
Sbjct: 6 VRAMSREEAADIYRRSYWPQCGVDLLPPGLDYAVFDFGVNSGPARAVKTLQKVVGVREDG 65
Query: 115 IIGPKTIGAAR 125
+G +T+ A R
Sbjct: 66 HVGEQTLAAVR 76
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.321 0.139 5.49e-269
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 99,344,891
Number of Sequences: 884779
Number of extensions: 4043632
Number of successful extensions: 7365
Number of sequences better than 5.0e-02: 13
Number of HSP's better than 0.1 without gapping: 8
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 7345
Number of HSP's gapped (non-prelim): 13
length of query: 174
length of database: 277,083,050
effective HSP length: 112
effective length of query: 62
effective length of database: 177,987,802
effective search space: 11035243724
effective search space used: 11035243724
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 86 (37.7 bits)
Query= orf28 [7565-7764]
(198 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.320 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 87,719,920
Number of Sequences: 884779
Number of extensions: 2728446
Number of successful extensions: 6008
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 6008
Number of HSP's gapped (non-prelim): 0
length of query: 198
length of database: 277,083,050
effective HSP length: 114
effective length of query: 84
effective length of database: 176,218,244
effective search space: 14802332496
effective search space used: 14802332496
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 87 (38.1 bits)
Query= orf29 [7799-7878]
(78 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.313 0.128 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 39,118,827
Number of Sequences: 884779
Number of extensions: 1273635
Number of successful extensions: 2025
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2025
Number of HSP's gapped (non-prelim): 0
length of query: 78
length of database: 277,083,050
effective HSP length: 54
effective length of query: 24
effective length of database: 229,304,984
effective search space: 5503319616
effective search space used: 5503319616
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf30 [7879-8577]
(697 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_496184.1| (NM_063783) polycistic kidney disease prot... 42 0.022
sp|Q09625|YS8A_CAEEL Hypothetical 84.3 kDa protein ZK945.10... 42 0.022
ref|NP_506158.1| (NM_073757) F56H9.1.p [Caenorhabditis eleg... 42 0.028
pir||T29634 hypothetical protein C12D12.1 - Caenorhabditis ... 41 0.048
ref|NP_508603.1| (NM_076202) C12D12.1a.p [Caenorhabditis el... 41 0.048
>ref|NP_496184.1| (NM_063783) polycistic kidney disease protein 1 like with repeated
domains at the N-terminus [Caenorhabditis elegans]
emb|CAB70201.1| (Z48582) N-terminus contains five repeated domains and the
C-terminus shows similarity to the polycistic kidney
disease protein 1 (PIR accession number A44604),
contains similarity to Pfam domain: PF01477 (PLAT/LH2
domain), Score=130.4, E-value=1.1e-35>
emb|CAB70192.1| (Z48544) N-terminus contains five repeated domains and the
C-terminus shows similarity to the polycistic kidney
disease protein 1 (PIR accession number A44604),
contains similarity to Pfam domain: PF01477 (PLAT/LH2
domain), Score=130.4, E-value=1.1e-35>
Length = 3178
Score = 42.0 bits (97), Expect = 0.022
Identities = 55/238 (23%), Positives = 86/238 (36%), Gaps = 27/238 (11%)
Query: 470 ENTWRIFASHDFNETRSGNLGQASLHSLSTSLPDSVNVGTALPTSVTVEYNVT-----NH 524
E+T F S T S + Q S ++ TS P S + T++PT+ T E T ++
Sbjct: 321 ESTSTTFTSTASTSTSSTSTTQQSSSTI-TSSPSSTTLSTSIPTTTTPEITSTLSSLPDN 379
Query: 525 ADIRTLQLQIDGQSFTPPLPTQDGPQS---------YTATVTAISTTAPRNIITSLTGFR 575
A L +FT + T + T+T + ++TT P +T+ T
Sbjct: 380 AICSYLDETTTSTTFTTTMLTSTTTEEPSTSTTTTEVTSTSSTVTTTEPTTTLTTSTAST 439
Query: 576 RNGSALNTVTDSTAVVVPPPSVFWGRSASNNPATVDTATLQSAPRANRMTVGTGQAA--- 632
+ ++ P S S+S+ T T+T +S + TV T A
Sbjct: 440 STTEPSTSTVTTSPSTSPVTSTVTSSSSSSTTVTTPTST-ESTSTSPSSTVTTSTTAPST 498
Query: 633 --AGDYFIVLAPDSQAISKITDTV------LDQDVLSLFTKTEDVRQIAGLEYDAYVI 682
G P S A S ++ T S TK+E G D Y +
Sbjct: 499 STTGPSSSSSTPSSTASSSVSSTASSTQSSTSTQQSSTTTKSETTTSSDGTNPDFYFV 556
>sp|Q09625|YS8A_CAEEL Hypothetical 84.3 kDa protein ZK945.10 in chromosome II
pir||T21460 hypothetical protein ZK945.10 - Caenorhabditis elegans
Length = 796
Score = 42.0 bits (97), Expect = 0.022
Identities = 55/238 (23%), Positives = 86/238 (36%), Gaps = 27/238 (11%)
Query: 470 ENTWRIFASHDFNETRSGNLGQASLHSLSTSLPDSVNVGTALPTSVTVEYNVT-----NH 524
E+T F S T S + Q S ++ TS P S + T++PT+ T E T ++
Sbjct: 321 ESTSTTFTSTASTSTSSTSTTQQSSSTI-TSSPSSTTLSTSIPTTTTPEITSTLSSLPDN 379
Query: 525 ADIRTLQLQIDGQSFTPPLPTQDGPQS---------YTATVTAISTTAPRNIITSLTGFR 575
A L +FT + T + T+T + ++TT P +T+ T
Sbjct: 380 AICSYLDETTTSTTFTTTMLTSTTTEEPSTSTTTTEVTSTSSTVTTTEPTTTLTTSTAST 439
Query: 576 RNGSALNTVTDSTAVVVPPPSVFWGRSASNNPATVDTATLQSAPRANRMTVGTGQAA--- 632
+ ++ P S S+S+ T T+T +S + TV T A
Sbjct: 440 STTEPSTSTVTTSPSTSPVTSTVTSSSSSSTTVTTPTST-ESTSTSPSSTVTTSTTAPST 498
Query: 633 --AGDYFIVLAPDSQAISKITDTV------LDQDVLSLFTKTEDVRQIAGLEYDAYVI 682
G P S A S ++ T S TK+E G D Y +
Sbjct: 499 STTGPSSSSSTPSSTASSSVSSTASSTQSSTSTQQSSTTTKSETTTSSDGTNPDFYFV 556
>ref|NP_506158.1| (NM_073757) F56H9.1.p [Caenorhabditis elegans]
pir||T22808 hypothetical protein F56H9.1 - Caenorhabditis elegans
emb|CAA98949.1| (Z74473) predicted using Genefinder [Caenorhabditis elegans]
Length = 770
Score = 41.6 bits (96), Expect = 0.028
Identities = 30/119 (25%), Positives = 47/119 (39%), Gaps = 2/119 (1%)
Query: 498 STSLPDSVNVGTALPTSVTVEYNVTNHADIRTLQLQIDGQSFTPPLPTQDGPQSYTATV- 556
+T++P S T+ T+ TV T+ A T+ + T +P + T TV
Sbjct: 35 TTTVPVSTTTSTSSTTTTTVPSTTTSSATTTTVPSTTTSSTTTTTVPPTTTSTTTTTTVP 94
Query: 557 -TAISTTAPRNIITSLTGFRRNGSALNTVTDSTAVVVPPPSVFWGRSASNNPATVDTAT 614
T STT + + T + T T ST PP+ + + P T T+T
Sbjct: 95 PTTTSTTTTTTVPPTTTSTTTTTTVPPTTTSSTTTTTVPPTTTSSTTTTTVPPTTTTST 153
>pir||T29634 hypothetical protein C12D12.1 - Caenorhabditis elegans
Length = 825
Score = 40.8 bits (94), Expect = 0.048
Identities = 44/163 (26%), Positives = 64/163 (38%), Gaps = 14/163 (8%)
Query: 498 STSLPDSVNVGTALPTSVTVEYNVTNHADIRTLQLQIDGQSFTPPLPTQDGPQSYTATVT 557
+T +P S V T PT+VTV T+ A + T + P PT+ P S +TV
Sbjct: 640 TTVVPSSATVPTTPPTTVTVAATTTSKAPVVT-----TSPTLAPTSPTK-LPTSPPSTV- 692
Query: 558 AISTTAPRNIITSLTGFRRNGSA-------LNTVTDSTAVVVPPPSVFWGRSASNNPATV 610
S TAP N+ T T S+ +N + +TA VPP + + ++ T
Sbjct: 693 GTSPTAPANLTTPTTAPVNPTSSTTAPTAPVNPTSPTTAPTVPPVTTTTPTTTTSTTTTT 752
Query: 611 DTATLQSAPRANRMTVGTGQAAAGDYFIVLAPDSQAISKITDT 653
T T + V T V P SQ ++ T
Sbjct: 753 TTTTTTTQTTPTTPVVTTPSTITPTTRPVTQPASQPATQPATT 795
>ref|NP_508603.1| (NM_076202) C12D12.1a.p [Caenorhabditis elegans]
gb|AAA96080.2| (U51998) Hypothetical protein C12D12.1a [Caenorhabditis elegans]
Length = 769
Score = 40.8 bits (94), Expect = 0.048
Identities = 44/163 (26%), Positives = 64/163 (38%), Gaps = 14/163 (8%)
Query: 498 STSLPDSVNVGTALPTSVTVEYNVTNHADIRTLQLQIDGQSFTPPLPTQDGPQSYTATVT 557
+T +P S V T PT+VTV T+ A + T + P PT+ P S +TV
Sbjct: 584 TTVVPSSATVPTTPPTTVTVAATTTSKAPVVT-----TSPTLAPTSPTK-LPTSPPSTV- 636
Query: 558 AISTTAPRNIITSLTGFRRNGSA-------LNTVTDSTAVVVPPPSVFWGRSASNNPATV 610
S TAP N+ T T S+ +N + +TA VPP + + ++ T
Sbjct: 637 GTSPTAPANLTTPTTAPVNPTSSTTAPTAPVNPTSPTTAPTVPPVTTTTPTTTTSTTTTT 696
Query: 611 DTATLQSAPRANRMTVGTGQAAAGDYFIVLAPDSQAISKITDT 653
T T + V T V P SQ ++ T
Sbjct: 697 TTTTTTTQTTPTTPVVTTPSTITPTTRPVTQPASQPATQPATT 739
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.315 0.132 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 378,287,469
Number of Sequences: 884779
Number of extensions: 16344170
Number of successful extensions: 39022
Number of sequences better than 5.0e-02: 5
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 38976
Number of HSP's gapped (non-prelim): 24
length of query: 697
length of database: 277,083,050
effective HSP length: 127
effective length of query: 570
effective length of database: 164,716,117
effective search space: 93888186690
effective search space used: 93888186690
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 94 (40.8 bits)
Query= orf31 [8578-9669]
(1090 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.315 0.132 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 594,227,895
Number of Sequences: 884779
Number of extensions: 26169864
Number of successful extensions: 43435
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 43435
Number of HSP's gapped (non-prelim): 0
length of query: 1090
length of database: 277,083,050
effective HSP length: 131
effective length of query: 959
effective length of database: 161,177,001
effective search space: 154568743959
effective search space used: 154568743959
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 96 (41.6 bits)
Query= orf32 [9748-9906]
(157 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.334 0.148 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 55,526,976
Number of Sequences: 884779
Number of extensions: 1426651
Number of successful extensions: 3170
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3170
Number of HSP's gapped (non-prelim): 0
length of query: 157
length of database: 277,083,050
effective HSP length: 110
effective length of query: 47
effective length of database: 179,757,360
effective search space: 8448595920
effective search space used: 8448595920
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.6 bits)
S2: 85 (37.4 bits)
Query= orf33 [9910-10000]
(89 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.321 0.135 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 32,160,522
Number of Sequences: 884779
Number of extensions: 824397
Number of successful extensions: 2163
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2163
Number of HSP's gapped (non-prelim): 0
length of query: 89
length of database: 277,083,050
effective HSP length: 65
effective length of query: 24
effective length of database: 219,572,415
effective search space: 5269737960
effective search space used: 5269737960
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf34 [10005-10089]
(83 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.320 0.136 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 37,086,067
Number of Sequences: 884779
Number of extensions: 880080
Number of successful extensions: 1874
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 1874
Number of HSP's gapped (non-prelim): 0
length of query: 83
length of database: 277,083,050
effective HSP length: 59
effective length of query: 24
effective length of database: 224,881,089
effective search space: 5397146136
effective search space used: 5397146136
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf35 [10080-10160]
(79 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.311 0.130 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 34,827,107
Number of Sequences: 884779
Number of extensions: 815998
Number of successful extensions: 1227
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 1227
Number of HSP's gapped (non-prelim): 0
length of query: 79
length of database: 277,083,050
effective HSP length: 55
effective length of query: 24
effective length of database: 228,420,205
effective search space: 5482084920
effective search space used: 5482084920
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf36 [10150-10339]
(188 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_059586.1| (NC_002371) eac [Enterobacteria phage P22]... 39 0.026
>ref|NP_059586.1| (NC_002371) eac [Enterobacteria phage P22]
sp|Q03545|VEAC_BPP22 Eac protein
pir||S35280 eaC protein - phage P22
gb|AAC18880.1| (L06296) EaC [Enterobacteria phage P22]
gb|AAF75004.1| (AF217253) eac [Enterobacteria phage P22]
Length = 211
Score = 38.9 bits (89), Expect = 0.026
Identities = 33/166 (19%), Positives = 72/166 (42%), Gaps = 11/166 (6%)
Query: 2 FKYFQVNPDVGHYLVDKYNSTVNAERERIVSHFLSAVGAVGIVMVREW-GMPSRIEALVF 60
+ Y+ V + L+ Y+ T+N +R I++ VGA+ R W G +++ V+
Sbjct: 8 YDYYMVEGEDVKELIQSYD-TINDQRNSILTTAAEKVGAIAWTTARSWGGEGGLLQSFVW 66
Query: 61 PATHEICNMEGVKLTEHRQGQVVTF---DDNAPFAGHYYEAISGL----NAELVAYPDFS 113
+E +K + G+ V N Y + + + NA+L + P+++
Sbjct: 67 EKGYEFPCQITIKREDFLDGKRVVIARGKGNTKEGRAYNKELDAIMHNANAKLKSLPEWN 126
Query: 114 DWLHHTMGVTRYALAEKDDEAGTYKKIATH--SHLMADGRILFAVP 157
++ + G+ R + + + ++T+ H D ++FA+P
Sbjct: 127 YYITNHYGIMRTGIGGQSGRGFGFVMLSTYGGKHPQRDDCLIFAIP 172
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.319 0.134 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 103,847,221
Number of Sequences: 884779
Number of extensions: 3974563
Number of successful extensions: 7256
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 7256
Number of HSP's gapped (non-prelim): 1
length of query: 188
length of database: 277,083,050
effective HSP length: 113
effective length of query: 75
effective length of database: 177,103,023
effective search space: 13282726725
effective search space used: 13282726725
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 87 (38.1 bits)
Query= orf37 [10337-10513]
(175 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.318 0.137 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 96,295,591
Number of Sequences: 884779
Number of extensions: 3505451
Number of successful extensions: 6196
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 6196
Number of HSP's gapped (non-prelim): 0
length of query: 175
length of database: 277,083,050
effective HSP length: 112
effective length of query: 63
effective length of database: 177,987,802
effective search space: 11213231526
effective search space used: 11213231526
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 86 (37.7 bits)
Query= orf38 [10532-10595]
(62 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
sp|P06225|DPOL_BPSP2 DNA POLYMERASE >gi|67052|pir||DJBPS2 D... 51 3e-006
ref|NP_297973.1| (NC_002488) phage-related DNA polymerase [... 41 0.003
ref|NP_299570.1| (NC_002488) phage-related protein [Xylella... 41 0.003
ref|NP_299803.1| (NC_002488) phage-related DNA polymerase [... 41 0.003
ref|NP_051006.1| (NC_000935) P45 [Acyrthosiphon pisum bacte... 40 0.004
>sp|P06225|DPOL_BPSP2 DNA POLYMERASE
pir||DJBPS2 DNA-directed DNA polymerase (EC 2.7.7.7) - phage SPO2
gb|AAA32600.1| (K02752) DNA polymerase (gene L; ttg start codon) [Bacteriophage
SPO2]
emb|CAA25691.1| (X01458) SP02 DNA polymerase (aa 1-648) [Bacteriophage SPO2]
Length = 648
Score = 50.8 bits (120), Expect = 3e-006
Identities = 22/62 (35%), Positives = 35/62 (55%)
Query: 1 MKKCENNGYPVVMHTHDELIAEVPEDFGSVEEMAALMTEREPWRAWWPIRAAGWQDDRYQ 60
+ + +N GY VMH HDE + +VP ++++ A+M E W P+ A G+ D Y+
Sbjct: 587 LMRLDNAGYKTVMHVHDEAVLDVPRGKNELDKVEAIMGEPISWAKGLPLTADGFVTDYYK 646
Query: 61 KD 62
KD
Sbjct: 647 KD 648
>ref|NP_297973.1| (NC_002488) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
pir||H82774 phage-related DNA polymerase XF0683 [imported] - Xylella fastidiosa
(strain 9a5c)
gb|AAF83493.1|AE003912_5 (AE003912) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
Length = 726
Score = 40.8 bits (94), Expect = 0.003
Identities = 22/62 (35%), Positives = 31/62 (49%), Gaps = 1/62 (1%)
Query: 1 MKKCENNGYPVVMHTHDELIAEVPED-FGSVEEMAALMTEREPWRAWWPIRAAGWQDDRY 59
M E GY +V+ HDE+I E + + +AALM PW P+ A G+ RY
Sbjct: 664 MPAIEAAGYSIVLTVHDEIITEADDHPCFNAAHLAALMATPPPWAEGLPLAAEGFDTYRY 723
Query: 60 QK 61
+K
Sbjct: 724 RK 725
>ref|NP_299570.1| (NC_002488) phage-related protein [Xylella fastidiosa 9a5c]
pir||H82574 phage-related protein XF2291 [imported] - Xylella fastidiosa
(strain 9a5c)
gb|AAF85090.1|AE004041_2 (AE004041) phage-related protein [Xylella fastidiosa 9a5c]
Length = 252
Score = 40.8 bits (94), Expect = 0.003
Identities = 22/62 (35%), Positives = 31/62 (49%), Gaps = 1/62 (1%)
Query: 1 MKKCENNGYPVVMHTHDELIAEVPED-FGSVEEMAALMTEREPWRAWWPIRAAGWQDDRY 59
M E GY +V+ HDE+I E + + +AALM PW P+ A G+ RY
Sbjct: 190 MPAIEAAGYSIVLTVHDEIITEADDHPCFNAAHLAALMATPPPWAEGLPLAAEGFDTYRY 249
Query: 60 QK 61
+K
Sbjct: 250 RK 251
>ref|NP_299803.1| (NC_002488) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
pir||C82548 phage-related DNA polymerase XF2525 [imported] - Xylella fastidiosa
(strain 9a5c)
gb|AAF85323.1|AE004059_13 (AE004059) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
Length = 726
Score = 40.8 bits (94), Expect = 0.003
Identities = 22/62 (35%), Positives = 31/62 (49%), Gaps = 1/62 (1%)
Query: 1 MKKCENNGYPVVMHTHDELIAEVPED-FGSVEEMAALMTEREPWRAWWPIRAAGWQDDRY 59
M E GY +V+ HDE+I E + + +AALM PW P+ A G+ RY
Sbjct: 664 MPAIEAAGYSIVLTVHDEIITEADDHPCFNAAHLAALMATPPPWAEGLPLAAEGFDTYRY 723
Query: 60 QK 61
+K
Sbjct: 724 RK 725
>ref|NP_051006.1| (NC_000935) P45 [Acyrthosiphon pisum bacteriophage APSE-1]
sp|Q9T1Q3|DPOL_BPAPS Probable DNA polymerase (P45)
gb|AAF03988.1|AF157835_45 (AF157835) P45 [Acyrthosiphon pisum bacteriophage APSE-1]
Length = 993
Score = 40.4 bits (93), Expect = 0.004
Identities = 21/63 (33%), Positives = 35/63 (55%), Gaps = 1/63 (1%)
Query: 1 MKKCENNGYPVVMHTHDELIAEVPE-DFGSVEEMAALMTEREPWRAWWPIRAAGWQDDRY 59
M E GY +V+ HDE+I+E P+ S E ++ L++ W P+ A G++ RY
Sbjct: 931 MPPIEKAGYEIVLTVHDEIISEAPDTPQFSAEGLSKLLSFNSDWAWDLPLSANGFETYRY 990
Query: 60 QKD 62
+K+
Sbjct: 991 RKE 993
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.317 0.133 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 37,384,509
Number of Sequences: 884779
Number of extensions: 1135657
Number of successful extensions: 3794
Number of sequences better than 5.0e-02: 5
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 3790
Number of HSP's gapped (non-prelim): 5
length of query: 62
length of database: 277,083,050
effective HSP length: 38
effective length of query: 24
effective length of database: 243,461,448
effective search space: 5843074752
effective search space used: 5843074752
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 84 (37.0 bits)
Query= orf39 [10593-10898]
(304 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
sp|P06225|DPOL_BPSP2 DNA POLYMERASE >gi|67052|pir||DJBPS2 D... 126 2e-028
ref|NP_051006.1| (NC_000935) P45 [Acyrthosiphon pisum bacte... 80 2e-014
ref|NP_299803.1| (NC_002488) phage-related DNA polymerase [... 66 5e-010
ref|NP_297973.1| (NC_002488) phage-related DNA polymerase [... 65 6e-010
ref|NP_299570.1| (NC_002488) phage-related protein [Xylella... 52 9e-006
sp|Q9S1G2|DPO1_RHILE DNA polymerase I (POL I) >gi|5596366|g... 44 0.002
ref|NP_540742.1| (NC_003317) DNA POLYMERASE I [Brucella mel... 43 0.003
ref|NP_254180.1| (NC_002516) DNA polymerase I [Pseudomonas ... 43 0.003
ref|NP_384281.1| (NC_003047) PROBABLE DNA POLYMERASE I PROT... 43 0.004
ref|NP_353146.1| (NC_003062) AGR_C_169p [Agrobacterium tume... 42 0.010
>sp|P06225|DPOL_BPSP2 DNA POLYMERASE
pir||DJBPS2 DNA-directed DNA polymerase (EC 2.7.7.7) - phage SPO2
gb|AAA32600.1| (K02752) DNA polymerase (gene L; ttg start codon) [Bacteriophage
SPO2]
emb|CAA25691.1| (X01458) SP02 DNA polymerase (aa 1-648) [Bacteriophage SPO2]
Length = 648
Score = 126 bits (317), Expect = 2e-028
Identities = 96/266 (36%), Positives = 135/266 (50%), Gaps = 35/266 (13%)
Query: 18 NKDWTVDAVRWALKDLTHRNLDLIIAIWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIE 77
NK +D R LK + ++L+ +G +LS +R FI + G +F DFSAIE
Sbjct: 335 NKIEDLDTARNLLKGGHYEAIELL---YGQVPFVLSQLVRTAFIPSEGNEFYVSDFSAIE 391
Query: 78 AVVLAALARCEWRLEVFRTHGKIYEMSASKISGVPFEEMMAYKEQNKHDHPLRKSLGKVA 137
A V+A LA EWRLEVF THGKIYE SA+++ VP E + PLR+ GKVA
Sbjct: 392 ARVIAWLAGEEWRLEVFNTHGKIYEASAAQMFKVPVESI-------TKGSPLRQK-GKVA 443
Query: 138 ELASGYGGWLGAWKAFGAEAHFADDDEMKKAILGWRDASPEIVEFWGGQYRRDPFTRKWT 197
ELA GY G GA GA + E+ + + WR A+ +IV+FW
Sbjct: 444 ELALGYQGGKGALIQMGALNMGLAEGELPELVKAWRTANKKIVKFW-------------- 489
Query: 198 KEYFGLEGAAIMAILNPGKCYAVHDITYAVHDDVLYCRLPSGRFLQYHQPRLVESDSWKG 257
+ +E AAI A+ H +T+ +L+ +LPSGR L Y +P+L E D G
Sbjct: 490 ---YDVEAAAIKAVKERKPVKLQHGLTFLYESGILFVQLPSGRRLAYAKPKL-ELDERFG 545
Query: 258 PEYQITFMGYNTNSQKGPVGWVRMDT 283
E +T+ G + + W R++T
Sbjct: 546 KE-ALTYEGKLESGK-----WGRLNT 565
>ref|NP_051006.1| (NC_000935) P45 [Acyrthosiphon pisum bacteriophage APSE-1]
sp|Q9T1Q3|DPOL_BPAPS Probable DNA polymerase (P45)
gb|AAF03988.1|AF157835_45 (AF157835) P45 [Acyrthosiphon pisum bacteriophage APSE-1]
Length = 993
Score = 80.1 bits (196), Expect = 2e-014
Identities = 81/268 (30%), Positives = 117/268 (43%), Gaps = 51/268 (19%)
Query: 44 IWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFR--THGK-- 99
I D + L S LRG IA PGKK V D S IE +LA LA W++ F +GK
Sbjct: 359 ICDDIMQLTSSALRGCIIAPPGKKLVISDLSNIEGRMLAWLAGENWKVNAFSEFDNGKGD 418
Query: 100 -IYEMSASKISGVPFEEMMAYKEQNKHDHPLRKSLGKVAELASGYGGWLGAWKAFGAEAH 158
+Y+++ ++ + E++ KEQ + +GKV EL GYGG + A+ F A A+
Sbjct: 419 DLYKLAYARAFNLLPEDVT--KEQ--------RQIGKVMELGLGYGGGVAAFLTF-ALAY 467
Query: 159 FADDDEM------------KKAILGWRDASPEIVEFWGGQYR----RDPFTRKWTKE--- 199
D DE+ K+ + W S E + +G + D R W
Sbjct: 468 GLDLDELAEAALPNIPPGVKREAMRWYQKSVETDKTYGLSEKIFVTCDSLKRMWRNAHPQ 527
Query: 200 ----YFGLEGAAIMAILNPGKCYAVHDITYAVHDDVLYCRLPSGRFLQYHQPRLVESDSW 255
++ +E A AI +PG + ++ L LPSGR L Y R+
Sbjct: 528 TVSFWYDIEDAVKQAIQSPGIAFKCRKLSVRRDKSWLRICLPSGRSLCYPSARIENG--- 584
Query: 256 KGPEYQITFMGYNTNSQKGPVGWVRMDT 283
QIT+MG N S+K W R+ T
Sbjct: 585 -----QITYMGTNPYSRK----WERLKT 603
>ref|NP_299803.1| (NC_002488) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
pir||C82548 phage-related DNA polymerase XF2525 [imported] - Xylella fastidiosa
(strain 9a5c)
gb|AAF85323.1|AE004059_13 (AE004059) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
Length = 726
Score = 65.9 bits (159), Expect = 5e-010
Identities = 78/296 (26%), Positives = 113/296 (37%), Gaps = 62/296 (20%)
Query: 39 DLIIAIWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFR--- 95
D + ++ D + L S LR IA KK V D S IE VLA LA +L FR
Sbjct: 358 DCVDLVFDDVMALTSSALRSCLIAPTHKKLVVADLSNIEGRVLAWLAGETPKLHAFREFD 417
Query: 96 ---------------THGKIYEMSASKISGVPFEEMMA----YKEQNKHDHPL------- 129
THG + + + YK H +
Sbjct: 418 TCQGVDGCWHTGAAITHGALRAAPLALQRNADHAPIRRGDDIYKRAYAHSFGIAPQAVSK 477
Query: 130 -RKSLGKVAELASGYGGWLGAWKAFGAEAHFADDDEMKKAILG--WRDASPEIVEFWGGQ 186
++ +GKV ELA GYGG +GA+ AF A H + +A+ + E++E+
Sbjct: 478 EQRQIGKVQELALGYGGGVGAFAAFAALYHIDLEAMAAQAVFPPLLLQEATELLEWTKAN 537
Query: 187 YRR------------DPFTRKWTKE-------YFGLEGAAIMAILNPGKCYAVHDITYAV 227
+R D F R W + L+ A AI +P + IT
Sbjct: 538 HRPTFDLSDRAWLACDVFKRAWRNAHPAIAAFWQALQFAVTDAIHHPETTHTCCGITVQY 597
Query: 228 HDDVLYCRLPSGRFLQYHQPRLVESDSWKGPEYQITFMGYNTNSQKGPVGWVRMDT 283
L RLPSGR L Y PR+ E + +++MG + ++K W R+ T
Sbjct: 598 RRAWLRLRLPSGRVLYYAAPRVDEHGA-------LSYMGTHPITRK----WTRLTT 642
>ref|NP_297973.1| (NC_002488) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
pir||H82774 phage-related DNA polymerase XF0683 [imported] - Xylella fastidiosa
(strain 9a5c)
gb|AAF83493.1|AE003912_5 (AE003912) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
Length = 726
Score = 65.5 bits (158), Expect = 6e-010
Identities = 78/291 (26%), Positives = 111/291 (37%), Gaps = 62/291 (21%)
Query: 44 IWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFR-------- 95
++ D + L S LR IA KK V D S IE VLA LA +L FR
Sbjct: 363 VFDDVMALTSSALRSCLIAPKNKKLVVADLSNIEGRVLAWLAGETPKLHAFREFDTCQGV 422
Query: 96 ----------THGKIYEMSASKISGVPFEEMMA----YKEQNKHDHPL--------RKSL 133
THG + + + YK H + ++ +
Sbjct: 423 DGTWHTGAAITHGALRGAPIALQRNADHAPIRRGDDIYKRAYAHSFGIAPQAVSKEQRQI 482
Query: 134 GKVAELASGYGGWLGAWKAFGAEAHFADDDEMKKAILG--WRDASPEIVEFWGGQYRR-- 189
GKV ELA GYGG +GA+ AF A H + +A L + E++E+ +R
Sbjct: 483 GKVQELALGYGGGVGAFAAFAAMYHIDLEAMAAQAALPPLLLQQATELLEWTKANHRPTF 542
Query: 190 ----------DPFTRKWTKE-------YFGLEGAAIMAILNPGKCYAVHDITYAVHDDVL 232
D F R W + L+ A AI +P + IT L
Sbjct: 543 DLSDRAWLACDVFKRAWRNAHPAIAAFWQALQFAVTDAIHHPETAHTCCGITVQYRRAWL 602
Query: 233 YCRLPSGRFLQYHQPRLVESDSWKGPEYQITFMGYNTNSQKGPVGWVRMDT 283
RLPSGR L Y PR+ E + +++MG + ++K W R+ T
Sbjct: 603 RLRLPSGRVLYYAAPRVDEHGA-------LSYMGTHPITRK----WTRLTT 642
>ref|NP_299570.1| (NC_002488) phage-related protein [Xylella fastidiosa 9a5c]
pir||H82574 phage-related protein XF2291 [imported] - Xylella fastidiosa
(strain 9a5c)
gb|AAF85090.1|AE004041_2 (AE004041) phage-related protein [Xylella fastidiosa 9a5c]
Length = 252
Score = 51.6 bits (122), Expect = 9e-006
Identities = 49/175 (28%), Positives = 75/175 (42%), Gaps = 32/175 (18%)
Query: 130 RKSLGKVAELASGYGGWLGAWKAFGAEAHFADDDEMKKAILG--WRDASPEIVEFWGGQY 187
++ +GKV ELA GYGG +GA+ AF A H + +A+ + E++E+ +
Sbjct: 5 QRQIGKVQELALGYGGGVGAFAAFAALYHIDLEAMAAQAVFPPLLLQEATELLEWTKANH 64
Query: 188 RR------------DPFTRKWTKE-------YFGLEGAAIMAILNPGKCYAVHDITYAVH 228
R D F R W + L+ A AI +P + IT
Sbjct: 65 RPTFDLSDRAWLACDVFKRAWRNAHPAIAAFWQALQFAVTDAIHHPETTHTCCGITVQYR 124
Query: 229 DDVLYCRLPSGRFLQYHQPRLVESDSWKGPEYQITFMGYNTNSQKGPVGWVRMDT 283
L RLPSGR L Y PR+ E + +++MG + ++K W R+ T
Sbjct: 125 RAWLRLRLPSGRVLYYAAPRVDEHGA-------LSYMGTHPITRK----WTRLTT 168
>sp|Q9S1G2|DPO1_RHILE DNA polymerase I (POL I)
gb|AAD45559.1|U86403_1 (U86403) DNA polymerase I [Rhizobium leguminosarum]
Length = 1016
Score = 43.9 bits (102), Expect = 0.002
Identities = 24/61 (39%), Positives = 34/61 (55%)
Query: 56 LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFRTHGKIYEMSASKISGVPFEE 115
+R FI+ PG K + D+S IE VLA +A + F I+ M+AS++ GVP E
Sbjct: 771 IRTAFISTPGHKLISADYSQIELRVLAHVAEIPQLTKAFEDGVDIHAMTASEMFGVPVEG 830
Query: 116 M 116
M
Sbjct: 831 M 831
>ref|NP_540742.1| (NC_003317) DNA POLYMERASE I [Brucella melitensis]
gb|AAL53006.1| (AE009615) DNA POLYMERASE I [Brucella melitensis]
Length = 994
Score = 43.1 bits (100), Expect = 0.003
Identities = 24/61 (39%), Positives = 34/61 (55%)
Query: 56 LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFRTHGKIYEMSASKISGVPFEE 115
+R FIA PG K + D+S IE +LA +A + F I+ M+AS++ GVP E
Sbjct: 751 IRTAFIAEPGNKLISADYSQIELRILAHVADIAQLKQAFADGIDIHAMTASEMFGVPVEG 810
Query: 116 M 116
M
Sbjct: 811 M 811
>ref|NP_254180.1| (NC_002516) DNA polymerase I [Pseudomonas aeruginosa]
pir||F82958 DNA polymerase I PA5493 [imported] - Pseudomonas aeruginosa
(strain PAO1)
gb|AAG08878.1|AE004962_2 (AE004962) DNA polymerase I [Pseudomonas aeruginosa]
Length = 913
Score = 43.1 bits (100), Expect = 0.003
Identities = 20/61 (32%), Positives = 37/61 (59%)
Query: 56 LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFRTHGKIYEMSASKISGVPFEE 115
+R F+A G K + D+S IE ++A LA+ + L+ FR ++ +A+++ GVP E+
Sbjct: 674 IRQAFVAPQGYKLLAADYSQIELRIMAHLAKDDGLLDAFRHDLDVHRATAAEVFGVPLED 733
Query: 116 M 116
+
Sbjct: 734 V 734
>ref|NP_384281.1| (NC_003047) PROBABLE DNA POLYMERASE I PROTEIN [Sinorhizobium
meliloti]
emb|CAC41562.1| (AL591782) PROBABLE DNA POLYMERASE I PROTEIN [Sinorhizobium
meliloti]
Length = 1004
Score = 42.7 bits (99), Expect = 0.004
Identities = 24/61 (39%), Positives = 34/61 (55%)
Query: 56 LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFRTHGKIYEMSASKISGVPFEE 115
+R F+A PG K V D+S IE VLA +A + F I+ M+AS++ GVP +
Sbjct: 761 IRTAFVATPGHKLVSADYSQIELRVLAHVADIPQLRQAFADGVDIHAMTASEMFGVPVDG 820
Query: 116 M 116
M
Sbjct: 821 M 821
>ref|NP_353146.1| (NC_003062) AGR_C_169p [Agrobacterium tumefaciens] [Agrobacterium
tumefaciens str. C58 (Cereon)]
ref|NP_530819.1| (NC_003304) DNA polymerase I [Agrobacterium tumefaciens str. C58
(U. Washington)]
gb|AAK85931.1| (AE007952) AGR_C_169p [Agrobacterium tumefaciens str. C58 (Cereon)]
gb|AAL41135.1| (AE008985) DNA polymerase I [Agrobacterium tumefaciens str. C58 (U.
Washington)]
Length = 1031
Score = 41.6 bits (96), Expect = 0.010
Identities = 24/61 (39%), Positives = 33/61 (53%)
Query: 56 LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFRTHGKIYEMSASKISGVPFEE 115
+R FI+ PG K + D+S IE VLA +A F I+ M+AS++ GVP E
Sbjct: 788 IRTAFISTPGHKLLSADYSQIELRVLAHVADIPQLRNAFENGIDIHAMTASEMFGVPVEG 847
Query: 116 M 116
M
Sbjct: 848 M 848
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.322 0.138 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 189,865,443
Number of Sequences: 884779
Number of extensions: 7915968
Number of successful extensions: 14743
Number of sequences better than 5.0e-02: 10
Number of HSP's better than 0.1 without gapping: 7
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 14723
Number of HSP's gapped (non-prelim): 12
length of query: 304
length of database: 277,083,050
effective HSP length: 119
effective length of query: 185
effective length of database: 171,794,349
effective search space: 31781954565
effective search space used: 31781954565
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 90 (39.3 bits)
Query= orf40 [10968-11308]
(339 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
sp|P06225|DPOL_BPSP2 DNA POLYMERASE >gi|67052|pir||DJBPS2 D... 87 3e-016
ref|NP_051006.1| (NC_000935) P45 [Acyrthosiphon pisum bacte... 74 2e-012
ref|NP_297973.1| (NC_002488) phage-related DNA polymerase [... 72 1e-011
ref|NP_299803.1| (NC_002488) phage-related DNA polymerase [... 72 1e-011
gb|AAA23103.1| (M94320) putative DNA polymerase [Citrus gre... 60 3e-008
gb|AAB25243.1| DNA polymerase homolog [bacterium-like organ... 60 3e-008
ref|NP_299569.1| (NC_002488) phage-related protein [Xylella... 44 0.003
>sp|P06225|DPOL_BPSP2 DNA POLYMERASE
pir||DJBPS2 DNA-directed DNA polymerase (EC 2.7.7.7) - phage SPO2
gb|AAA32600.1| (K02752) DNA polymerase (gene L; ttg start codon) [Bacteriophage
SPO2]
emb|CAA25691.1| (X01458) SP02 DNA polymerase (aa 1-648) [Bacteriophage SPO2]
Length = 648
Score = 86.7 bits (213), Expect = 3e-016
Identities = 82/274 (29%), Positives = 118/274 (42%), Gaps = 22/274 (8%)
Query: 72 GTPVYAEHPSTEILCLYYDLKDGKGRRAYIPGGPK-PTDLIEHIANGGPI-AAWNITFEF 129
G Y E P EIL Y D + + G P +++ + + I A+N FE
Sbjct: 20 GVYAYTEAPDFEILLFAYAFDDDPVKIIDLAQGDTLPHEVLVALTSSKVIKTAYNANFE- 78
Query: 130 WIWNMVLAKREGWPWLQLEQCYCDMAKARRFSLPGSLDAAAQALKTSP-KDKKGKQLINK 188
+AK L Q C A LPG+LD A+ALK S KDK GK LI
Sbjct: 79 ---RTCIAKHFNLMLLPA-QWRCTAVHATTLGLPGNLDGVAKALKLSAQKDKAGKALIRY 134
Query: 189 LCRPVSATKA-----RPLPRWTLYTAPQDYSDLYAYCDQDVKSEDHVSALTPDMTP--SE 241
P TKA R LP P+ + YC QDV+ E + P SE
Sbjct: 135 FSVPCKPTKANGQRVRNLPEHD----PEKWEKFKVYCIQDVEVERAIKNRISKFEPLESE 190
Query: 242 RETWLADQRINARGVQVDVESLDNALDIVDQTTRRFTMELAAITNGAVGSVSEVAKMTDF 301
+ W DQ IN RGV++DV+ + +A+ +Q E +T + + + A++ +
Sbjct: 191 HKLWALDQEINDRGVRIDVDLVKHAIACDEQYQAGLIAEAKKLT--GLPNPNSTAQLKKW 248
Query: 302 LAGLGCRMHNLTADSVSEALERDDLNPTARRVLR 335
L G + +L D + E +E + + T RVLR
Sbjct: 249 LEEKGLTISSLAKDKIEELIENTN-DETVHRVLR 281
>ref|NP_051006.1| (NC_000935) P45 [Acyrthosiphon pisum bacteriophage APSE-1]
sp|Q9T1Q3|DPOL_BPAPS Probable DNA polymerase (P45)
gb|AAF03988.1|AF157835_45 (AF157835) P45 [Acyrthosiphon pisum bacteriophage APSE-1]
Length = 993
Score = 74.3 bits (181), Expect = 2e-012
Identities = 58/234 (24%), Positives = 106/234 (44%), Gaps = 11/234 (4%)
Query: 107 PTDLIEHIANGGPIAAWNITFEFWIWNMVLAKREGWPWLQLEQCYCDMAKARRFSLPGSL 166
P DL ++ + + W+ +++ V+ KR L L + + + +A LPG+L
Sbjct: 54 PADLKAYLDDSEILTVWH---NGGMFDTVILKRVLNIDLPLSRVHDTLVQALAHGLPGAL 110
Query: 167 DAAAQALKTSP---KDKKGKQLINKLCRPVSATKARPLPRWTLYTAPQDYSDLYAYCDQD 223
+ KDK+GK LI+ LC+P K + R T T +++ Y D
Sbjct: 111 GLLCDIFNVNSDKAKDKEGKALISLLCKP--RPKNSKIQRATALTHAEEWQRFKDYAGSD 168
Query: 224 VKSEDHVSALTPD--MTPSERETWLADQRINARGVQVDVESLDNALDIVDQTTRRFTMEL 281
+ + + P+ M E E W DQ+IN RG+ +DVE +AL V+ +R +
Sbjct: 169 ILAMREIYQHLPNWNMNVHETELWQLDQKINRRGMCMDVELAKSALTAVENEQKRLSTVT 228
Query: 282 AAITNGAVGSVSEVAKMTDFLA-GLGCRMHNLTADSVSEALERDDLNPTARRVL 334
+T+ AV + ++ + +A G + ++ A ++ + D+ P R +L
Sbjct: 229 QQLTDNAVQNATQRDALLQHIASAFGITLPDMQASTLQRRINDPDIPPALRELL 282
>ref|NP_297973.1| (NC_002488) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
pir||H82774 phage-related DNA polymerase XF0683 [imported] - Xylella fastidiosa
(strain 9a5c)
gb|AAF83493.1|AE003912_5 (AE003912) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
Length = 726
Score = 71.6 bits (174), Expect = 1e-011
Identities = 55/196 (28%), Positives = 90/196 (45%), Gaps = 8/196 (4%)
Query: 145 LQLEQCYCDMAKARRFSLPGSLDAAAQALKTS---PKDKKGKQLINKLCRPVSATKARPL 201
+ LE+ MA+A SLPG+L + L+ KDK GK+L+ C+P A L
Sbjct: 93 IPLERWRDSMAQALAHSLPGALGTLCELLRVPVEHAKDKDGKRLVALFCKPRPAHCT--L 150
Query: 202 PRWTLYTAPQDYSDLYAYCDQDVKSEDHVSALTP--DMTPSERETWLADQRINARGVQVD 259
R T T P D++ Y +DV + V P + +E W DQ IN RGV +D
Sbjct: 151 RRATRDTHPTDWAQFVEYAQRDVAAMRDVVKRLPSHNYAGAELALWFLDQTINDRGVLID 210
Query: 260 VESLDNALDIVDQTTRRFTMELAAITNGAVGSVSEVAKMTDFL-AGLGCRMHNLTADSVS 318
+ + A+ V++ + A+T GAV + ++ + L G + ++ +V+
Sbjct: 211 TDLVQAAIRAVERAKQTLAQRTEALTGGAVQAATQRDALLQHLNTAHGVALPDMQQHTVA 270
Query: 319 EALERDDLNPTARRVL 334
++ L T R +L
Sbjct: 271 RCVDDPALPETVRELL 286
>ref|NP_299803.1| (NC_002488) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
pir||C82548 phage-related DNA polymerase XF2525 [imported] - Xylella fastidiosa
(strain 9a5c)
gb|AAF85323.1|AE004059_13 (AE004059) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
Length = 726
Score = 71.6 bits (174), Expect = 1e-011
Identities = 55/196 (28%), Positives = 90/196 (45%), Gaps = 8/196 (4%)
Query: 145 LQLEQCYCDMAKARRFSLPGSLDAAAQALKTS---PKDKKGKQLINKLCRPVSATKARPL 201
+ LE+ MA+A SLPG+L + L+ KDK GK+L+ C+P A L
Sbjct: 93 IPLERWRDSMAQALAHSLPGALGTLCELLRVPVEHAKDKDGKRLVALFCKPRPAHCT--L 150
Query: 202 PRWTLYTAPQDYSDLYAYCDQDVKSEDHVSALTP--DMTPSERETWLADQRINARGVQVD 259
R T T P D++ Y +DV + V P + +E W DQ IN RGV +D
Sbjct: 151 RRATRDTHPTDWAQFVEYAQRDVAAMRDVVKRLPSHNYAGAELALWFLDQTINDRGVLID 210
Query: 260 VESLDNALDIVDQTTRRFTMELAAITNGAVGSVSEVAKMTDFL-AGLGCRMHNLTADSVS 318
+ + A+ V++ + A+T GAV + ++ + L G + ++ +V+
Sbjct: 211 TDLVQAAIRAVERAKQTLAQRTEALTGGAVQAATQRDALLQHLNTAHGVALPDMQQHTVA 270
Query: 319 EALERDDLNPTARRVL 334
++ L T R +L
Sbjct: 271 RCVDDPALPETVRELL 286
>gb|AAA23103.1| (M94320) putative DNA polymerase [Citrus greening
disease-associated bacterium]
Length = 206
Score = 60.1 bits (144), Expect = 3e-008
Identities = 49/166 (29%), Positives = 79/166 (47%), Gaps = 26/166 (15%)
Query: 107 PTDLIEHIANGGPIA-AWNITFEFWIWNMVLAKREGWPWLQLEQCYCDMAKARRFSLPGS 165
P+DL++++ + + A N FE +L K+ + ++ C AR LP S
Sbjct: 54 PSDLLQYLRDETVMCVAHNSLFE-----RILFKKTLGIDIPSKRWICTSVLARTNGLPSS 108
Query: 166 LDAAAQALKTSP---KDKKGKQLINKLCR------PVSATKARPLPRWTLYTAPQDYSDL 216
L A AL+ K ++GK LI + C+ P T+A + W L+
Sbjct: 109 LKNACLALRFPEHLTKKEEGKALIARFCKGSIDSPPYDCTRANHVQAWALFRE------- 161
Query: 217 YAYCDQDVKSEDHV-SALTPDMTPSERETWLADQRINARGVQVDVE 261
YC +DV++ + +LTP + P ER WL DQ IN RG ++D++
Sbjct: 162 --YCKRDVEATRELFKSLTP-LNPEERTLWLLDQEINDRGYRIDLD 204
>gb|AAB25243.1| DNA polymerase homolog [bacterium-like organism, citrus greening
disease-associated, Peptide, 207 aa]
Length = 207
Score = 60.1 bits (144), Expect = 3e-008
Identities = 49/166 (29%), Positives = 79/166 (47%), Gaps = 26/166 (15%)
Query: 107 PTDLIEHIANGGPIA-AWNITFEFWIWNMVLAKREGWPWLQLEQCYCDMAKARRFSLPGS 165
P+DL++++ + + A N FE +L K+ + ++ C AR LP S
Sbjct: 55 PSDLLQYLRDETVMCVAHNSLFE-----RILFKKTLGIDIPSKRWICTSVLARTNGLPSS 109
Query: 166 LDAAAQALKTSP---KDKKGKQLINKLCR------PVSATKARPLPRWTLYTAPQDYSDL 216
L A AL+ K ++GK LI + C+ P T+A + W L+
Sbjct: 110 LKNACLALRFPEHLTKKEEGKALIARFCKGSIDSPPYDCTRANHVQAWALFRE------- 162
Query: 217 YAYCDQDVKSEDHV-SALTPDMTPSERETWLADQRINARGVQVDVE 261
YC +DV++ + +LTP + P ER WL DQ IN RG ++D++
Sbjct: 163 --YCKRDVEATRELFKSLTP-LNPEERTLWLLDQEINDRGYRIDLD 205
>ref|NP_299569.1| (NC_002488) phage-related protein [Xylella fastidiosa 9a5c]
pir||G82574 phage-related protein XF2290 [imported] - Xylella fastidiosa
(strain 9a5c)
gb|AAF85089.1|AE004041_1 (AE004041) phage-related protein [Xylella fastidiosa 9a5c]
Length = 312
Score = 43.5 bits (101), Expect = 0.003
Identities = 30/126 (23%), Positives = 56/126 (43%), Gaps = 3/126 (2%)
Query: 212 DYSDLYAYCDQDVKSEDHVSALTP--DMTPSERETWLADQRINARGVQVDVESLDNALDI 269
D++ Y +DV + V P + +E W DQ IN RGV +D + + A+
Sbjct: 2 DWAQFVEYARRDVAAMRDVVKRLPSHNYAGAELALWFLDQTINDRGVLIDTDLVQAAIGA 61
Query: 270 VDQTTRRFTMELAAITNGAVGSVSEVAKMTDFL-AGLGCRMHNLTADSVSEALERDDLNP 328
V++ + A+T GAV + ++ + L G + ++ +V+ ++ L
Sbjct: 62 VERAKQTLAQRTEALTGGAVQAATQRDALLQHLNTAHGVALPDMQQHTVARCVDDPALPE 121
Query: 329 TARRVL 334
T R +L
Sbjct: 122 TVRELL 127
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.317 0.133 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 194,837,355
Number of Sequences: 884779
Number of extensions: 8603425
Number of successful extensions: 18374
Number of sequences better than 5.0e-02: 7
Number of HSP's better than 0.1 without gapping: 3
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 18363
Number of HSP's gapped (non-prelim): 7
length of query: 339
length of database: 277,083,050
effective HSP length: 121
effective length of query: 218
effective length of database: 170,024,791
effective search space: 37065404438
effective search space used: 37065404438
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 91 (39.7 bits)
Query= orf41 [11307-11478]
(170 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.318 0.132 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 95,252,776
Number of Sequences: 884779
Number of extensions: 3764828
Number of successful extensions: 8556
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 8556
Number of HSP's gapped (non-prelim): 0
length of query: 170
length of database: 277,083,050
effective HSP length: 111
effective length of query: 59
effective length of database: 178,872,581
effective search space: 10553482279
effective search space used: 10553482279
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 86 (37.7 bits)
Query= orf42 [11499-11837]
(337 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.319 0.138 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 112,778,215
Number of Sequences: 884779
Number of extensions: 4929411
Number of successful extensions: 8412
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 8412
Number of HSP's gapped (non-prelim): 0
length of query: 337
length of database: 277,083,050
effective HSP length: 120
effective length of query: 217
effective length of database: 170,909,570
effective search space: 37087376690
effective search space used: 37087376690
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 91 (39.7 bits)
Query= orf43 [11841-12192]
(350 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.318 0.135 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 198,552,102
Number of Sequences: 884779
Number of extensions: 8335154
Number of successful extensions: 18488
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 18488
Number of HSP's gapped (non-prelim): 0
length of query: 350
length of database: 277,083,050
effective HSP length: 121
effective length of query: 229
effective length of database: 170,024,791
effective search space: 38935677139
effective search space used: 38935677139
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 91 (39.7 bits)
Query= orf44 [12218-12506]
(287 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.309 0.127 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 164,641,227
Number of Sequences: 884779
Number of extensions: 7046383
Number of successful extensions: 15184
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 15184
Number of HSP's gapped (non-prelim): 0
length of query: 287
length of database: 277,083,050
effective HSP length: 119
effective length of query: 168
effective length of database: 171,794,349
effective search space: 28861450632
effective search space used: 28861450632
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 90 (39.3 bits)
Query= orf45 [12600-12686]
(85 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.325 0.139 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 47,940,177
Number of Sequences: 884779
Number of extensions: 1648020
Number of successful extensions: 3403
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3403
Number of HSP's gapped (non-prelim): 0
length of query: 85
length of database: 277,083,050
effective HSP length: 61
effective length of query: 24
effective length of database: 223,111,531
effective search space: 5354676744
effective search space used: 5354676744
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 84 (37.0 bits)
Query= orf46 [12684-13235]
(550 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAB63670.1| (AJ251789) hypothetical protein [Lactobacil... 199 6e-050
ref|NP_511031.1| (NC_003291) helicase [Bacteriophage PSA] >... 189 6e-047
ref|NP_050128.1| (NC_000896) putative DEAH-family helicase ... 187 2e-046
ref|NP_049952.1| (NC_000871) orf443 gp [Streptococcus therm... 185 9e-046
ref|NP_049999.1| (NC_000872) orf443 gp [Streptococcus therm... 185 9e-046
pir||T13299 probable helicase - Streptococcus phage phi-O12... 185 9e-046
emb|CAC04160.1| (AJ292531) putative helicase [Lactococcus b... 183 3e-045
gb|AAL74161.1|AF442520_1 (AF442520) putative helicase [Stre... 183 5e-045
ref|NP_471919.1| (NC_003212) similar to DEAH-family helicas... 177 2e-043
sp|P24125|V51K_BPL79 51.5 kDa protein >gi|76364|pir||JU0445... 174 2e-042
ref|NP_490263.1| (NC_003276) putative helicase [Nostoc sp. ... 155 8e-037
ref|NP_459877.1| (NC_003197) Fels-1 prophage; putative DNA ... 114 2e-024
ref|NP_231273.1| (NC_002505) helicase-related protein [Vibr... 110 5e-023
ref|NP_142208.1| (NC_000961) hypothetical protein [Pyrococc... 77 5e-013
gb|AAF12716.1|AF066865_14 (AF066865) unknown [bacteriophage... 66 1e-009
ref|NP_559418.1| (NC_003364) helicase, possible DNA repair ... 65 2e-009
ref|NP_248512.1| (NC_000909) ATP dependent RNA helicase, pu... 55 2e-006
sp|O02494|IF4A_CRYPV Eukaryotic initiation factor 4A (eIF-4... 50 6e-005
ref|NP_421775.1| (NC_002696) excinuclease ABC, subunit B [C... 48 3e-004
ref|NP_375993.1| (NC_003106) 621aa long conserved hypotheti... 44 0.006
>emb|CAB63670.1| (AJ251789) hypothetical protein [Lactobacillus casei bacteriophage
A2]
Length = 455
Score = 199 bits (506), Expect = 6e-050
Identities = 139/422 (32%), Positives = 205/422 (47%), Gaps = 52/422 (12%)
Query: 1 MFKLRDYQQLAYNRI---MAAWQEYRSVLAVLPTGAGKTVIFSKIIH---DHVGAAAAIV 54
MF+L YQ+ N+ +AA +SVL V P G+GK+VI ++I + G +V
Sbjct: 1 MFQLHPYQKELVNQAREKLAAGN--KSVLLVSPAGSGKSVIIAEIARLAVERSGHVMFMV 58
Query: 55 HRREIVGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLT 114
HR+E+V QI Q Q I +Q T
Sbjct: 59 HRQELVNQIV------------------------------------QTFQADEIDLQATT 82
Query: 115 SKSTMNDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXX 174
+ A + + + +L + DE HH + + + K + L +A+P R +
Sbjct: 83 IMTVGKIANRLNRLPRPSLIITDESHHSLAK-TYRKIYSFYADVPRLGFSASPWRMNGQG 141
Query: 175 XXXXXXXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKAR 234
++M+EGP+ WLI+ +L+ + Y AP T ID++ L GD+ K++ A
Sbjct: 142 LGNVY----ESMVEGPSVKWLIDHQYLAPYDYYAP-TLIDVQKLQTSSTGDYTNKSMDAA 196
Query: 235 VVESHIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERD 294
V ++ I GDVV HY+ ++AIV+A ++ ++Q+ E F+ AG TA +T G+RD
Sbjct: 197 VPKA-IFGDVVSHYQHLAGGRQAIVYAHSIEASKQVVEAFQSAGITAVHADAKTPKGQRD 255
Query: 295 RLLEEFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEY 354
R++ +F+ G I IL NVDL EGF+VP V +I+ RPT SL +Q R +R GK
Sbjct: 256 RIMADFKAGKITILSNVDLISEGFNVPDVGVIIMLRPTASLVLDIQQSMRGMRYKPGKR- 314
Query: 355 AIIIDPVRNWERHGPPTAVRTWTLKGREKGQSSGSDTVPQKTCDACTIPYEAYNKACPYC 414
+IIID V N R G P WTL R K + + P KTCDAC K CP C
Sbjct: 315 SIIIDHVANAYRFGLPDTEHEWTLNDRPKKKKRDNTGPPIKTCDACYAVIPVQCKICPIC 374
Query: 415 GH 416
GH
Sbjct: 375 GH 376
>ref|NP_511031.1| (NC_003291) helicase [Bacteriophage PSA]
emb|CAC85606.1| (AJ312240) helicase [Bacteriophage PSA]
Length = 418
Score = 189 bits (480), Expect = 6e-047
Identities = 126/388 (32%), Positives = 195/388 (49%), Gaps = 47/388 (12%)
Query: 32 GAGKTVIFSKIIH---DHVGAAAAIVHRREIVGQISLSLASFGVKHXXXXXXXXXXXXXX 88
GAGK+VI S+II + +VHR+E++ QI +L V
Sbjct: 9 GAGKSVILSEIIRMTTHNKNYVLFLVHRKELIDQIRNTLIMNEVD--------------- 53
Query: 89 XXXXXXGQSYIDQNAQCGVISVQTLTSKSTMNDATIMRWVKQVTLSVYDEGHHYVEQGIW 148
SY+ VQT+ + + + +L + DE HH + +
Sbjct: 54 -------MSYVKLGM------VQTIVRR--------LNKTSEPSLIIIDESHHVLANS-Y 91
Query: 149 AKAVHLMENAKLLFVTATPERADXXXXXXXXXXFAQTMIEGPTTHWLIEQGFLSKFVYRA 208
K +H AK++ TATP R + T+IE WLIE FL+ + Y A
Sbjct: 92 KKIIHHFSKAKVVGFTATPVRINGGGLGDIN----DTLIEKVNVKWLIENQFLAHYKYYA 147
Query: 209 PQTDIDLRDLAVGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKRAIVFATDVDTAE 268
P+T + L V + G+F+ +L + + I GDV++HY+K + ++AI++A+ + +E
Sbjct: 148 PET-VQTETLNVKRTGEFDMTSLDDQFNKRMIWGDVIKHYQKLADGEQAILYASSIYQSE 206
Query: 269 QMAEEFRKAGYTAAAVSGETESGERDRLLEEFEQGTIQILINVDLFDEGFDVPAVECVIL 328
+MA F AG ++A + G+T RD ++++F +G +++L N+DL EGFDVP VI+
Sbjct: 207 KMAASFNAAGISSAHIDGKTPKPIRDDIIKQFREGELKVLCNLDLIGEGFDVPDCSTVIM 266
Query: 329 ARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRTWTLKGREKGQSSG 388
RPTQSL+ ++Q R +R +GK A IID V N +R G P RTW+L+ R KG +S
Sbjct: 267 LRPTQSLSLYIQQSMRGMRYKQGKT-ATIIDHVGNVKRFGLPDMERTWSLEPR-KGSNST 324
Query: 389 SDTVPQKTCDACTIPYEAYNKACPYCGH 416
P K C C + K C +CGH
Sbjct: 325 KAEAPVKICKECFMTVSQTAKKCEHCGH 352
>ref|NP_050128.1| (NC_000896) putative DEAH-family helicase [Lactobacillus
bacteriophage phi adh]
emb|CAB52498.1| (AJ131519) putative DEAH-family helicase [Lactobacillus
bacteriophage phi adh]
Length = 455
Score = 187 bits (475), Expect = 2e-046
Identities = 132/443 (29%), Positives = 214/443 (47%), Gaps = 50/443 (11%)
Query: 4 LRDYQQLAYNRIMAAWQE-YRSVLAVLPTGAGKTVIFSKI---IHDHVGAAAAIVHRREI 59
+R YQ+ + + + + +++VL P G+GK++ +++ I + +VHRRE+
Sbjct: 5 IRSYQKKLIDDLRLSLKHGHKNVLIQSPAGSGKSITMAELAKRITSNGERVLFVVHRREL 64
Query: 60 VGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKSTM 119
V QI + +S+GV C + VQT + +
Sbjct: 65 VKQIKNTFSSWGVNMNL----------------------------CEIYMVQTASRR--- 93
Query: 120 NDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXXX 179
+ + + DE HH + + + K + +NA +L TATP R
Sbjct: 94 -----LEKLMTPNYILVDEAHHSLAK-TYKKINNHFKNAHILGFTATPVRLSGKGFKDIY 147
Query: 180 XXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVESH 239
++ GP WLIE +L+ + Y + ID L GD+ K+++ ++
Sbjct: 148 ----DDLVLGPKISWLIENHYLAPYTYYSVNL-IDQTKLKKSSTGDYTHKSIE-NAGKNI 201
Query: 240 IVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEE 299
+ GDV+Q Y K+ N +AI+++ V + +Q+A+EF K A V G+T+ +RD+ +++
Sbjct: 202 VYGDVIQSYHKFANNTKAIIYSYSVHSCQQIAKEFNKNNIPAKEVDGKTKKEDRDKAMQD 261
Query: 300 FEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIID 359
F G I+IL+N +L+ EG DVP E VI+ RPTQSL+ F+Q R +R K+ AIIID
Sbjct: 262 FRDGKIKILVNAELYGEGVDVPDCETVIMLRPTQSLSLFIQQSMRCMRYQPDKQ-AIIID 320
Query: 360 PVRNWERHGPPTAVRTWTLKGREK--GQSSGSDTVPQKTCDACTIPYEAYNKACPYCGHV 417
V N+ R G P R WTL+ R K + GSD + KTC C A CP CG+
Sbjct: 321 QVANYTRFGLPDMDRVWTLEDRAKHPQREGGSDGIAIKTCPNCFGVIMASYHKCPLCGYS 380
Query: 418 NVYPDRSAPDKVDGDLVELDLDA 440
R + +L +++LDA
Sbjct: 381 FEAEFRKLAEDKRAELEKINLDA 403
>ref|NP_049952.1| (NC_000871) orf443 gp [Streptococcus thermophilus bacteriophage
Sfi19]
gb|AAC97921.1| (AF077306) putative helicase; gp443 [Streptococcus thermophilus
bacteriophage Sfi19]
gb|AAD44071.1|AF115102_30 (AF115102) orf443 gp [Streptococcus thermophilus bacteriophage
Sfi19]
Length = 443
Score = 185 bits (470), Expect = 9e-046
Identities = 132/423 (31%), Positives = 207/423 (48%), Gaps = 54/423 (12%)
Query: 3 KLRDYQQLAYNRI-MAAWQEYRSVLAVLPTGAGKTVIFSKIIHDHVGAAAAIV---HRRE 58
+LR+YQ N I + + + ++ P +GKTV+ + I I+ HR+E
Sbjct: 2 ELRNYQSDLVNDIKQSILRGNKRIMVQSPPRSGKTVVMAHIAKGATDKGNTILFFSHRKE 61
Query: 59 IVGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKST 118
I Q+ + GV D N ++++ ++T +
Sbjct: 62 INEQVVNTFKRNGV---------------------------DMN----LVTIDSVTKIAR 90
Query: 119 MNDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXX 178
D +++ ++ + DE HH V+ + K + N+ +L T TP R D
Sbjct: 91 NLDR-----IQEPSIILIDEAHH-VKAKTYLKIIEYYSNSIVLMFTGTPARLDGSGFDD- 143
Query: 179 XXXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVES 238
A ++ G WL + G ++ F Y AP ID +L + G+F K++ ++
Sbjct: 144 ---IADDIVLGKPVKWLQDNGNIAPFKYYAPSL-IDTTNLKK-RGGEFTKKSVDD-TMKR 197
Query: 239 HIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLE 298
I GDV++HY K + K+AIV+ V+ +E ++ F + GYT+ A+SG+T RDR ++
Sbjct: 198 VIYGDVIRHYEKLAKGKQAIVYTHSVEASESVSNTFNEQGYTSIAISGKTPPEVRDRAMQ 257
Query: 299 EFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIII 358
F G + I++N +LF EG D+P V+ I+ RPTQSL+ +LQ RAL EGK AIII
Sbjct: 258 AFRDGKLTIMVNCELFTEGIDLPNVDVCIMLRPTQSLSLYLQFAMRALNPREGKT-AIII 316
Query: 359 DPVRNWERHGPPTAVRTWTLKGREKGQSS---GSDTVPQKTCDACTIPYEAYNKACPYCG 415
D V N +RHG P A R W+LKG K + G T +TCD C + + + CP CG
Sbjct: 317 DHVGNVDRHGLPNADREWSLKGVNKTKKKLKLGEPTT--RTCDECYATFWSAERICPMCG 374
Query: 416 HVN 418
H N
Sbjct: 375 HEN 377
>ref|NP_049999.1| (NC_000872) orf443 gp [Streptococcus thermophilus bacteriophage
Sfi21]
ref|NP_056709.1| (NC_002214) putative helicase [Streptococcus thermophilus
bacteriophage Sfi11]
gb|AAC72435.1| (AF004379) orf443 [Streptococcus thermophilus bacteriophage Sfi21]
gb|AAD44104.1|AF115103_34 (AF115103) orf443 gp [Streptococcus thermophilus bacteriophage
Sfi21]
gb|AAF63056.1|AF158600_10 (AF158600) putative helicase [Streptococcus thermophilus
bacteriophage Sfi11]
gb|AAF63085.1|AF158601_13 (AF158601) putative helicase [Streptococcus thermophilus
bacteriophage SFi18]
Length = 443
Score = 185 bits (470), Expect = 9e-046
Identities = 132/423 (31%), Positives = 207/423 (48%), Gaps = 54/423 (12%)
Query: 3 KLRDYQQLAYNRI-MAAWQEYRSVLAVLPTGAGKTVIFSKIIHDHVGAAAAIV---HRRE 58
+LR+YQ N I + + + ++ P +GKTV+ + I I+ HR+E
Sbjct: 2 ELRNYQSDLVNDIKQSILRGNKRIMVQSPPRSGKTVVMAHIAKGATDKGNTILFFSHRKE 61
Query: 59 IVGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKST 118
I Q+ + GV D N ++++ ++T +
Sbjct: 62 INEQVVNTFKRNGV---------------------------DMN----LVTIDSVTKIAR 90
Query: 119 MNDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXX 178
D +++ ++ + DE HH V+ + K + N+ +L T TP R D
Sbjct: 91 NLDR-----IQEPSIILIDEAHH-VKAKTYLKIIEYYSNSIVLMFTGTPARLDGSGFDD- 143
Query: 179 XXXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVES 238
A ++ G WL + G ++ F Y AP ID +L + G+F K++ ++
Sbjct: 144 ---IADDIVLGKPVKWLQDNGNIAPFKYYAPSL-IDTTNLKK-RGGEFTKKSVDD-TMKR 197
Query: 239 HIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLE 298
I GDV++HY K + K+AIV+ V+ +E ++ F + GYT+ A+SG+T RDR ++
Sbjct: 198 VIYGDVIRHYEKLAKGKQAIVYTHSVEASESVSNTFNEQGYTSIAISGKTPPEVRDRAMQ 257
Query: 299 EFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIII 358
F G + I++N +LF EG D+P V+ I+ RPTQSL+ +LQ RAL EGK AIII
Sbjct: 258 AFRDGKLTIMVNCELFTEGIDLPNVDVCIMLRPTQSLSLYLQFAMRALNPREGKT-AIII 316
Query: 359 DPVRNWERHGPPTAVRTWTLKGREKGQSS---GSDTVPQKTCDACTIPYEAYNKACPYCG 415
D V N +RHG P A R W+LKG K + G T +TCD C + + + CP CG
Sbjct: 317 DHVGNVDRHGLPNADREWSLKGVNKTKKKLKLGEPTT--RTCDECYATFWSAERICPMCG 374
Query: 416 HVN 418
H N
Sbjct: 375 HEN 377
>pir||T13299 probable helicase - Streptococcus phage phi-O1205
gb|AAC79526.1| (U88974) ORF10 [Streptococcus thermophilus temperate bacteriophage
O1205]
Length = 443
Score = 185 bits (470), Expect = 9e-046
Identities = 131/423 (30%), Positives = 208/423 (48%), Gaps = 54/423 (12%)
Query: 3 KLRDYQQLAYNRI-MAAWQEYRSVLAVLPTGAGKTVIFSKIIHDHVGAAAAIV---HRRE 58
+LR+YQ N I + + + ++ P +GKTV+ + I I+ HR+E
Sbjct: 2 ELRNYQSDLVNDIKQSILRGNKRIMVQSPPRSGKTVVMAHIAKGATDKGNTILFFSHRKE 61
Query: 59 IVGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKST 118
I Q+ + GV D N ++++ ++T +
Sbjct: 62 INEQVVNTFKRNGV---------------------------DMN----LVTIDSVTKIAR 90
Query: 119 MNDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXX 178
D +++ ++ + DE HH V+ + K + N+ +L T TP R D
Sbjct: 91 NLDR-----IQEPSIILIDEAHH-VKAKTYLKIIEYYSNSIVLMFTGTPARLDGSGFDD- 143
Query: 179 XXXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVES 238
A ++ G + WL + G ++ F Y AP ID +L + G+F K++ ++
Sbjct: 144 ---IADDIVLGKSVKWLQDNGNIAPFKYYAPSL-IDTTNLKK-RGGEFTKKSVDD-TMKR 197
Query: 239 HIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLE 298
I GDV++HY K + K+AIV+ V+ +E ++ F + GYT+ A+SG+T RDR ++
Sbjct: 198 VIYGDVIRHYEKLAKGKQAIVYTHSVEASESVSNTFNEQGYTSIAISGKTPPEVRDRAMQ 257
Query: 299 EFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIII 358
F G + I++N +LF EG D+P V+ I+ RPTQSL+ +LQ RAL EGK AIII
Sbjct: 258 AFRDGKLTIMVNCELFTEGIDLPNVDVCIMLRPTQSLSLYLQFAMRALNPREGKT-AIII 316
Query: 359 DPVRNWERHGPPTAVRTWTLKGREKGQSS---GSDTVPQKTCDACTIPYEAYNKACPYCG 415
D V N +RHG P A R W++KG K + G T +TCD C + + + CP CG
Sbjct: 317 DHVGNVDRHGLPNADREWSIKGVNKTKKKLKLGEPTT--RTCDECYATFWSAERICPMCG 374
Query: 416 HVN 418
H N
Sbjct: 375 HEN 377
>emb|CAC04160.1| (AJ292531) putative helicase [Lactococcus bacteriophage phi31]
Length = 448
Score = 183 bits (465), Expect = 3e-045
Identities = 145/442 (32%), Positives = 212/442 (47%), Gaps = 56/442 (12%)
Query: 4 LRDYQQLAYNRIMAAWQE-YRSVLAVLPTGAGKTVIF---SKIIHDHVGAAAAIVHRREI 59
LRDYQ+ I +++ + RS++ P +GKTV+ SK D HR+EI
Sbjct: 3 LRDYQEELVESIKSSFLKGNRSIIVQSPPRSGKTVVMADISKGATDKKNHVLFFSHRKEI 62
Query: 60 VGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKSTM 119
Q+ + +F + Y+ + SVQ+L K
Sbjct: 63 NDQV---VKTFELNQV-------------------NMEYVT------IGSVQSLVRK--- 91
Query: 120 NDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXXX 179
+ + + + DE HH ++ + K + NA LF T TP R +
Sbjct: 92 -----IDELPLPDIILVDEAHH-IKANSYKKILEAFPNALKLFFTGTPIRLNGQGFED-- 143
Query: 180 XXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVESH 239
A +I G + WL E G ++ F Y AP ID L +GDF K++ +
Sbjct: 144 --MADDLITGKSIKWLQEHGNIAPFKYYAPNI-IDTSQLKK-TSGDFTQKSMD-EAFKRA 198
Query: 240 IVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEE 299
I GDV+ HY K + K+AI +A +V TA+ ++EEF +AG TA V G+T ER+ ++ +
Sbjct: 199 IYGDVIAHYNKLSKGKQAICYAHNVATAQHISEEFNQAGITAEVVHGKTPKTEREAIMNK 258
Query: 300 FEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIID 359
F G I +LINV+LF EG D+P V I+ RPTQSL+ FLQ R L GK AI+ID
Sbjct: 259 FRAGEILVLINVELFTEGVDLPDVTTCIMLRPTQSLSLFLQFAMRPLNPKPGKT-AILID 317
Query: 360 PVRNWERHGPPTAVRTWTLKGREKGQS----SGSDTVPQKTCDACTIPYEAYN-KACPYC 414
V N+ RHG P R WTL G K +S G T+ K C+ C +++ N + CPYC
Sbjct: 318 HVGNYTRHGLPNEDREWTLSGISKKRSEYNTKGELTI--KQCEMCFGCFDSSNARTCPYC 375
Query: 415 GHVNVYPDRSAPDKVDGDLVEL 436
GH +R + + +L E+
Sbjct: 376 GHEPELTERELENIKEIELQEI 397
>gb|AAL74161.1|AF442520_1 (AF442520) putative helicase [Streptococcus thermophilus
bacteriophage k3]
Length = 443
Score = 183 bits (464), Expect = 5e-045
Identities = 131/423 (30%), Positives = 207/423 (47%), Gaps = 54/423 (12%)
Query: 3 KLRDYQQLAYNRI-MAAWQEYRSVLAVLPTGAGKTVIFSKIIHDHVGAAAAIV---HRRE 58
+LR+YQ N I + + + ++ P +GKTV+ + I ++ HR+E
Sbjct: 2 ELRNYQNDLVNGIKQSILRGNKRIMVQSPPRSGKTVVMAHIAKGATDKGNTVLFFSHRKE 61
Query: 59 IVGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKST 118
I Q+ + GV D N ++++ ++T +
Sbjct: 62 INEQVVNTFKRNGV---------------------------DMN----LVTIDSVTKVAR 90
Query: 119 MNDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXX 178
D + + ++ + DE HH V+ + K + N+ +L T TP R D
Sbjct: 91 NLDR-----ISEPSIILIDEAHH-VKAKTYLKIIEYYINSIVLMFTGTPARLDGSGFDD- 143
Query: 179 XXXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVES 238
A ++ G + WL E G ++ F Y AP ID +L + G+F K++ ++
Sbjct: 144 ---IADDIVLGKSVKWLQENGNIAPFKYYAPSL-IDTTNLKK-RGGEFTKKSVDD-TMKR 197
Query: 239 HIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLE 298
I GDV++HY K + K+AIV+ V+ +E ++ F+K GYT+ A+SG+T R+R ++
Sbjct: 198 VIYGDVIRHYEKLAKGKQAIVYTHSVEASESVSNTFKKNGYTSIAISGKTPPEVRERAMQ 257
Query: 299 EFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIII 358
F + I++N +LF EG D+P V+ I+ RPTQSL+ +LQ RAL EGK AIII
Sbjct: 258 TFRDRELTIMVNCELFTEGIDLPNVDVCIMLRPTQSLSLYLQFAMRALNPREGKT-AIII 316
Query: 359 DPVRNWERHGPPTAVRTWTLKGREKGQSS---GSDTVPQKTCDACTIPYEAYNKACPYCG 415
D V N +RHG P A R W+LKG K + G T +TCD C + + + CP CG
Sbjct: 317 DHVGNVDRHGLPNADREWSLKGINKTKKKLKLGEPTT--RTCDECYATFWSAERICPLCG 374
Query: 416 HVN 418
H N
Sbjct: 375 HEN 377
>ref|NP_471919.1| (NC_003212) similar to DEAH-family helicase [Listeria innocua]
emb|CAC97816.1| (AL596173) similar to DEAH-family helicase [Listeria innocua]
Length = 418
Score = 177 bits (450), Expect = 2e-043
Identities = 121/388 (31%), Positives = 188/388 (48%), Gaps = 47/388 (12%)
Query: 32 GAGKTVIFSKIIH---DHVGAAAAIVHRREIVGQISLSLASFGVKHXXXXXXXXXXXXXX 88
GAGK+VI S+II + +VHR+E++ QI +L V
Sbjct: 9 GAGKSVILSEIIRMTTRNKNNVLFLVHRKELIDQIRNTLTMNDVD--------------- 53
Query: 89 XXXXXXGQSYIDQNAQCGVISVQTLTSKSTMNDATIMRWVKQVTLSVYDEGHHYVEQGIW 148
+++ VQT+ + + + L + DE HH + +
Sbjct: 54 -------MKFVNLGM------VQTVVRR--------LEKTSEPALIIIDESHHVLANS-Y 91
Query: 149 AKAVHLMENAKLLFVTATPERADXXXXXXXXXXFAQTMIEGPTTHWLIEQGFLSKFVYRA 208
K ++ NAK++ TATP R + +IE WLIE FL+ + Y A
Sbjct: 92 KKIINHFSNAKVVGFTATPVRINGGGLGDIN----DMLIEKVNVKWLIENQFLAPYKYFA 147
Query: 209 PQTDIDLRDLAVGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKRAIVFATDVDTAE 268
P+ + L + + G+F+ L + + I GDV++HY+K ++AI++A+ + +E
Sbjct: 148 PEI-VQTETLDIKRTGEFDMTGLDDQFNKRMIWGDVIKHYQKLANGEQAILYASSLYQSE 206
Query: 269 QMAEEFRKAGYTAAAVSGETESGERDRLLEEFEQGTIQILINVDLFDEGFDVPAVECVIL 328
+MA F G T+A + G+T RD +++ F +G I++L N+DL EGFDVP VI+
Sbjct: 207 KMAMSFESVGITSAHIDGKTPKSIRDDIIQRFREGEIKVLCNLDLIGEGFDVPDCSTVIM 266
Query: 329 ARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRTWTLKGREKGQSSG 388
RPTQSL+ ++Q R +R GK AIIID V N R G P R W+L+ + KG +S
Sbjct: 267 LRPTQSLSLYIQQSMRGMRYRTGKT-AIIIDHVGNVNRFGLPDMERKWSLEAK-KGSNSN 324
Query: 389 SDTVPQKTCDACTIPYEAYNKACPYCGH 416
P K C C + + N C +CGH
Sbjct: 325 KAEAPVKICPDCFMTVLSTNIKCSHCGH 352
>sp|P24125|V51K_BPL79 51.5 kDa protein
pir||JU0445 51K protein GP51C - Lactococcus phage mi7-9
gb|AAA32363.1| (M36388) conserved bacteriophage encoded protein [Lactococcus
bacteriophage (ISOLATE 7-9)]
Length = 452
Score = 174 bits (442), Expect = 2e-042
Identities = 129/394 (32%), Positives = 188/394 (46%), Gaps = 55/394 (13%)
Query: 30 PTGAGKTVIFS---KIIHDHVGAAAAIVHRREIVGQISLSLASFGVKHXXXXXXXXXXXX 86
P G+GK+V+ S K+ + G +VHR+E++ QI+ S GV
Sbjct: 5 PPGSGKSVVISEITKLATEKGGRVLFLVHRKELIDQITNSFKVHGVP------------- 51
Query: 87 XXXXXXXXGQSYIDQNAQCGVISVQTLTSKSTMNDATIMRWVKQVTLSVYDEGHHYVEQG 146
+ V+ LT N +++R + TL + DEGHH +
Sbjct: 52 --------------------LNQVELLTVGKAKNRLSVLR---KPTLIITDEGHHG-KAA 87
Query: 147 IWAKAVHLMENAKLLFVTATPERADXXXXXXXXXXFAQTMIEGPTTHWLIEQGFLSKFVY 206
+ K + L TATP R MIEG T WLI L+ + Y
Sbjct: 88 TYQKIYEFYADVPRLGFTATPWRMSGDGFKDTYDY----MIEGKTVEWLINNKRLAPYQY 143
Query: 207 RAPQTDIDLRDLAVGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKRAIVFATDVDT 266
+ + ID+ L + KNGD++ +++ + ++ I GDVVQ Y K ++AI++A V+
Sbjct: 144 YSLPS-IDISKLRI-KNGDYSNQSIDDALGKT-IFGDVVQEYVKHANGQKAILYAHSVEA 200
Query: 267 AEQMAEEFRKAGYTAAAVSGETESGERDRLLEEFEQGTIQILINVDLFDEGFDVPAVECV 326
++ A+EF++AG A V +T ERD+L+ +F G I++L NVDL EGFDVP
Sbjct: 201 SQSFAKEFQEAGIKAVHVDAKTPKNERDKLMLDFRNGRIKVLCNVDLISEGFDVPDCTVT 260
Query: 327 ILARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRTWT--LKG--RE 382
IL RPT+SL FLQ R++R K AIIID V NW HG P W +G ++
Sbjct: 261 ILCRPTKSLVLFLQQSMRSMRYQHRKT-AIIIDNVMNWHAHGLPDTHHDWKDYFEGGWKK 319
Query: 383 KGQSSGSDTVPQKTCDACTIPYEAYNKACPYCGH 416
KGQ + V K C C+ + K C CGH
Sbjct: 320 KGQ---KNIVQAKQCPDCSAMWPLSQKMCNLCGH 350
>ref|NP_490263.1| (NC_003276) putative helicase [Nostoc sp. PCC 7120]
dbj|BAB78241.1| (AP003600) ORF_ID:alr7157~putative helicase [Nostoc sp. PCC 7120]
Length = 629
Score = 155 bits (393), Expect = 8e-037
Identities = 121/421 (28%), Positives = 182/421 (42%), Gaps = 56/421 (13%)
Query: 1 MFKLRDYQQLAYNRIMAAWQE-YRSVLAVLPTGAGKTVIFSKIIHDHVG---AAAAIVHR 56
MF LRDYQQ ++ AAW R VL L TG GKTVIF++I + HR
Sbjct: 1 MFNLRDYQQDLISKTFAAWSSGIRKVLLQLSTGGGKTVIFAEIASKMTAQGEGVLVVAHR 60
Query: 57 REIVGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSK 116
E++ Q + L + Y ++ + S+QTL +
Sbjct: 61 EELILQAAEKLTA----------------VTKLQPGIIKAGYKSTDSPLQIASIQTLARR 104
Query: 117 STMNDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXX 176
T A ++ + DE HH + K + +A +L +TATP R D
Sbjct: 105 QTYPSAQLV---------IIDEAHH-SSANSYRKLLDAYPHALVLGLTATPRRED----G 150
Query: 177 XXXXXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVV 236
+I +T LI G+L+ + A V + DF K L+ V
Sbjct: 151 YGLRDIFDHLICSVSTKELITLGYLTDYKLIA---GFKYSRHKVPQKRDFTRKELE-EVA 206
Query: 237 ESHIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRL 296
+ +V++ ++ + K+ ++FA +V ++Q+A F G T + GET ER +
Sbjct: 207 SDYKPSEVLKQWQNFCAGKKTVIFAVNVMHSKQIAAAFCADGITCEHLDGETPHNERQAI 266
Query: 297 LEEFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAI 356
L F G Q++ N + EGFD P ++ARPT S+ +LQM+GR LR GK+YA
Sbjct: 267 LNRFRSGQTQVISNCAILTEGFDCPDSSAAVIARPTSSVTLWLQMIGRVLRPAPGKDYAT 326
Query: 357 IIDPVRNWERHGPPTAVRTWTLKGREKGQSSGSDTVPQKTCDACTIPYEAYNKACPYCGH 416
I+D NW R G P R W+L+ +CD P ++ CP+C H
Sbjct: 327 ILDMTDNWFRLGRPCDNRKWSLE--------------PVSCD----PDTQGSRCCPHCHH 368
Query: 417 V 417
V
Sbjct: 369 V 369
>ref|NP_459877.1| (NC_003197) Fels-1 prophage; putative DNA or RNA helicases of
superfamily II [Salmonella typhimurium LT2]
gb|AAL19836.1| (AE008738) Fels-1 prophage; putative DNA or RNA helicases of
superfamily II [Salmonella typhimurium LT2]
Length = 527
Score = 114 bits (286), Expect = 2e-024
Identities = 100/413 (24%), Positives = 172/413 (41%), Gaps = 46/413 (11%)
Query: 19 WQEYRSVLAVLPTGAGKTVIFSKIIHDHVGAAAAI---VHRREIVGQISLSLASFGVKHX 75
W+ + + + PTG+GKT + + I+ V + V + ++ Q + +G+
Sbjct: 6 WKSHNTFMVYAPTGSGKTGLAAFIVDGFVSRGMRVMFCVPYQILITQTASRFIEYGLS-- 63
Query: 76 XXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKSTMNDATIMR-WVKQVTLS 134
G + + +Q ++ D I R + + L
Sbjct: 64 ---------------GDEIGYVWASHPNHNPALKIQIASA-----DTLIRREFPDNIDLL 103
Query: 135 VYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXXXXXFAQTMIEGPTTHW 194
+ DE H ++ + K++ ++ TP + +I+ T
Sbjct: 104 IIDEAHLRKKRILQDIERLRASGVKVIGLSGTP-------FSPFLGKYYDRLIKPTTIGE 156
Query: 195 LIEQGFLSKFVYRAPQTDIDLRDLAVGKN----GDFNAKALKARVVESHIVGDVVQHYRK 250
LI++G LSK+ + AP T DL+ + + D+N L + S +VGD+VQ++ +
Sbjct: 157 LIQRGDLSKYEFYAP-TKPDLKGVKTKSSLEYGSDYNEAQLAEIMCGSTLVGDIVQNWLE 215
Query: 251 WGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEEFEQGTIQILIN 310
G + I F +V A + +F AG A ++ +T ER ++ FE G +I+++
Sbjct: 216 NGRDLPTIAFCVNVAHANYLTIQFNLAGINAEVMTADTPVDERQTIIHRFETGATKIIVS 275
Query: 311 VDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGPP 370
V + GFD V C+I ARPT+S ++LQ +GR LR GKE +I D R G P
Sbjct: 276 VGVLVAGFD-SDVRCIIYARPTKSEIRWLQAIGRGLRTAPGKESCLIFDHSGTVHRLGYP 334
Query: 371 TAVRTWTLKGREKGQSSGSDTVPQ-------KTCDACTIPYEAYNKACPYCGH 416
++ L G+ G + Q + C C A CP CGH
Sbjct: 335 DSIEYDDLPGKSDGMEESARRSSQERAEKLPRECSQCHYMKPAGVYVCPKCGH 387
>ref|NP_231273.1| (NC_002505) helicase-related protein [Vibrio cholerae]
pir||B82176 helicase-related protein VC1636 [imported] - Vibrio cholerae (group
O1 strain N16961)
gb|AAF94787.1| (AE004241) helicase-related protein [Vibrio cholerae]
Length = 657
Score = 110 bits (274), Expect = 5e-023
Identities = 111/441 (25%), Positives = 186/441 (42%), Gaps = 74/441 (16%)
Query: 1 MFKLRDYQQLAYNRIMAAWQEYRS-VLAVLPTGAGKTVIFSKIIHDHVGAAAAIVHRREI 59
M+ LR YQ + ++ ++++ + + VLPTGAGK+++ +++ G + H +E+
Sbjct: 64 MYTLRPYQADSVKAVVHYFRQHSTPAVIVLPTGAGKSLVIAELARLARGRVLVLAHVKEL 123
Query: 60 VGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKSTM 119
V Q +G+ G+ DQ Q SVQ++
Sbjct: 124 VEQNHAKYEGYGL-------------TGAIFSAGLGRKETDQ--QVVFASVQSVVRN--- 165
Query: 120 NDATIMRWVKQVTLSVYDEGHHYVEQ--GIWAKAV-HLME---NAKLLFVTATPERADXX 173
+ + Q +L V DE H + + K + HL+E K+L +TATP R
Sbjct: 166 ----LSEFQNQFSLLVIDECHRVPDDKNSSYQKVITHLLELNPGMKVLGLTATPYRLGMG 221
Query: 174 XXXXXXXX----------FAQTMIEGPTTHWLIEQGFLSKF-VYRAPQTDIDLRDLAVGK 222
F + E P H+L+++GFL+ + P D L
Sbjct: 222 WIYQYHTRGLVRSEEPRFFRDCIFELPI-HYLLDEGFLTPAQLIDTPVMSYDFSQLKPAN 280
Query: 223 NGDFNAKALKARVVESH-----IVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKA 277
G + L + +S IV +++ + + ++FA V AE++
Sbjct: 281 TGRYRESELDLVIEQSQRATPQIVAQIIELAKA---RQGVMIFAATVRHAEEIYRLLPTE 337
Query: 278 GYTAAAVSGETESGERDRLLEEFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAK 337
AA V G+T + ERDR+++ F+Q I+ L+NV + GFD P V+ + + RPT+S++
Sbjct: 338 H--AALVIGDTPTPERDRIIQAFKQQQIKFLVNVSVLTTGFDAPHVDLIAILRPTESVSL 395
Query: 338 FLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRTWTLKGREKGQSSGSDTVPQKTC 397
+ Q+VGR LR+ GK +++D N + L E G D P
Sbjct: 396 YQQIVGRGLRLAAGKSDCLVLDYAGN-----------QYDLYQPEVG-----DPKPDSDS 439
Query: 398 DACTIPYEAYNKACPYCGHVN 418
+ TIP CP CG N
Sbjct: 440 EIITIP-------CPACGFNN 453
>ref|NP_142208.1| (NC_000961) hypothetical protein [Pyrococcus horikoshii]
pir||H71243 probable helicase protein PH0210 - Pyrococcus horikoshii
dbj|BAA29279.1| (AP000001) 444aa long hypothetical protein [Pyrococcus horikoshii]
Length = 444
Score = 77.0 bits (188), Expect = 5e-013
Identities = 91/359 (25%), Positives = 154/359 (42%), Gaps = 37/359 (10%)
Query: 2 FKLRDYQQLAYNRIMAAWQEYRSVLAVLPTGAGKTVIFSKIIHDHVGAAAAIVHRREIVG 61
+KLR YQ+ A + A +E VLA LP G+GKTV+ +IIH+ +A IVH +E++
Sbjct: 93 YKLRKYQKKA---VKLAIKEKMGVLA-LPVGSGKTVVGLRIIHEINKSALVIVHTKELLY 148
Query: 62 QISLSLASF-GVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKSTMN 120
Q + + GV+ V +QTL S+ T
Sbjct: 149 QWANKVREILGVEPGIIGDNKWS------------------EGPITVAMIQTLLSRGTD- 189
Query: 121 DATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXXXX 180
+ + + ++DE H + K + ++ATP R
Sbjct: 190 -----KLQNKYAIVMFDECHRTSAAEKFYKVGISLPQVYRFGLSATPWRRLRGEEMKIEG 244
Query: 181 XFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVESHI 240
+ E LI++GFL+K + + D + LA K + +A+ +
Sbjct: 245 VVGPIIYEVKAED-LIKEGFLAKPKFEVIEYDSKMPALA-DKYKELYEEAVMENEERNRA 302
Query: 241 VGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEEF 300
+ + K G R ++ +D E + + R G A +S ++S R +LE++
Sbjct: 303 IVEKAIELAKQGH--RVLIDVKRIDHGEILVKMLRDRGINAEFLS--SQSPNRWEILEKY 358
Query: 301 EQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIID 359
++G I +L++ L EG D+P + +ILA +S +Q +GRALR G E A+I+D
Sbjct: 359 KKGEIPVLVST-LLKEGVDIPEISAIILAGGGKSDVMTIQTIGRALRPKPGGE-AVIVD 415
>gb|AAF12716.1|AF066865_14 (AF066865) unknown [bacteriophage TPW22]
Length = 205
Score = 65.9 bits (159), Expect = 1e-009
Identities = 46/136 (33%), Positives = 67/136 (48%), Gaps = 8/136 (5%)
Query: 137 DEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXXXXXFAQTMIEGPTTHWLI 196
DE HH ++ + K + NA LF T TP R + A +I G + WL
Sbjct: 78 DEAHH-IKANSYKKILEAFPNALKLFFTGTPIRLNGQGFED----MADDLITGKSIKWLQ 132
Query: 197 EQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKR 256
E G ++ F Y AP ID L +GDF K++ + I GDV+ HY K + K+
Sbjct: 133 EHGNIAPFKYYAPNI-IDTSQLKK-TSGDFTQKSID-EAFKRAIYGDVIAHYNKLSKGKQ 189
Query: 257 AIVFATDVDTAEQMAE 272
AI +A +V TA+ ++E
Sbjct: 190 AICYAHNVATAQHISE 205
>ref|NP_559418.1| (NC_003364) helicase, possible DNA repair rad25 [Pyrobaculum
aerophilum]
gb|AAL63600.1| (AE009830) helicase, possible DNA repair rad25 [Pyrobaculum
aerophilum]
Length = 662
Score = 65.1 bits (157), Expect = 2e-009
Identities = 63/242 (26%), Positives = 104/242 (42%), Gaps = 32/242 (13%)
Query: 133 LSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXXXX-------XFAQT 185
L ++DE HH + A+ L E+ L ++ATPER D ++
Sbjct: 387 LVIFDEAHHVPAETFKEVALSL-ESPFRLALSATPEREDQNEHLIYEAVGPPVYRASYRS 445
Query: 186 MIEG----PTTHWLIEQGFL--SKFVYRA-PQTD-IDLRDLAVGKNGDFNAKALKARVVE 237
MIE P H+ I + +Y + PQ + I LR+ A AKA K +
Sbjct: 446 MIEAGLVVPVEHYRIYVRMTREEEALYASLPQNNAIVLRNAA--------AKASKKIPIA 497
Query: 238 SHIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLL 297
+V V + +VF +D AE++ + R++G A ++ +E G R+
Sbjct: 498 VRVVVQEV------ALGSKVLVFTQFIDQAEELYKRLRESGIAAELIT--SEEGNREAAF 549
Query: 298 EEFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAII 357
F G ++++ + DEG DVP E ++ T S + +Q GR +R GK A +
Sbjct: 550 RRFSNGLSKVVVTTTVLDEGVDVPDAEVAVIVSGTGSRRQMIQRAGRVVRATAGKRAARV 609
Query: 358 ID 359
+
Sbjct: 610 YE 611
>ref|NP_248512.1| (NC_000909) ATP dependent RNA helicase, putative [Methanococcus
jannaschii]
sp|Q58900|YF05_METJA Putative ATP-dependent RNA helicase MJ1505
gb|AAB99518.1| (U67591) ATP dependent RNA helicase, putative [Methanococcus
jannaschii]
Length = 778
Score = 55.1 bits (131), Expect = 2e-006
Identities = 35/124 (28%), Positives = 63/124 (50%), Gaps = 8/124 (6%)
Query: 241 VGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESG-------ER 293
V D+V++ + +++R I+FA DT E++ + G A G+ E+
Sbjct: 339 VVDMVKNILEKNKDERIIIFAQYRDTVEKIVNLLTQNGIKAIRFIGQANKEGKGMSQKEQ 398
Query: 294 DRLLEEFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKE 353
+E F++ +L++ + +EG D+P+V +I P S +F+Q GRA+R GK
Sbjct: 399 IEAIERFKKEG-SVLVSTSVSEEGIDIPSVNYIIFYEPVPSEIRFIQRRGRAMRGEGGKV 457
Query: 354 YAII 357
Y +I
Sbjct: 458 YVLI 461
>sp|O02494|IF4A_CRYPV Eukaryotic initiation factor 4A (eIF-4A) (eIF4A)
gb|AAB58726.1| (AF001211) translation initiation factor [Cryptosporidium parvum]
gb|AAB58799.1| (AF001378) translation initiation factor [Cryptosporidium parvum]
Length = 405
Score = 50.1 bits (118), Expect = 6e-005
Identities = 29/117 (24%), Positives = 59/117 (49%), Gaps = 5/117 (4%)
Query: 256 RAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEEFEQGTIQILINVDLFD 315
+AI++ +Q+ ++ R+ +T +++ G+ + +R+ ++ +F G+ ++LI DL
Sbjct: 272 QAIIYCNTRRRVDQLTKQMRERDFTCSSMHGDMDQKDREVIMRQFRSGSSRVLITTDLLA 331
Query: 316 EGFDVPAVECVILARPTQSLAKFLQMVGRALRI-MEGKEYAIIIDP----VRNWERH 367
G DV V VI S ++ +GR+ R +G + D +R+ ERH
Sbjct: 332 RGIDVQQVSLVINYDLPVSPETYIHRIGRSGRFGKKGVSINFVTDDDIVCLRDIERH 388
>ref|NP_421775.1| (NC_002696) excinuclease ABC, subunit B [Caulobacter crescentus]
gb|AAK24943.1| (AE005961) excinuclease ABC, subunit B [Caulobacter crescentus]
Length = 749
Score = 47.8 bits (112), Expect = 3e-004
Identities = 59/275 (21%), Positives = 107/275 (38%), Gaps = 29/275 (10%)
Query: 241 VGDVVQHYRKW-GENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEE 299
V DVV R+ + R +V AE + E + G + + ++ ER ++ +
Sbjct: 478 VDDVVDEIRQTIQKGYRTLVTVLTKKMAEDLTEYLTEQGIRVRYMHSDVDTIERIEIIRD 537
Query: 300 FEQGTIQILINVDLFDEGFDVPAVECVILA-------RPTQSLAKFLQMVGRALRIMEGK 352
G +L+ ++L EG D+P EC ++A +S +Q +GRA R ++GK
Sbjct: 538 LRLGHFDVLVGINLLREGLDIP--ECGLVAILDADKEGFLRSETSLIQTIGRAARNVDGK 595
Query: 353 EYAIIIDPVRNWERHGPPTAVRTWTLKGREKGQSSGSDTVPQKTCDACTIPYEAYNKAC- 411
+ ER TA R ++V + D PYE ++
Sbjct: 596 VILYADRVTGSMERAMAETARRREKQHAYNLEHGITPESVKRDIKDILNSPYERGDRVLV 655
Query: 412 ----------PYCGHVNVYPDRSAPDKVDGDLVELDLDAWNALFAEIERAKMSDDDYELD 461
P+ G + K+ L+ + L EI+R K+ D ++ +
Sbjct: 656 PMGMSETDDRPFSGDNFKAALKDLEAKMREAAANLEFETAARLRDEIKRMKLMDLEFANE 715
Query: 462 M-------IRRGVPQIGRGQELRRHRAAKYRREVL 489
+ + + +P+ R ELR +A +R+ L
Sbjct: 716 VLTAPGEAVDKAMPKRVRA-ELRAEQAEAFRKSRL 749
>ref|NP_375993.1| (NC_003106) 621aa long conserved hypothetical protein [Sulfolobus
tokodaii]
dbj|BAB65102.1| (AP000981) 621aa long conserved hypothetical protein [Sulfolobus
tokodaii]
Length = 621
Score = 43.5 bits (101), Expect = 0.006
Identities = 29/107 (27%), Positives = 50/107 (46%), Gaps = 14/107 (13%)
Query: 255 KRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEEFEQGTIQILINVDLF 314
K A+VF ++AE +AE RK A RD+++E F +G +++L++
Sbjct: 215 KNALVFVRSRNSAETLAETLRKFSIPALPYHSGLPYDVRDKVIESFMKGEVRVLVSTTAL 274
Query: 315 DEGFDVPAVECVI--LARPTQ------------SLAKFLQMVGRALR 347
+G ++P V ++ P S+A+F Q+ GRA R
Sbjct: 275 GQGVNLPVYATVFYDISLPDSDDKGEFKGWRDLSVAEFKQIAGRAGR 321
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.321 0.136 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 278,304,758
Number of Sequences: 884779
Number of extensions: 10979687
Number of successful extensions: 26850
Number of sequences better than 5.0e-02: 632
Number of HSP's better than 0.1 without gapping: 488
Number of HSP's successfully gapped in prelim test: 144
Number of HSP's that attempted gapping in prelim test: 25932
Number of HSP's gapped (non-prelim): 966
length of query: 550
length of database: 277,083,050
effective HSP length: 125
effective length of query: 425
effective length of database: 166,485,675
effective search space: 70756411875
effective search space used: 70756411875
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 93 (40.4 bits)
Query= orf47 [13233-13402]
(168 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_416862.1| (NC_000913) orf, hypothetical protein [Esc... 96 1e-019
sp|P76514|YFDR_ECOLI Hypothetical protein yfdR 96 1e-019
ref|NP_420264.1| (NC_002696) conserved hypothetical protein... 58 4e-008
ref|NP_531749.1| (NC_003304) conserved hypothetical protein... 54 6e-007
ref|NP_354075.1| (NC_003062) AGR_C_1942p [Agrobacterium tum... 54 6e-007
dbj|BAA34643.1| (AB020211) unnamed protein product [Roseoba... 54 8e-007
ref|NP_108031.1| (NC_002678) hypothetical protein [Mesorhiz... 53 1e-006
ref|NP_385209.1| (NC_003047) CONSERVED HYPOTHETICAL PROTEIN... 52 3e-006
>ref|NP_416862.1| (NC_000913) orf, hypothetical protein [Escherichia coli K12]
pir||F65009 hypothetical protein b2361 - Escherichia coli (strain K-12)
gb|AAC75420.1| (AE000324) orf, hypothetical protein [Escherichia coli K12]
Length = 187
Score = 95.9 bits (237), Expect = 1e-019
Identities = 53/116 (45%), Positives = 73/116 (62%), Gaps = 2/116 (1%)
Query: 19 DKFDMPLATIAYHLSHINRFNGAVGQ-YSVAQHCVQVAALLPQDLKLAGLLHDATEAVLC 77
+K D+ + IA LS+I RF G + YSVAQH V + L+PQ+ L+HDATEA
Sbjct: 27 NKDDIVINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQ 86
Query: 78 DVPAPLKRMLPDYQRIEERLQDAVDRRFNV-KTRHKRVREADLSMLAAEARDFNLD 132
D+PAPLKR+LPDY+R+EE++ + ++ + V+ ADL MLA E RD LD
Sbjct: 87 DIPAPLKRLLPDYKRMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLD 142
>sp|P76514|YFDR_ECOLI Hypothetical protein yfdR
Length = 178
Score = 95.9 bits (237), Expect = 1e-019
Identities = 53/116 (45%), Positives = 73/116 (62%), Gaps = 2/116 (1%)
Query: 19 DKFDMPLATIAYHLSHINRFNGAVGQ-YSVAQHCVQVAALLPQDLKLAGLLHDATEAVLC 77
+K D+ + IA LS+I RF G + YSVAQH V + L+PQ+ L+HDATEA
Sbjct: 18 NKDDIVINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQ 77
Query: 78 DVPAPLKRMLPDYQRIEERLQDAVDRRFNV-KTRHKRVREADLSMLAAEARDFNLD 132
D+PAPLKR+LPDY+R+EE++ + ++ + V+ ADL MLA E RD LD
Sbjct: 78 DIPAPLKRLLPDYKRMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLD 133
>ref|NP_420264.1| (NC_002696) conserved hypothetical protein [Caulobacter crescentus]
gb|AAK23432.1| (AE005819) conserved hypothetical protein [Caulobacter crescentus]
Length = 250
Score = 57.8 bits (138), Expect = 4e-008
Identities = 44/121 (36%), Positives = 69/121 (56%), Gaps = 16/121 (13%)
Query: 22 DMPLATIAYHLSHINRFNG-AVGQ--YSVAQHCV---QVAALLPQDL----KLAGLLHDA 71
D+ + IA+ L+ + R+NG VG +SVAQH + ++AA + DL +LA LLHDA
Sbjct: 68 DIEIEDIAHGLARVARWNGQTVGDHGFSVAQHSLVVEEIAAHIKPDLEPRWRLAALLHDA 127
Query: 72 TEAVLCDVPAPLKRML-PDYQRIEERLQDAVDRRFNVKTR-----HKRVREADLSMLAAE 125
+E V+ D+ +P K L Y+ E RL+DA+ RF + + K +++AD + E
Sbjct: 128 SEYVIGDMISPFKAALGVSYKDFETRLEDAIHIRFGLPVKTPTPIKKLIKQADRACAFFE 187
Query: 126 A 126
A
Sbjct: 188 A 188
>ref|NP_531749.1| (NC_003304) conserved hypothetical protein [Agrobacterium
tumefaciens str. C58 (U. Washington)]
gb|AAL42065.1| (AE009069) conserved hypothetical protein [Agrobacterium
tumefaciens str. C58 (U. Washington)]
Length = 207
Score = 53.9 bits (128), Expect = 6e-007
Identities = 39/124 (31%), Positives = 68/124 (54%), Gaps = 18/124 (14%)
Query: 22 DMPLATIAYHLSHINRFNGAVG---QYSVAQHCVQVAALL--------PQDLKLAGLLHD 70
D+ +A IA+ L+ + R+NG ++VAQHC+ V + P ++++A LLHD
Sbjct: 28 DVEIADIAHGLARVARWNGQTRGDHAFTVAQHCLIVETIFCRMCPGATPDEMQMA-LLHD 86
Query: 71 ATEAVLCDVPAPLKRML-PDYQRIEERLQDAVDRRFNV-----KTRHKRVREADLSMLAA 124
A E V+ D+ +P K ++ Y+ +E+RL+ AV RF + + R+++AD
Sbjct: 87 APEYVIGDMISPFKSVVGGGYKTVEKRLEAAVHLRFGLPPHASRELKDRIKKADTVAAFF 146
Query: 125 EARD 128
EA +
Sbjct: 147 EATE 150
>ref|NP_354075.1| (NC_003062) AGR_C_1942p [Agrobacterium tumefaciens] [Agrobacterium
tumefaciens str. C58 (Cereon)]
gb|AAK86860.1| (AE008035) AGR_C_1942p [Agrobacterium tumefaciens str. C58
(Cereon)]
Length = 226
Score = 53.9 bits (128), Expect = 6e-007
Identities = 39/124 (31%), Positives = 68/124 (54%), Gaps = 18/124 (14%)
Query: 22 DMPLATIAYHLSHINRFNGAVG---QYSVAQHCVQVAALL--------PQDLKLAGLLHD 70
D+ +A IA+ L+ + R+NG ++VAQHC+ V + P ++++A LLHD
Sbjct: 47 DVEIADIAHGLARVARWNGQTRGDHAFTVAQHCLIVETIFCRMCPGATPDEMQMA-LLHD 105
Query: 71 ATEAVLCDVPAPLKRML-PDYQRIEERLQDAVDRRFNV-----KTRHKRVREADLSMLAA 124
A E V+ D+ +P K ++ Y+ +E+RL+ AV RF + + R+++AD
Sbjct: 106 APEYVIGDMISPFKSVVGGGYKTVEKRLEAAVHLRFGLPPHASRELKDRIKKADTVAAFF 165
Query: 125 EARD 128
EA +
Sbjct: 166 EATE 169
>dbj|BAA34643.1| (AB020211) unnamed protein product [Roseobacter denitrificans]
Length = 195
Score = 53.5 bits (127), Expect = 8e-007
Identities = 35/97 (36%), Positives = 53/97 (54%), Gaps = 11/97 (11%)
Query: 22 DMPLATIAYHLSHINRFNGAVG---QYSVAQHCVQVAALL-------PQDLKLAGLLHDA 71
D+ + IA L+ + R+NG YSVA+H + V L P +LA LLHDA
Sbjct: 23 DVEIEDIARGLAFVARWNGQTSGAHAYSVAEHSLLVETLFTRIADKSPAKWRLAALLHDA 82
Query: 72 TEAVLCDVPAPLKRML-PDYQRIEERLQDAVDRRFNV 107
E V+ D+ +P+K + P Y+ ++ERL A+ RF +
Sbjct: 83 PEYVIGDMISPVKAAVGPGYEALDERLTAAIHLRFGL 119
>ref|NP_108031.1| (NC_002678) hypothetical protein [Mesorhizobium loti]
dbj|BAB54176.1| (AP003012) hypothetical protein [Mesorhizobium loti]
Length = 211
Score = 53.1 bits (126), Expect = 1e-006
Identities = 37/97 (38%), Positives = 58/97 (59%), Gaps = 11/97 (11%)
Query: 22 DMPLATIAYHLSHINRFNGAVG---QYSVAQHCVQVAAL----LPQDL---KLAGLLHDA 71
D+ ++ IA+ L+ + R+NG +SVAQH + V AL +PQ +LA LLHDA
Sbjct: 31 DIEISDIAHGLARVARWNGQTSGEHAFSVAQHSLLVEALFNDLVPQASATDRLAALLHDA 90
Query: 72 TEAVLCDVPAPLKRML-PDYQRIEERLQDAVDRRFNV 107
E V+ D+ +P K ++ Y+ E RLQ A+ +RF++
Sbjct: 91 PEYVIGDMISPFKSVMGGSYKDCELRLQRAIHQRFSL 127
>ref|NP_385209.1| (NC_003047) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti]
emb|CAC45682.1| (AL591786) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti]
Length = 202
Score = 51.6 bits (122), Expect = 3e-006
Identities = 36/112 (32%), Positives = 63/112 (56%), Gaps = 15/112 (13%)
Query: 22 DMPLATIAYHLSHINRFNGAVG---QYSVAQHCVQVAALLPQDLK------LAGLLHDAT 72
D+ L+ IA+ L+ + R+NG +SVAQH + V + + + L LLHDA
Sbjct: 25 DVELSDIAHGLARVARWNGQTSGDHAFSVAQHSLVVEEIFRRTNRCDAQDCLMALLHDAP 84
Query: 73 EAVLCDVPAPLKRML-PDYQRIEERLQDAVDRRFNV-----KTRHKRVREAD 118
E V+ D+ +P K ++ Y+ +E+RL++AV RF + + +R+++AD
Sbjct: 85 EYVIGDMISPFKAVVGGGYKTVEKRLENAVHLRFGLPAHTPRELKERIKKAD 136
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.324 0.139 2.98e-071
Gapped
Lambda K H
0.267 0.0410 -1.21e-305
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 75,839,941
Number of Sequences: 884779
Number of extensions: 2768435
Number of successful extensions: 6282
Number of sequences better than 5.0e-02: 11
Number of HSP's better than 0.1 without gapping: 2
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 6272
Number of HSP's gapped (non-prelim): 11
length of query: 168
length of database: 277,083,050
effective HSP length: 111
effective length of query: 57
effective length of database: 178,872,581
effective search space: 10195737117
effective search space used: 10195737117
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 86 (37.7 bits)
Query= orf48 [13402-13573]
(170 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.319 0.133 -1.21e-305
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 92,549,307
Number of Sequences: 884779
Number of extensions: 3526080
Number of successful extensions: 9240
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 9240
Number of HSP's gapped (non-prelim): 0
length of query: 170
length of database: 277,083,050
effective HSP length: 111
effective length of query: 59
effective length of database: 178,872,581
effective search space: 10553482279
effective search space used: 10553482279
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 86 (37.7 bits)
Query= orf49 [13588-13700]
(111 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_335121.1| (NC_002755) transcriptional regulator, put... 48 2e-005
ref|NP_215195.1| (NC_000962) hypothetical protein Rv0681 [M... 46 6e-005
emb|CAC95193.1| (AJ416905) hypothetical transcriptional reg... 42 9e-004
ref|NP_266283.1| (NC_002662) transcription regulator [Lacto... 42 0.001
pir||T34657 probable transcription regulator - Streptomyces... 41 0.002
ref|NP_241219.1| (NC_002570) BH0353~unknown conserved prote... 40 0.003
ref|NP_252662.1| (NC_002516) probable transcriptional regul... 39 0.007
ref|NP_149192.1| (NC_001988) HTH transcriptional regulator ... 39 0.009
ref|NP_355336.1| (NC_003062) AGR_C_4325p [Agrobacterium tum... 39 0.012
ref|NP_251575.1| (NC_002516) probable transcriptional regul... 38 0.021
emb|CAC03642.1| (AL391338) putative TetR-family transcripti... 37 0.027
ref|NP_534677.1| (NC_003305) transcriptional regulator, Tet... 37 0.027
ref|NP_511232.1| (NC_003292) tetracycline repressor protein... 37 0.027
pir||RPECR1 tet repressor protein (Tn 1721) - Escherichia c... 37 0.047
>ref|NP_335121.1| (NC_002755) transcriptional regulator, putative [Mycobacterium
tuberculosis CDC1551]
gb|AAK44935.1| (AE006965) transcriptional regulator, putative [Mycobacterium
tuberculosis CDC1551]
Length = 218
Score = 47.8 bits (112), Expect = 2e-005
Identities = 34/101 (33%), Positives = 51/101 (49%), Gaps = 8/101 (7%)
Query: 2 AKKITKRPAPAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLK 61
A K RPA R E IV+ L R GW+S+T A+A +G PSL H ++ L+
Sbjct: 19 AVKSVARPAKLSR-ESIVEGALTFLDREGWDSLTINALATQLGTKGPSLYNHVDSLEDLR 77
Query: 62 RAIVRAAVRANDGLVVAM-AVQAGDWSRDKTDELLYIRGKH 101
RA+ +R D ++ + V AG R + D +L + G +
Sbjct: 78 RAV---RIRVIDDIITMLNRVGAG---RARDDAVLVMAGAY 112
>ref|NP_215195.1| (NC_000962) hypothetical protein Rv0681 [Mycobacterium tuberculosis
H37Rv]
pir||B70827 hypothetical protein Rv0681 - Mycobacterium tuberculosis (strain
H37RV)
emb|CAA17464.1| (AL021943) hypothetical protein Rv0681 [Mycobacterium tuberculosis
H37Rv]
Length = 196
Score = 46.2 bits (108), Expect = 6e-005
Identities = 32/95 (33%), Positives = 49/95 (50%), Gaps = 8/95 (8%)
Query: 8 RPAPAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRA 67
RPA R E IV+ L R GW+S+T A+A +G PSL H ++ L+RA+
Sbjct: 3 RPAKLSR-ESIVEGALTFLDREGWDSLTINALATQLGTKGPSLYNHVDSLEDLRRAV--- 58
Query: 68 AVRANDGLVVAM-AVQAGDWSRDKTDELLYIRGKH 101
+R D ++ + V AG R + D +L + G +
Sbjct: 59 RIRVIDDIITMLNRVGAG---RARDDAVLVMAGAY 90
>emb|CAC95193.1| (AJ416905) hypothetical transcriptional regulator [Rhizobium
leguminosarum]
Length = 216
Score = 42.4 bits (98), Expect = 9e-004
Identities = 20/64 (31%), Positives = 38/64 (59%)
Query: 2 AKKITKRPAPAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLK 61
AK + + ++EQI++A L V +G+ T +AE G++ P+L+Y++RT +
Sbjct: 7 AKTLRRTRIQEEKEEQILEAALDVFSASGFRGSTIDQIAEVAGMSKPNLLYYFRTKEAMH 66
Query: 62 RAIV 65
RA++
Sbjct: 67 RALI 70
>ref|NP_266283.1| (NC_002662) transcription regulator [Lactococcus lactis subsp.
lactis]
gb|AAK04225.1|AE006251_1 (AE006251) transcription regulator [Lactococcus lactis subsp.
lactis]
Length = 191
Score = 41.6 bits (96), Expect = 0.001
Identities = 18/52 (34%), Positives = 36/52 (68%)
Query: 12 AVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRA 63
A+ +++IVQA + +A + G ++++ +AE+ GI APSL H++ M +++ A
Sbjct: 5 ALTRDKIVQATIELAGKIGLSNVSFPRLAEYFGIKAPSLYNHFKNMEEVRVA 56
>pir||T34657 probable transcription regulator - Streptomyces coelicolor
emb|CAA22378.1| (AL034446) putative transcriptional regulator [Streptomyces
coelicolor A3(2)]
Length = 189
Score = 41.2 bits (95), Expect = 0.002
Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 3/72 (4%)
Query: 9 PAPAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRAA 68
P + +++V A +A TG+ES+T A+A H G+ SL H R + L+ VR A
Sbjct: 2 PRVGLTTDRVVAAAADLADETGFESVTVSALARHFGVKDASLYTHVRNLQDLR---VRVA 58
Query: 69 VRANDGLVVAMA 80
+ A L+ +A
Sbjct: 59 LLAGGELIEEIA 70
>ref|NP_241219.1| (NC_002570) BH0353~unknown conserved protein [Bacillus
halodurans]
dbj|BAB04072.1| (AP001508) BH0353~unknown conserved protein [Bacillus halodurans]
Length = 197
Score = 40.4 bits (93), Expect = 0.003
Identities = 22/73 (30%), Positives = 39/73 (53%)
Query: 13 VRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRAAVRAN 72
V KE++++A G S+T K+VAE G+ ++ YH+R+ QL IV + +
Sbjct: 7 VTKEELIEAAKRCIVEKGIHSLTLKSVAEGAGVTQGTVYYHFRSKDQLMLEIVESMCTSA 66
Query: 73 DGLVVAMAVQAGD 85
+ +M+ Q G+
Sbjct: 67 WHSLESMSKQEGE 79
>ref|NP_252662.1| (NC_002516) probable transcriptional regulator [Pseudomonas
aeruginosa]
pir||C83149 probable transcription regulator PA3973 [imported] - Pseudomonas
aeruginosa (strain PAO1)
gb|AAG07360.1|AE004815_4 (AE004815) probable transcriptional regulator [Pseudomonas
aeruginosa]
Length = 215
Score = 39.3 bits (90), Expect = 0.007
Identities = 27/82 (32%), Positives = 44/82 (52%), Gaps = 3/82 (3%)
Query: 1 MAKKITKR--PAPAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMT 58
M ++T+R + +E+I+Q GL G+ ++T +A+A+ GIA SL H+R
Sbjct: 1 MVYRVTERRLQRDSALRERILQLGLRRVVEGGFAALTMQALADDAGIATGSLYRHFRGKG 60
Query: 59 QLKRAIV-RAAVRANDGLVVAM 79
+L I RA+ R D L V +
Sbjct: 61 ELAAEIFRRASQREVDALAVVL 82
>ref|NP_149192.1| (NC_001988) HTH transcriptional regulator TetR family
[Clostridium acetobutylicum]
gb|AAK76774.1|AE001438_27 (AE001438) HTH transcriptional regulator TetR family [Clostridium
acetobutylicum]
Length = 187
Score = 38.9 bits (89), Expect = 0.009
Identities = 17/47 (36%), Positives = 29/47 (61%)
Query: 15 KEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLK 61
KE+I+Q +A G + +T++ +AE + I +PSL H+ M+ LK
Sbjct: 8 KEKIIQIAFSLADEIGLDKVTFQKLAEKLDIKSPSLYNHFSNMSDLK 54
>ref|NP_355336.1| (NC_003062) AGR_C_4325p [Agrobacterium tumefaciens]
[Agrobacterium tumefaciens str. C58 (Cereon)]
ref|NP_533056.1| (NC_003304) transcriptional regulator, TetR family [Agrobacterium
tumefaciens str. C58 (U. Washington)]
gb|AAK88121.1| (AE008152) AGR_C_4325p [Agrobacterium tumefaciens str. C58
(Cereon)]
gb|AAL43372.1| (AE009186) transcriptional regulator, TetR family [Agrobacterium
tumefaciens str. C58 (U. Washington)]
Length = 213
Score = 38.5 bits (88), Expect = 0.012
Identities = 16/52 (30%), Positives = 32/52 (60%)
Query: 14 RKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIV 65
++E I++A L V G+ T +AE G++ P+++Y++RT + RA++
Sbjct: 19 KQETILEAALSVFSTNGFRGSTIDQIAEAAGMSKPNVLYYFRTKEAMHRALI 70
>ref|NP_251575.1| (NC_002516) probable transcriptional regulator [Pseudomonas
aeruginosa]
pir||C83286 probable transcriptional regulator PA2885 [imported] -
Pseudomonas aeruginosa (strain PAO1)
gb|AAG06273.1|AE004714_10 (AE004714) probable transcriptional regulator [Pseudomonas
aeruginosa]
Length = 198
Score = 37.7 bits (86), Expect = 0.021
Identities = 18/75 (24%), Positives = 39/75 (52%)
Query: 11 PAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRAAVR 70
P + +++Q + R G+E T + +A VGI + S+ +H+++ ++ R+++ +
Sbjct: 13 PESARGKLLQTAAHLFRSKGYERTTVRDLASAVGIQSGSIFHHFKSKDEILRSVMEETIL 72
Query: 71 ANDGLVVAMAVQAGD 85
N L+ A A D
Sbjct: 73 YNTALMRAALADAED 87
>emb|CAC03642.1| (AL391338) putative TetR-family transcriptional regulator.
[Streptomyces coelicolor A3(2)]
Length = 190
Score = 37.4 bits (85), Expect = 0.027
Identities = 32/104 (30%), Positives = 51/104 (48%), Gaps = 10/104 (9%)
Query: 4 KITKRPAPAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRA 63
K +R A A R +QI+ A + G ++++ + +AE VGI APSL H+ +A
Sbjct: 2 KSGERSAVADRAQQIMSAARELLDAEGPDALSMRRIAERVGIRAPSLYKHF-----ADKA 56
Query: 64 IVRAAVRANDGLVVAMAVQAGDWSRDKTDELLYIRG----KHAL 103
+ A ++A GL + VQ + D L + G +HAL
Sbjct: 57 ALVAGLQA-QGLTLLAEVQEAAEAEIGADPPLLVLGYAYRRHAL 99
>ref|NP_534677.1| (NC_003305) transcriptional regulator, TetR family [Agrobacterium
tumefaciens str. C58 (U. Washington)]
gb|AAL44993.1| (AE009349) transcriptional regulator, TetR family [Agrobacterium
tumefaciens str. C58 (U. Washington)]
Length = 190
Score = 37.4 bits (85), Expect = 0.027
Identities = 20/72 (27%), Positives = 37/72 (50%), Gaps = 11/72 (15%)
Query: 12 AVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAI------- 64
A R+++I+ A ++A R GW S+T + +A+ + + P L H+ + A+
Sbjct: 12 ADREKRIIAAARMLAERDGWASVTVRRLAQEIEYSQPVLYAHFENRDAIVGAVALEGFGE 71
Query: 65 ----VRAAVRAN 72
+RA+VR N
Sbjct: 72 LGPTLRASVRRN 83
>ref|NP_511232.1| (NC_003292) tetracycline repressor protein [IncN plasmid R46]
sp|P03039|TER3_ECOLI TETRACYCLINE REPRESSOR PROTEIN CLASS C
pir||RPECYS tetracycline repressor - Escherichia coli plasmid pSC101
gb|AAA25677.1| (M36272) tetracycline resistance element repressor protein (tetR)
[Plasmid pSC101]
gb|AAG43219.1| (AY046276) tetracycline repressor protein [IncN plasmid R46]
prf||1101269A repressor tet [Salmonella enterica subsp. enterica serovar
Panama]
Length = 219
Score = 37.4 bits (85), Expect = 0.027
Identities = 15/60 (25%), Positives = 33/60 (55%)
Query: 13 VRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRAAVRAN 72
+++E +++ L + G E +T + +AE +G+ P+L +H++ L A+ A + N
Sbjct: 4 LQREAVIRTALELLNDVGMEGLTTRRLAERLGVQQPALYWHFKNKRALLDALAEAMLTIN 63
>pir||RPECR1 tet repressor protein (Tn 1721) - Escherichia coli plasmid RP1
emb|CAA24908.1| (X00006) tetR regulatory protein [unidentified bacterium]
gb|AAB67685.1| (U75324) TetR [Cloning vector pJB3Tc20]
gb|AAB82322.1| (U82001) TetR [Expression vector pJB866]
Length = 216
Score = 36.6 bits (83), Expect = 0.047
Identities = 19/69 (27%), Positives = 34/69 (48%), Gaps = 2/69 (2%)
Query: 18 IVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRAAVRANDGLVV 77
+++A L + G + +T + +AE +G+ P+L +H+R L A+ A + N V
Sbjct: 9 VIRAALDLLNEVGVDGLTTRKLAERLGVQQPALYWHFRNKRALLDALAEAMLAENHSTSV 68
Query: 78 AMAVQAGDW 86
A DW
Sbjct: 69 PRA--DDDW 75
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.321 0.131 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 53,834,554
Number of Sequences: 884779
Number of extensions: 1672650
Number of successful extensions: 3826
Number of sequences better than 5.0e-02: 27
Number of HSP's better than 0.1 without gapping: 21
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 3805
Number of HSP's gapped (non-prelim): 27
length of query: 111
length of database: 277,083,050
effective HSP length: 87
effective length of query: 24
effective length of database: 200,107,277
effective search space: 4802574648
effective search space used: 4802574648
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 83 (36.6 bits)
Query= orf50 [13719-13803]
(83 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.323 0.137 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 42,149,018
Number of Sequences: 884779
Number of extensions: 1364954
Number of successful extensions: 2024
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2024
Number of HSP's gapped (non-prelim): 0
length of query: 83
length of database: 277,083,050
effective HSP length: 59
effective length of query: 24
effective length of database: 224,881,089
effective search space: 5397146136
effective search space used: 5397146136
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 84 (37.0 bits)
Query= orf51 [13820-13891]
(70 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.318 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 39,140,978
Number of Sequences: 884779
Number of extensions: 1138345
Number of successful extensions: 1923
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 1923
Number of HSP's gapped (non-prelim): 0
length of query: 70
length of database: 277,083,050
effective HSP length: 46
effective length of query: 24
effective length of database: 236,383,216
effective search space: 5673197184
effective search space used: 5673197184
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 84 (37.0 bits)
Query= orf52 [13890-13943]
(52 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.340 0.156 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 23,870,899
Number of Sequences: 884779
Number of extensions: 624806
Number of successful extensions: 3386
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3386
Number of HSP's gapped (non-prelim): 0
length of query: 52
length of database: 277,083,050
effective HSP length: 28
effective length of query: 24
effective length of database: 252,309,238
effective search space: 6055421712
effective search space used: 6055421712
T: 11
A: 40
X1: 15 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf53 [13938-14059]
(120 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.326 0.139 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 53,994,972
Number of Sequences: 884779
Number of extensions: 1833223
Number of successful extensions: 6142
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 6142
Number of HSP's gapped (non-prelim): 0
length of query: 120
length of database: 277,083,050
effective HSP length: 96
effective length of query: 24
effective length of database: 192,144,266
effective search space: 4611462384
effective search space used: 4611462384
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 83 (36.6 bits)
Query= orf54 [14058-14205]
(146 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.327 0.141 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 69,897,528
Number of Sequences: 884779
Number of extensions: 2206116
Number of successful extensions: 4882
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4882
Number of HSP's gapped (non-prelim): 0
length of query: 146
length of database: 277,083,050
effective HSP length: 108
effective length of query: 38
effective length of database: 181,526,918
effective search space: 6898022884
effective search space used: 6898022884
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 85 (37.4 bits)
Query= orf55 [14203-14366]
(162 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.323 0.141 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 84,006,742
Number of Sequences: 884779
Number of extensions: 3123567
Number of successful extensions: 4673
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4673
Number of HSP's gapped (non-prelim): 0
length of query: 162
length of database: 277,083,050
effective HSP length: 110
effective length of query: 52
effective length of database: 179,757,360
effective search space: 9347382720
effective search space used: 9347382720
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 86 (37.7 bits)
Query= orf56 [14371-14438]
(66 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.322 0.141 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 37,685,106
Number of Sequences: 884779
Number of extensions: 1219724
Number of successful extensions: 2261
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2261
Number of HSP's gapped (non-prelim): 0
length of query: 66
length of database: 277,083,050
effective HSP length: 42
effective length of query: 24
effective length of database: 239,922,332
effective search space: 5758135968
effective search space used: 5758135968
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf57 [14438-14502]
(63 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.327 0.139 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 23,620,112
Number of Sequences: 884779
Number of extensions: 438664
Number of successful extensions: 856
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 856
Number of HSP's gapped (non-prelim): 0
length of query: 63
length of database: 277,083,050
effective HSP length: 39
effective length of query: 24
effective length of database: 242,576,669
effective search space: 5821840056
effective search space used: 5821840056
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 84 (37.0 bits)
Query= orf58 [14503-14581]
(77 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.321 0.136 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 39,661,005
Number of Sequences: 884779
Number of extensions: 1249866
Number of successful extensions: 2792
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2792
Number of HSP's gapped (non-prelim): 0
length of query: 77
length of database: 277,083,050
effective HSP length: 53
effective length of query: 24
effective length of database: 230,189,763
effective search space: 5524554312
effective search space used: 5524554312
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf59 [14600-14648]
(47 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.321 0.140 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 23,039,454
Number of Sequences: 884779
Number of extensions: 544599
Number of successful extensions: 1239
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 1239
Number of HSP's gapped (non-prelim): 0
length of query: 47
length of database: 277,083,050
effective HSP length: 23
effective length of query: 24
effective length of database: 256,733,133
effective search space: 6161595192
effective search space used: 6161595192
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf60 [14647-14740]
(92 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.319 0.130 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 47,667,396
Number of Sequences: 884779
Number of extensions: 1565823
Number of successful extensions: 2931
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2931
Number of HSP's gapped (non-prelim): 0
length of query: 92
length of database: 277,083,050
effective HSP length: 68
effective length of query: 24
effective length of database: 216,918,078
effective search space: 5206033872
effective search space used: 5206033872
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf61 [14739-14855]
(115 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.318 0.136 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 53,313,387
Number of Sequences: 884779
Number of extensions: 1819600
Number of successful extensions: 14774
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 14774
Number of HSP's gapped (non-prelim): 0
length of query: 115
length of database: 277,083,050
effective HSP length: 91
effective length of query: 24
effective length of database: 196,568,161
effective search space: 4717635864
effective search space used: 4717635864
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 83 (36.6 bits)
Query= orf62 [14853-14991]
(137 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAC01493.1| (AL391017) putative polypeptide deformylase... 80 2e-015
ref|NP_213407.1| (NC_000918) polypeptide deformylase [Aquif... 69 1e-011
ref|NP_438782.1| (NC_000907) polypeptide deformylase (def) ... 68 1e-011
ref|NP_248709.1| (NC_002516) polypeptide deformylase [Pseud... 68 2e-011
sp|Q9XAQ2|DEF_STRCO Polypeptide deformylase (PDF) (Formylme... 67 2e-011
ref|NP_273168.1| (NC_003112) polypeptide deformylase [Neiss... 67 4e-011
ref|NP_297006.1| (NC_002620) polypeptide deformylase [Chlam... 66 5e-011
ref|NP_219861.1| (NC_000117) Polypeptide Deformylase [Chlam... 66 6e-011
emb|CAB94606.1| (AL359214) putative polypeptide deformylase... 63 5e-010
ref|NP_246498.1| (NC_002663) Def [Pasteurella multocida] >g... 63 5e-010
ref|NP_487119.1| (NC_003272) polypeptide deformylase [Nosto... 62 7e-010
ref|NP_354552.1| (NC_003062) AGR_C_2856p [Agrobacterium tum... 59 8e-009
ref|NP_269070.1| (NC_002737) putative polypeptide deformyla... 57 3e-008
ref|NP_563974.1| (NM_101408) expressed protein [Arabidopsis... 56 5e-008
sp|Q9FV53|DEFM_ARATH Polypeptide deformylase, mitochondrial... 56 5e-008
ref|NP_518191.1| (NC_003295) PROBABLE POLYPEPTIDE DEFORMYLA... 55 1e-007
ref|NP_345996.1| (NC_003028) polypeptide deformylase [Strep... 52 7e-007
ref|NP_360311.1| (NC_003103) polypeptide deformylase [EC:3.... 51 2e-006
ref|NP_229461.1| (NC_000853) polypeptide deformylase [Therm... 50 4e-006
ref|NP_561676.1| (NC_003366) polypeptide deformylase [Clost... 47 4e-005
sp|O08450|DEF_CLOBE Polypeptide deformylase (PDF) (Formylme... 44 3e-004
>emb|CAC01493.1| (AL391017) putative polypeptide deformylase [Streptomyces
coelicolor A3(2)]
Length = 217
Score = 80.5 bits (197), Expect = 2e-015
Identities = 49/132 (37%), Positives = 63/132 (47%), Gaps = 6/132 (4%)
Query: 9 PELHAIAAEVPH-GEDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPTFK---- 63
P LHA AEV G ++ LV D NQVG R+ V CP +
Sbjct: 60 PVLHARCAEVTDFGPELAALVEDLFATMYAAHGVGLAANQVGEAVRVFVYDCPDDEDERH 119
Query: 64 -GCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIAAKGLTA 122
G V+NP + V PEGCLS PG +R ++ VV G+ + +P+T+ G A
Sbjct: 120 LGHVVNPRLVETGGVVVRGPEGCLSLPGLEAGTERYDEAVVTGFTVAGEPVTVRGTGFFA 179
Query: 123 FCLQHEIDHLNG 134
CLQHE DHL G
Sbjct: 180 RCLQHECDHLEG 191
>ref|NP_213407.1| (NC_000918) polypeptide deformylase [Aquifex aeolicus]
sp|O66847|DEF_AQUAE Polypeptide deformylase (PDF) (Formylmethionine deformylase)
pir||C70352 polypeptide deformylase - Aquifex aeolicus
gb|AAC06802.1| (AE000696) polypeptide deformylase [Aquifex aeolicus]
Length = 169
Score = 68.6 bits (166), Expect = 1e-011
Identities = 39/121 (32%), Positives = 61/121 (50%), Gaps = 7/121 (5%)
Query: 22 EDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLR------CPTFKGCVINPIITRHT 75
++VK+L+ D NQ+GV ++V+ P K +INP I +
Sbjct: 25 KEVKNLIRDMFDTMYEAEGVGLAANQIGVPLSVMVIDTSPKEDAPPLKLVLINPEI-KEG 83
Query: 76 DGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIAAKGLTAFCLQHEIDHLNGV 135
+G + EGCLS+PG +V +R KV V + +P+ + +G A QHE+DHL G+
Sbjct: 84 EGKIKYKEGCLSFPGLSVEVERFQKVKVNALNEHGEPVELTLEGFPAIVFQHELDHLKGI 143
Query: 136 T 136
T
Sbjct: 144 T 144
>ref|NP_438782.1| (NC_000907) polypeptide deformylase (def) [Haemophilus influenzae
Rd]
sp|P44786|DEF_HAEIN Polypeptide deformylase (PDF) (Formylmethionine deformylase)
pir||D64082 N-formylmethionylaminoacyl-tRNA deformylase (EC 3.5.1.27) -
Haemophilus influenzae (strain Rd KW20)
gb|AAC22282.1| (U32745) polypeptide deformylase (def) [Haemophilus influenzae Rd]
Length = 169
Score = 68.2 bits (165), Expect = 1e-011
Identities = 40/92 (43%), Positives = 51/92 (54%), Gaps = 4/92 (4%)
Query: 47 QVGVLKRIIVLRCPTFKG---CVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVV 103
QV +L+RII + K +INP I ++G EGCLS PG R+ KV V
Sbjct: 51 QVDILQRIITIDVEGDKQNQFVLINPEILA-SEGETGIEEGCLSIPGFRALVPRKEKVTV 109
Query: 104 EGYDMDWQPITIAAKGLTAFCLQHEIDHLNGV 135
D D + T+ A GL A C+QHEIDHLNG+
Sbjct: 110 RALDRDGKEFTLDADGLLAICIQHEIDHLNGI 141
>ref|NP_248709.1| (NC_002516) polypeptide deformylase [Pseudomonas aeruginosa]
sp|Q9I7A8|DEF_PSEAE Polypeptide deformylase (PDF) (Formylmethionine deformylase)
pir||H83643 polypeptide deformylase PA0019 [imported] - Pseudomonas aeruginosa
(strain PAO1)
gb|AAG03409.1|AE004441_10 (AE004441) polypeptide deformylase [Pseudomonas aeruginosa]
Length = 168
Score = 67.8 bits (164), Expect = 2e-011
Identities = 49/138 (35%), Positives = 61/138 (43%), Gaps = 4/138 (2%)
Query: 1 MKILKDDAPELHAIAAEVPHGED-VKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRC 59
+ IL+ P L IA V +D V+ L+ D QV V KRI+V+
Sbjct: 4 LNILEFPDPRLRTIAKPVEVVDDAVRQLIDDMFETMYEAPGIGLAATQVNVHKRIVVMDL 63
Query: 60 PTFKG---CVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIA 116
K INP T+ EGCLS PG R KV ++ D D P
Sbjct: 64 SEDKSEPRVFINPEFEPLTEDMDQYQEGCLSVPGFYENVDRPQKVRIKALDRDGNPFEEV 123
Query: 117 AKGLTAFCLQHEIDHLNG 134
A+GL A C+QHE DHLNG
Sbjct: 124 AEGLLAVCIQHECDHLNG 141
>sp|Q9XAQ2|DEF_STRCO Polypeptide deformylase (PDF) (Formylmethionine deformylase)
pir||T34626 probable polypetide deformylase - Streptomyces coelicolor
emb|CAB44533.1| (AL078618) putative polypetide deformylase [Streptomyces
coelicolor]
Length = 208
Score = 67.4 bits (163), Expect = 2e-011
Identities = 41/122 (33%), Positives = 56/122 (45%), Gaps = 8/122 (6%)
Query: 21 GEDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPTFKGC-----VINPIITRHT 75
G D+ L+ D NQVGV R+ V CP +G ++NP++
Sbjct: 48 GPDLAALIDDMFRTMYVAEGAGLAANQVGVDLRLFVYDCPDDEGVRHVGHLVNPVLDALD 107
Query: 76 DGH---VYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIAAKGLTAFCLQHEIDHL 132
+ EGCLS PG +A R ++ VV G D D P+ + G A CL HE DH+
Sbjct: 108 PAARRLLDEGEGCLSVPGAVMAVPRPDRAVVRGLDKDGVPLLVEGTGYFARCLAHETDHV 167
Query: 133 NG 134
NG
Sbjct: 168 NG 169
>ref|NP_273168.1| (NC_003112) polypeptide deformylase [Neisseria meningitidis MC58]
ref|NP_283013.1| (NC_003116) polypeptide deformylase [Neisseria meningitidis Z2491]
sp|Q9JQN0|DEF_NEIMA Polypeptide deformylase (PDF) (Formylmethionine deformylase)
pir||E81238 formylmethionine deformylase (EC 3.5.1.31) NMA0164 [imported] -
Neisseria meningitidis (group B strain MD58, group A
strain Z2491)
gb|AAF40569.1| (AE002369) polypeptide deformylase [Neisseria meningitidis MC58]
emb|CAB83478.1| (AL162752) polypeptide deformylase [Neisseria meningitidis Z2491]
Length = 167
Score = 66.6 bits (161), Expect = 4e-011
Identities = 44/139 (31%), Positives = 63/139 (44%), Gaps = 5/139 (3%)
Query: 1 MKILKDDAPELHAIAAEVPH-GEDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRC 59
+ IL+ LH +A V E ++ L+ D QV V +R++V+
Sbjct: 4 LNILQYPDERLHTVAKPVEQVDERIRKLIADMFETMYESRGIGLAATQVDVHERVVVMDL 63
Query: 60 PTFKG---CVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIA 116
+ INP+I DG EGCLS PG R +V VE + + T+
Sbjct: 64 TEDRSEPRVFINPVIVEK-DGETTYEEGCLSVPGIYDTVTRAERVKVEALNEKGEKFTLE 122
Query: 117 AKGLTAFCLQHEIDHLNGV 135
A GL A C+QHE+DHL G+
Sbjct: 123 ADGLLAICVQHELDHLMGI 141
>ref|NP_297006.1| (NC_002620) polypeptide deformylase [Chlamydia muridarum]
sp|Q9PK41|DEF_CHLMU Polypeptide deformylase (PDF) (Formylmethionine deformylase)
pir||C81680 polypeptide deformylase TC0632 [imported] - Chlamydia muridarum
(strain Nigg)
gb|AAF39461.1| (AE002332) polypeptide deformylase [Chlamydia muridarum]
Length = 181
Score = 66.2 bits (160), Expect = 5e-011
Identities = 42/141 (29%), Positives = 65/141 (45%), Gaps = 12/141 (8%)
Query: 7 DAPELHAIAAEVPH-GEDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPT---- 61
D+P L +AA + ++++ LVLD QVG + ++
Sbjct: 9 DSPILRKVAAPIDEITDELRQLVLDMSETMTFYKGVGLAAPQVGHSVALFIMGVEKELDD 68
Query: 62 -------FKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPIT 114
F INP+IT+ ++ VY EGCLS PG R +K+ V ++D QP +
Sbjct: 69 GELIFCDFPKVFINPVITQKSEQLVYGNEGCLSIPGLRGEVARPDKITVTAKNLDGQPFS 128
Query: 115 IAAKGLTAFCLQHEIDHLNGV 135
+ +G A + HE DHL+GV
Sbjct: 129 MTLEGFLARIVMHETDHLHGV 149
>ref|NP_219861.1| (NC_000117) Polypeptide Deformylase [Chlamydia trachomatis]
sp|O84357|DEF_CHLTR Polypeptide deformylase (PDF) (Formylmethionine deformylase)
pir||B71526 probable polypeptide deformylase - Chlamydia trachomatis (serotype
D, strain UW3/Cx)
gb|AAC67948.1| (AE001308) Polypeptide Deformylase [Chlamydia trachomatis]
Length = 181
Score = 65.9 bits (159), Expect = 6e-011
Identities = 43/141 (30%), Positives = 65/141 (45%), Gaps = 12/141 (8%)
Query: 7 DAPELHAIAAEVPH-GEDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPT---- 61
D+P L +AA V ++++ LVLD QVG + ++
Sbjct: 9 DSPILRKVAAPVTEITDELRQLVLDMSETMAFYKGVGLAAPQVGQSISLFIMGVERELED 68
Query: 62 -------FKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPIT 114
F INP+IT+ ++ VY EGCLS PG R +K+ V ++D Q +
Sbjct: 69 GELVFCDFPRVFINPVITQKSEQLVYGNEGCLSIPGLRGEVARPDKITVSAKNLDGQQFS 128
Query: 115 IAAKGLTAFCLQHEIDHLNGV 135
+A +G A + HE DHL+GV
Sbjct: 129 LALEGFLARIVMHETDHLHGV 149
>emb|CAB94606.1| (AL359214) putative polypeptide deformylase [Streptomyces
coelicolor A3(2)]
Length = 216
Score = 62.8 bits (151), Expect = 5e-010
Identities = 46/138 (33%), Positives = 57/138 (40%), Gaps = 15/138 (10%)
Query: 9 PELHAIAAEVPH-GEDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPTFKGC-- 65
P LH +V GE+ + LV D NQ+GV K++ V CP +G
Sbjct: 50 PVLHKECEDVTDFGEEFQQLVADMFASQRTAEGVGLAANQIGVSKKVFVYDCPDDEGVRH 109
Query: 66 ---VINPIIT------RHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIA 116
V NP + R D S EGCLS P R + V G D P+ +
Sbjct: 110 VGVVCNPRLVELPADRRRLDD---SNEGCLSVPTAYAPLARPDYAEVTGQDEKGNPVKVR 166
Query: 117 AKGLTAFCLQHEIDHLNG 134
G A CLQHE DHL G
Sbjct: 167 GTGYFARCLQHETDHLYG 184
>ref|NP_246498.1| (NC_002663) Def [Pasteurella multocida]
sp|P57948|DEF_PASMU Polypeptide deformylase (PDF) (Formylmethionine deformylase)
gb|AAK03643.1| (AE006193) Def [Pasteurella multocida]
Length = 170
Score = 62.8 bits (151), Expect = 5e-010
Identities = 38/117 (32%), Positives = 59/117 (49%), Gaps = 4/117 (3%)
Query: 22 EDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPTFKG---CVINPIITRHTDGH 78
++++++V + QV + +RII + K +INP I + G
Sbjct: 26 DEIREIVDNMFETMYLEEGIGLAATQVNIHQRIITIDVEGTKENQYVLINPEII-DSCGE 84
Query: 79 VYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIAAKGLTAFCLQHEIDHLNGV 135
EGCLS PG R+ KV ++ D + T++A+GL A C+QHEIDHLNG+
Sbjct: 85 TGIEEGCLSLPGFRGFVPRKEKVTIKALDRHGEEYTLSAEGLLAICIQHEIDHLNGI 141
>ref|NP_487119.1| (NC_003272) polypeptide deformylase [Nostoc sp. PCC 7120]
dbj|BAB74778.1| (AP003591) polypeptide deformylase [Nostoc sp. PCC 7120]
Length = 187
Score = 62.4 bits (150), Expect = 7e-010
Identities = 39/119 (32%), Positives = 57/119 (47%), Gaps = 5/119 (4%)
Query: 22 EDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPTFKGC-----VINPIITRHTD 76
++ + L+ D QVG+ K++IV+ C +INP I + +
Sbjct: 39 DETRQLIRDMLQTMYSSDGIGLAAPQVGINKQLIVIDCEPDNPANPPLILINPTIKQVSR 98
Query: 77 GHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIAAKGLTAFCLQHEIDHLNGV 135
+ EGCLS PG + KR V V D + +P T+ A L C+QHE+DHLNGV
Sbjct: 99 EICSAQEGCLSIPGVYMDVKRPEVVEVAYKDENGRPQTLKATDLLGRCIQHEMDHLNGV 157
>ref|NP_354552.1| (NC_003062) AGR_C_2856p [Agrobacterium tumefaciens] [Agrobacterium
tumefaciens str. C58 (Cereon)]
ref|NP_532239.1| (NC_003304) polypeptide deformylase [Agrobacterium tumefaciens str.
C58 (U. Washington)]
gb|AAK87337.1| (AE008078) AGR_C_2856p [Agrobacterium tumefaciens str. C58
(Cereon)]
gb|AAL42555.1| (AE009113) polypeptide deformylase [Agrobacterium tumefaciens str.
C58 (U. Washington)]
Length = 164
Score = 58.9 bits (141), Expect = 8e-009
Identities = 43/127 (33%), Positives = 60/127 (46%), Gaps = 2/127 (1%)
Query: 11 LHAIAAEVPHGED-VKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRC-PTFKGCVIN 68
L I A V +D +++LV D +GVL+R+ VL P IN
Sbjct: 14 LSGICAPVTVFDDHLRELVTDLIDTMRAAPGVGITAAHIGVLQRVFVLELTPGTILTYIN 73
Query: 69 PIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIAAKGLTAFCLQHE 128
P IT H+ + EG +S PG T +R + V V D+ T A+G A C+QHE
Sbjct: 74 PEITSHSPQTMRHVEGSVSMPGFTDEVERPSTVEVRFQDITGAEQTETAEGFHAICIQHE 133
Query: 129 IDHLNGV 135
ID L+G+
Sbjct: 134 IDQLDGI 140
>ref|NP_269070.1| (NC_002737) putative polypeptide deformylase [Streptococcus
pyogenes] [Streptococcus pyogenes M1 GAS]
gb|AAK33791.1| (AE006536) putative polypeptide deformylase [Streptococcus pyogenes
M1 GAS]
Length = 136
Score = 57.0 bits (136), Expect = 3e-008
Identities = 32/92 (34%), Positives = 49/92 (52%), Gaps = 2/92 (2%)
Query: 46 NQVGVLKRIIVLRCPTFKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEG 105
N +G KRI+++ + NP++ G + E CLS G + R +K+ VE
Sbjct: 47 NMIGEQKRIVIVSMGFIDLVMFNPVMVSKK-GIYQTKESCLSLSGYRKTQ-RYDKITVEY 104
Query: 106 YDMDWQPITIAAKGLTAFCLQHEIDHLNGVTI 137
D +W+P ++ GLTA QHE+DHL G+ I
Sbjct: 105 LDHNWRPKRLSLTGLTAQICQHELDHLEGILI 136
>ref|NP_563974.1| (NM_101408) expressed protein [Arabidopsis thaliana]
gb|AAG33973.1|AF250959_1 (AF250959) peptide deformylase-like protein [Arabidopsis thaliana]
Length = 269
Score = 56.2 bits (134), Expect = 5e-008
Identities = 47/156 (30%), Positives = 66/156 (42%), Gaps = 23/156 (14%)
Query: 2 KILKDDAPELHAIAAEVPHGE----DVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVL 57
+I+ P LH A EV GE ++ ++ D Q+GV RIIVL
Sbjct: 82 EIVASGDPVLHEKAREVDPGEIGSERIQKIIDDMIKVMRLAPGVGLAAPQIGVPLRIIVL 141
Query: 58 R--------CPTFK-----------GCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRR 98
P + ++NP++ ++ EGCLS G A +R
Sbjct: 142 EDTKEYISYAPKEEILAQERRHFDLMVMVNPVLKERSNKKALFFEGCLSVDGFRAAVERY 201
Query: 99 NKVVVEGYDMDWQPITIAAKGLTAFCLQHEIDHLNG 134
+VVV GYD + I + A G A LQHE DHL+G
Sbjct: 202 LEVVVTGYDRQGKRIEVNASGWQARILQHECDHLDG 237
>sp|Q9FV53|DEFM_ARATH Polypeptide deformylase, mitochondrial precursor (PDF)
(Formylmethionine deformylase)
gb|AAD39667.1|AC007591_32 (AC007591) Simalar to gi|4377403 Polypeptide Deformylase from
Chlamydia pneumoniae genome gb|AE001687. [Arabidopsis
thaliana]
gb|AAK32873.1|AF361861_1 (AF361861) At1g15390/F9L1_34 [Arabidopsis thaliana]
Length = 259
Score = 56.2 bits (134), Expect = 5e-008
Identities = 47/156 (30%), Positives = 66/156 (42%), Gaps = 23/156 (14%)
Query: 2 KILKDDAPELHAIAAEVPHGE----DVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVL 57
+I+ P LH A EV GE ++ ++ D Q+GV RIIVL
Sbjct: 72 EIVASGDPVLHEKAREVDPGEIGSERIQKIIDDMIKVMRLAPGVGLAAPQIGVPLRIIVL 131
Query: 58 R--------CPTFK-----------GCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRR 98
P + ++NP++ ++ EGCLS G A +R
Sbjct: 132 EDTKEYISYAPKEEILAQERRHFDLMVMVNPVLKERSNKKALFFEGCLSVDGFRAAVERY 191
Query: 99 NKVVVEGYDMDWQPITIAAKGLTAFCLQHEIDHLNG 134
+VVV GYD + I + A G A LQHE DHL+G
Sbjct: 192 LEVVVTGYDRQGKRIEVNASGWQARILQHECDHLDG 227
>ref|NP_518191.1| (NC_003295) PROBABLE POLYPEPTIDE DEFORMYLASE PROTEIN [Ralstonia
solanacearum]
emb|CAD13598.1| (AL646057) PROBABLE POLYPEPTIDE DEFORMYLASE PROTEIN [Ralstonia
solanacearum]
Length = 169
Score = 55.1 bits (131), Expect = 1e-007
Identities = 40/138 (28%), Positives = 60/138 (42%), Gaps = 4/138 (2%)
Query: 1 MKILKDDAPELHAIAAEVPHGED-VKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRC 59
+ IL+ P LH +A V +D ++ LV D QV V +R+I +
Sbjct: 4 LNILQYPDPRLHKVAKPVAVVDDRIRKLVADMAETMYDAPGIGLAATQVDVHERVITIDV 63
Query: 60 PTFKG---CVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIA 116
+ INP I ++ EGCLS P +R ++V V + + +
Sbjct: 64 SESRDELRVFINPEIVWASEARKVWDEGCLSVPDIYDKVERPDRVRVRALNEKGESFELE 123
Query: 117 AKGLTAFCLQHEIDHLNG 134
GL A C+QHE+DHL G
Sbjct: 124 TDGLLAVCIQHEMDHLMG 141
>ref|NP_345996.1| (NC_003028) polypeptide deformylase [Streptococcus pneumoniae
TIGR4]
ref|NP_359001.1| (NC_003098) Formylmethionine deformylase [Streptococcus pneumoniae
R6]
gb|AAK75636.1| (AE007449) polypeptide deformylase [Streptococcus pneumoniae TIGR4]
gb|AAL00212.1| (AE008510) Formylmethionine deformylase [Streptococcus pneumoniae
R6]
gb|AAK13237.1| (AY014508) peptide deformylase-like protein [Streptococcus
pneumoniae]
Length = 136
Score = 52.4 bits (124), Expect = 7e-007
Identities = 31/89 (34%), Positives = 43/89 (47%), Gaps = 2/89 (2%)
Query: 46 NQVGVLKRIIVLRCPTFKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEG 105
N +GV KR+I+ + NP++ +G + EGCLS G + KR + +
Sbjct: 47 NMIGVQKRVIIFNLGLVPVVMFNPVLLSF-EGSYEAEEGCLSLVGVR-STKRYETIRLAY 104
Query: 106 YDMDWQPITIAAKGLTAFCLQHEIDHLNG 134
D WQ TI G A QHE+DHL G
Sbjct: 105 RDSKWQEQTITLTGFPAQICQHELDHLEG 133
>ref|NP_360311.1| (NC_003103) polypeptide deformylase [EC:3.5.1.31] [Rickettsia
conorii]
gb|AAL03212.1| (AE008626) polypeptide deformylase [EC:3.5.1.31] [Rickettsia
conorii]
Length = 183
Score = 50.8 bits (120), Expect = 2e-006
Identities = 34/95 (35%), Positives = 49/95 (50%), Gaps = 4/95 (4%)
Query: 46 NQVGVLKRIIVLRC----PTFKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKV 101
N VG+LKRI V+ + INP IT ++ EG LS+PG + R +
Sbjct: 54 NMVGILKRIAVVDLHENNKSSPIVFINPNITYFSEEKQTFIEGSLSFPGIEASITRSKAI 113
Query: 102 VVEGYDMDWQPITIAAKGLTAFCLQHEIDHLNGVT 136
V+ D + +AA+G A +QHEI++LNG T
Sbjct: 114 KVKYLDYNGNKQELAAEGFLATVIQHEIEYLNGKT 148
>ref|NP_229461.1| (NC_000853) polypeptide deformylase [Thermotoga maritima]
sp|P96113|DEF_THEMA Polypeptide deformylase (PDF) (Formylmethionine deformylase)
pir||C72224 polypeptide deformylase - Thermotoga maritima (strain MSB8)
emb|CAA71356.1| (Y10306) polypeptide deformylase [Thermotoga maritima]
gb|AAD36728.1|AE001808_3 (AE001808) polypeptide deformylase [Thermotoga maritima]
Length = 164
Score = 50.1 bits (118), Expect = 4e-006
Identities = 31/91 (34%), Positives = 44/91 (48%)
Query: 47 QVGVLKRIIVLRCPTFKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGY 106
QVG+ +R V+ VINP I + EGCLS+P V +R ++ V+
Sbjct: 49 QVGISQRFFVMDVGNGPVAVINPEILEIDPETEVAEEGCLSFPEIFVEIERSKRIKVKYQ 108
Query: 107 DMDWQPITIAAKGLTAFCLQHEIDHLNGVTI 137
+ + + +G A QHE DHLNGV I
Sbjct: 109 NTRGEYVEEELEGYAARVFQHEFDHLNGVLI 139
>ref|NP_561676.1| (NC_003366) polypeptide deformylase [Clostridium perfringens]
dbj|BAB80466.1| (AP003188) polypeptide deformylase [Clostridium perfringens]
Length = 136
Score = 46.6 bits (109), Expect = 4e-005
Identities = 30/93 (32%), Positives = 47/93 (50%), Gaps = 4/93 (4%)
Query: 46 NQVGVLKRIIVLRCPTFKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVV-VE 104
N +GV KRI+V ++NP+I + + + E CLS G K R +++ VE
Sbjct: 47 NMIGVKKRILVFSIGNIIVPMVNPVILKKEKSYE-TEESCLSLTG--FRKTTRYEIIEVE 103
Query: 105 GYDMDWQPITIAAKGLTAFCLQHEIDHLNGVTI 137
D +++ G TA +QHE+DH G+ I
Sbjct: 104 YLDRNFKKHKETFSGFTAQIIQHEVDHFEGIII 136
>sp|O08450|DEF_CLOBE Polypeptide deformylase (PDF) (Formylmethionine deformylase)
emb|CAB09662.1| (Z96934) formylmethionine deformylase; peptide deformylase
[Clostridium beijerinckii]
Length = 136
Score = 43.9 bits (102), Expect = 3e-004
Identities = 32/94 (34%), Positives = 45/94 (47%), Gaps = 6/94 (6%)
Query: 46 NQVGVLKRIIVLRCPTFKGCVINPIITRHTDGHVYSPEGCLSYPG--KTVAKKRRNKVVV 103
N +GV KRI+V +INP+I + + + E CLS G KT KR + V
Sbjct: 47 NMIGVKKRILVFTVGNLIVPMINPVILKKEKPYE-TEESCLSLIGFRKT---KRYETIEV 102
Query: 104 EGYDMDWQPITIAAKGLTAFCLQHEIDHLNGVTI 137
D ++ G TA +QHE+DH G+ I
Sbjct: 103 TYLDRNFNKKKQVFNGFTAQIIQHEMDHFEGIII 136
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.322 0.141 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 70,984,581
Number of Sequences: 884779
Number of extensions: 2425202
Number of successful extensions: 3955
Number of sequences better than 5.0e-02: 98
Number of HSP's better than 0.1 without gapping: 83
Number of HSP's successfully gapped in prelim test: 15
Number of HSP's that attempted gapping in prelim test: 3851
Number of HSP's gapped (non-prelim): 99
length of query: 137
length of database: 277,083,050
effective HSP length: 113
effective length of query: 24
effective length of database: 177,103,023
effective search space: 4250472552
effective search space used: 4250472552
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 83 (36.6 bits)
Query= orf63 [14990-15085]
(94 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.320 0.137 7.19e+252
Gapped
Lambda K H
0.267 0.0410 8.76e+252
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 42,933,908
Number of Sequences: 884779
Number of extensions: 1333473
Number of successful extensions: 4398
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4398
Number of HSP's gapped (non-prelim): 0
length of query: 94
length of database: 277,083,050
effective HSP length: 70
effective length of query: 24
effective length of database: 215,148,520
effective search space: 5163564480
effective search space used: 5163564480
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf64 [15068-15204]
(135 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.324 0.140 8.76e+252
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 92,315,289
Number of Sequences: 884779
Number of extensions: 3835217
Number of successful extensions: 6623
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 6623
Number of HSP's gapped (non-prelim): 0
length of query: 135
length of database: 277,083,050
effective HSP length: 111
effective length of query: 24
effective length of database: 178,872,581
effective search space: 4292941944
effective search space used: 4292941944
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 83 (36.6 bits)
Query= orf65 [15212-16004]
(791 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_489430.1| (NC_003273) ORF_ID:all8519~hypothetical pr... 99 2e-019
gb|AAB00938.1| (L31763) virulence-associated protein E [Dic... 85 3e-015
ref|NP_371315.1| (NC_002758) hypothetical protein [Staphylo... 85 3e-015
gb|AAL67615.1| (U93688) orf15 [Staphylococcus aureus] 85 3e-015
gb|AAL04136.1|AF410775_12 (AF410775) unknown [Staphylococcu... 84 6e-015
ref|NP_569125.1| (NC_003374) hypothetical protein [Gluconob... 83 1e-014
pir||S49113 hypothetical protein 2 - Microcystis aeruginosa... 81 5e-014
ref|NP_279342.1| (NC_002607) Vng0215c [Halobacterium sp. NR... 55 2e-006
>ref|NP_489430.1| (NC_003273) ORF_ID:all8519~hypothetical protein [Nostoc sp. PCC
7120]
dbj|BAB77438.1| (AP003604) ORF_ID:all8519~hypothetical protein [Nostoc sp. PCC
7120]
Length = 836
Score = 98.6 bits (244), Expect = 2e-019
Identities = 87/317 (27%), Positives = 147/317 (45%), Gaps = 20/317 (6%)
Query: 466 KSDIKGMIAYMAHRYAHNPVKAWVESAP--WDGLDHVGLLFSHITLTPDEDRPLCEHLFR 523
K K ++ +A +++PV +++S + +D V LL + PL L +
Sbjct: 341 KEQAKSVVLELALANSYSPVVEYLQSVERRYPNVD-VSLLDKLAERYFGTNDPLHAALMK 399
Query: 524 KWMRAAVSAGVGDQAGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRTDSVLLDVKDKDSVK 583
+ + AAV+ + + + +Q + K+ F+ L E F TD L+ +KD V
Sbjct: 400 RTLIAAVARAFEPGCKHDEITILQGKQKSL-KSTFWEILAGEEFFTDD--LNGTEKDEVL 456
Query: 584 TAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYPRMTAYVGSVNEA 643
YW++E E + ++ + + + LKAFLSR KD +R PYG +PR + VG+ N
Sbjct: 457 KISQYWILEYAEFENSYKKKDVSQLKAFLSRKKDSMRRPYGTDIEDFPRPSILVGTTNLD 516
Query: 644 EFLVDTTSNRRFIPMKVAALNHQHRVNVQQ---VWAQALQEARSG---MATYVEAHEVAQ 697
EFL D T RRF +KV ++ Q+ +WA A+ R+G T VE +
Sbjct: 517 EFLYDPTGERRFWVIKVLFKKIPIKMLQQERDLIWAAAVAAYRNGEQWRLTEVEDEWLDA 576
Query: 698 RNASFQATSAIDDLLSERLNEATGERNVHMTVTALLK---RCGLHNPTKRDLNDAGKWLR 754
N +Q+T ++ + A E ++V +L + L +K + N LR
Sbjct: 577 ANKQYQSTDTWEEAVM-----AWAELQKEVSVGEILSDVLKIELAKQSKAEQNRVAAILR 631
Query: 755 SNGYERCVRTGVRGFML 771
S+G+ R R V G ++
Sbjct: 632 SHGWTRGDRKRVNGRLI 648
>gb|AAB00938.1| (L31763) virulence-associated protein E [Dichelobacter nodosus]
Length = 437
Score = 85.1 bits (209), Expect = 3e-015
Identities = 71/250 (28%), Positives = 121/250 (48%), Gaps = 20/250 (8%)
Query: 465 PKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHITLTPDEDRPLCEHLFRK 524
P I I +A + P++ ++ES WDG + F E +
Sbjct: 96 PIDSIDMGIKMLAREHIFCPIQQYLESLQWDGKPRIATAFHDFLGV--ERNTYTIGISTL 153
Query: 525 WMRAAVSAGVGDQAGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRTDSVLLDVKDKDSVKT 584
+ RA V+ + V+V +QG +GK+ F R L E + + + K++++
Sbjct: 154 FFRALVARVYQPAIKFDHVIVLQGDQG-IGKSSFLRILGGEWYSDG--IRKFEGKEAIEA 210
Query: 585 AISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYPRMTAYVGSVNEAE 644
+ E+ EL G FS++E +KAF++RT+D R+ YGR +PR T +VG+ N +
Sbjct: 211 IQGVLIGEIPELQG-FSKAEIEEIKAFITRTEDRARMAYGRRVQSFPRRTVFVGTTNADD 269
Query: 645 FLVDTTSNRRFIPMKV-AALNHQHRVNV-QQVWAQALQEARSGMATYV----------EA 692
+L D T NRRF P+ AL+ + V +Q++A+A+ ++ A YV E
Sbjct: 270 YLRDATGNRRFFPIICHKALDFEALAEVREQLFAEAVVHYKA--APYVPLILQGKEALEQ 327
Query: 693 HEVAQRNASF 702
++AQ+ AS+
Sbjct: 328 AKLAQQAASY 337
>ref|NP_371315.1| (NC_002758) hypothetical protein [Staphylococcus aureus subsp.
aureus Mu50]
dbj|BAB56953.1| (AP003360) hypothetical protein [Staphylococcus aureus subsp.
aureus Mu50]
Length = 489
Score = 84.7 bits (208), Expect = 3e-015
Identities = 65/234 (27%), Positives = 107/234 (44%), Gaps = 14/234 (5%)
Query: 452 LSEIESIGNLNHYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHITLTP 511
+S I+ + N+ + + I +I A++ +P+K+ +ES WDG+ + LF I
Sbjct: 95 ISHIDKLYNVQ-FSRDLIDTVIEKEAYQNRFHPIKSMIESKSWDGIKRIETLF--IDYLG 151
Query: 512 DEDRPLCEHLFRKWMRAAVSAGVGDQAGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRTDS 571
ED + +KWM AV+ + +++ QG VGK+ L +
Sbjct: 152 AEDNHYNREVTKKWMMGAVARIYQPGIKYDSMIILYGGQG-VGKSTAVSKLGGHWYNQS- 209
Query: 572 VLLDVKDKDSVKTAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYP 631
+ K + K W+ E+ EL F +S +K F+S D R YG+ ++P
Sbjct: 210 -IKTFKGDEVYKKLQGSWICEIEELSA-FQKSTIEDIKGFISAIVDIYRASYGKRTERHP 267
Query: 632 RMTAYVGSVNEAEFLVDTTSNRRFIPMKV-------AALNHQHRVNVQQVWAQA 678
R +VG+ N EFL D T NRRF P+ + + V VQQ++A+A
Sbjct: 268 RQCVFVGTTNNYEFLKDQTGNRRFFPITTDKNKATKSPFDDLTPVVVQQMFAEA 321
>gb|AAL67615.1| (U93688) orf15 [Staphylococcus aureus]
Length = 477
Score = 84.7 bits (208), Expect = 3e-015
Identities = 65/234 (27%), Positives = 107/234 (44%), Gaps = 14/234 (5%)
Query: 452 LSEIESIGNLNHYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHITLTP 511
+S I+ + N+ + + I +I A++ +P+K+ +ES WDG+ + LF I
Sbjct: 87 ISHIDKLYNVQ-FSRDLIDTVIEKEAYQNRFHPIKSMIESKSWDGIKRIETLF--IDYLG 143
Query: 512 DEDRPLCEHLFRKWMRAAVSAGVGDQAGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRTDS 571
ED + +KWM AV+ + +++ QG VGK+ L +
Sbjct: 144 AEDNHYNREVTKKWMMGAVARIYQPGIKYDSMIILYGGQG-VGKSTAVSKLGGHWYNQS- 201
Query: 572 VLLDVKDKDSVKTAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYP 631
+ K + K W+ E+ EL F +S +K F+S D R YG+ ++P
Sbjct: 202 -IKTFKGDEVYKKLQGSWICEIEELSA-FQKSTIEDIKGFISAIVDIYRASYGKRTERHP 259
Query: 632 RMTAYVGSVNEAEFLVDTTSNRRFIPMKV-------AALNHQHRVNVQQVWAQA 678
R +VG+ N EFL D T NRRF P+ + + V VQQ++A+A
Sbjct: 260 RQCVFVGTTNNYEFLKDQTGNRRFFPITTDKNKATKSPFDDLTPVVVQQMFAEA 313
>gb|AAL04136.1|AF410775_12 (AF410775) unknown [Staphylococcus aureus]
Length = 474
Score = 84.0 bits (206), Expect = 6e-015
Identities = 59/207 (28%), Positives = 96/207 (45%), Gaps = 7/207 (3%)
Query: 452 LSEIESIGNLNHYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHITLTP 511
+S I+ + N+ + + I +I A++ +P+K+ +ES WDG+ + LF I
Sbjct: 84 ISHIDKLYNVQ-FSRDLIDTVIEKEAYQNRFHPIKSMIESKSWDGIKRIETLF--IDYLG 140
Query: 512 DEDRPLCEHLFRKWMRAAVSAGVGDQAGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRTDS 571
ED + +KWM AV+ + +++ QG VGK+ L +
Sbjct: 141 AEDNHYNREVTKKWMMGAVARIYQPGIKYDSMIILYGGQG-VGKSTAVSKLGGHWYNQS- 198
Query: 572 VLLDVKDKDSVKTAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYP 631
+ K + K W+ E+ EL F +S +K F+S D R YG+ ++P
Sbjct: 199 -IKTFKGDEVYKKLQGSWICEIEELSA-FQKSTIEDIKGFISAIVDIYRASYGKRTERHP 256
Query: 632 RMTAYVGSVNEAEFLVDTTSNRRFIPM 658
R +VG+ N EFL D T NRRF P+
Sbjct: 257 RQCVFVGTTNNYEFLKDQTGNRRFFPI 283
>ref|NP_569125.1| (NC_003374) hypothetical protein [Gluconobacter oxydans]
emb|CAD21834.1| (AJ428837) hypothetical protein [Gluconobacter oxydans]
Length = 416
Score = 82.8 bits (203), Expect = 1e-014
Identities = 73/248 (29%), Positives = 108/248 (43%), Gaps = 35/248 (14%)
Query: 463 HYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGL-LFSHITLTPDEDRPLCEHL 521
++ + I +A A++P++ ++ WDG+ + L + E L
Sbjct: 99 NFSTQQVSAAIQSVAEENAYHPIRDYLTGLEWDGVQRLDTWLHDFLGAEDSEYTAGVGRL 158
Query: 522 FRKWMRAAVSAGVGDQAGC--EPVLVFVDEQGG---------VGKTRFFRTLCPEPFRTD 570
F M A V + GC + VL+F QG G +R+F P P R
Sbjct: 159 FLIGMVARVM-----EPGCKNDHVLIFEGNQGIRKSTACKVLAGGSRYFSDSLP-PIREG 212
Query: 571 SVLLDVKDKDSVKTAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKY 630
S KD WLVE+ E+ SR+++ ALKAFL+RT + R YGR +
Sbjct: 213 S-------KDLASHLAGKWLVEVAEMSAA-SRADTEALKAFLTRTHETYRPAYGRNEITQ 264
Query: 631 PRMTAYVGSVNEAEFLVDTTSNRRFIPMKVAALNHQHRVNVQQVWAQALQEARSGMATYV 690
PR VG+ N E+L D+T +RR P+K VN + LQEAR + +
Sbjct: 265 PRQCVLVGTTNRKEYLRDSTGDRRVWPVKCG-------VNGEPCDIDGLQEARDQL--FA 315
Query: 691 EAHEVAQR 698
EA + QR
Sbjct: 316 EAFHLYQR 323
>pir||S49113 hypothetical protein 2 - Microcystis aeruginosa
emb|CAA82191.1| (Z28337) ORF2 [Microcystis aeruginosa]
Length = 502
Score = 80.9 bits (198), Expect = 5e-014
Identities = 67/218 (30%), Positives = 106/218 (47%), Gaps = 21/218 (9%)
Query: 555 KTRFFRTLCPEPFRTDSVLLDVKDKDSVKTAISYWLVELGELDGTFSRSESNALKAFLSR 614
K+ FF+TL E F TD+++ DKD + T W++E ELD +++ +K+FLSR
Sbjct: 278 KSTFFKTLGGEYF-TDAMMST--DKDGLLTMAKNWIIEWSELDHFTAKTYHGIIKSFLSR 334
Query: 615 TKDDIRLPYGRTNMKYPRMTAYVGSVNEAEFLVDTTSNRRFIPMKVAALNHQHRVNVQQV 674
+ D R PY R M +PR + VGS N FL D T NRRF + + +H++N+
Sbjct: 335 SDDFYREPYAREAMSHPRCSMIVGSTNRDTFLNDPTGNRRF---WIIPIPKKHKINID-- 389
Query: 675 WAQALQEARSGMAT--YVEAHEV----------AQRNASFQATSAIDDLLSERLNEATGE 722
+ QA +E G A Y++ A+ N ++ + +D L+E L+ + +
Sbjct: 390 YLQAYREKLLGWAVWRYLDGEPCQLPEEFKALQAEANKQWENNDSWEDELAEFLDRES-D 448
Query: 723 RNVHMTVTALLKRCGLHNPTKRDLNDAGKWLRSNGYER 760
+V + L R N + D G LR G+ R
Sbjct: 449 LSVKECLQRLEDRGYPINFGRSDEMRMGDILRKAGFSR 486
>ref|NP_279342.1| (NC_002607) Vng0215c [Halobacterium sp. NRC-1]
gb|AAG18822.1| (AE004986) Vng0215c [Halobacterium sp. NRC-1]
Length = 857
Score = 55.5 bits (132), Expect = 2e-006
Identities = 33/112 (29%), Positives = 55/112 (48%), Gaps = 11/112 (9%)
Query: 175 EHDSAERVEITDQEAQTVREALSYIDPGDKREEWVKVGLALKSGFGDDPQGLSLFDEWSS 234
++D E ++ +D + +ALS+++P +EW+K+G A+ + G LF++WS
Sbjct: 212 DYDGDEWLDKSD-----IEDALSHVNPDCGYDEWLKLGFAVHD-WDSGSTGKRLFEQWSK 265
Query: 235 GALWQDGDMPANYVPEHIETQWHSFKAEGGRTIATVYHKAIEGGWQPPAGIN 286
G D N +I+ W S G T+ T+ H A +GGW P N
Sbjct: 266 GGSKWD-----NQAERNIQDIWDSASEGQGVTVGTLIHYAKDGGWTVPTRSN 312
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.317 0.134 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 449,803,643
Number of Sequences: 884779
Number of extensions: 19827664
Number of successful extensions: 52924
Number of sequences better than 5.0e-02: 14
Number of HSP's better than 0.1 without gapping: 10
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 52901
Number of HSP's gapped (non-prelim): 17
length of query: 791
length of database: 277,083,050
effective HSP length: 128
effective length of query: 663
effective length of database: 163,831,338
effective search space: 108620177094
effective search space used: 108620177094
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 95 (41.2 bits)
Query= orf66 [16203-16434]
(230 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.319 0.132 5.03e+175
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 104,471,067
Number of Sequences: 884779
Number of extensions: 3759278
Number of successful extensions: 11290
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 11290
Number of HSP's gapped (non-prelim): 0
length of query: 230
length of database: 277,083,050
effective HSP length: 116
effective length of query: 114
effective length of database: 174,448,686
effective search space: 19887150204
effective search space used: 19887150204
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 89 (38.9 bits)
Query= orf67 [16507-16558]
(51 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
884,779 sequences; 277,083,050 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Feb 23, 2002 3:08 AM
Number of letters in database: 277,083,050
Number of sequences in database: 884,779
Lambda K H
0.315 0.128 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 19,334,521
Number of Sequences: 884779
Number of extensions: 414668
Number of successful extensions: 1433
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 1433
Number of HSP's gapped (non-prelim): 0
length of query: 51
length of database: 277,083,050
effective HSP length: 27
effective length of query: 24
effective length of database: 253,194,017
effective search space: 6076656408
effective search space used: 6076656408
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 84 (37.0 bits)