National Center for Biotechnology Information (NCBI)
welcome to the blast network service.
PEPTIDE SEQUENCE DATABASES
nr Non-redundant GenBank CDS translations+PDB+SwissProt+PIR
month All new or revised GenBank CDS translation+PDB+SwissProt+PIR
sequences released in the last 30 days
pdb PDB protein sequences
yeast Yeast (Saccharomyces cerevisiae) protein sequences.
kabat Kabat Sequences of Proteins of Immunological Interest
alu * Translations of Select Alu Repeats from REPBASE
swissprot SwissProt sequences
NUCLEOTIDE SEQUENCE DATABASES
nr Non-redundant GenBank+EMBL+DDBJ+PDB sequences
(but no EST, STS, GSS, or HTGS sequences)
month All new or revised GenBank+EMBL+DDBJ+PDB sequences released in
the last 30 days
yeast Yeast (Saccharomyces cerevisiae) genomic nucleotide sequences.
est + Non-redundant Database of GenBank+EMBL+DDBJ EST Division
sts + Non-redundant Database of GenBank+EMBL+DDBJ STS Division
htgs High Throughput Genomic Sequences
pdb PDB nucleotide sequences
vector Vector subset of GenBank
mito * Database of mitochondrial sequences, Rel. 1.0, July 1995
gss Genome Survey Sequences (includes single-pass genomic data, exon-
trapped sequences, and Alu PCR primers.
kabat Kabat Sequences of Nucleic Acid of Immunological Interest
epd Eukaryotic Promotor Database
alu *+ Select Alu Repeats from REPBASE
BLASTP 2.2.3 [Apr-24-2002]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= orf1 [1-97]
(94 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.335 0.141 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 36,126,822
Number of Sequences: 1038431
Number of extensions: 689974
Number of successful extensions: 937
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 937
Number of HSP's gapped (non-prelim): 0
length of query: 94
length of database: 328,470,697
effective HSP length: 70
effective length of query: 24
effective length of database: 255,780,527
effective search space: 6138732648
effective search space used: 6138732648
T: 11
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.7 bits)
S2: 84 (37.0 bits)
Query= orf2 [116-399]
(282 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064740.1| (NC_002519) gp3 [Roseophage SIO1] >gi|9944... 552 e-156
ref|NP_640321.1| (NC_003907) hypothetical protein [Vibrioph... 144 1e-033
>ref|NP_064740.1| (NC_002519) gp3 [Roseophage SIO1]
gb|AAG02585.1|AF189021_3 (AF189021) Roseophage SIO1 complete genome
Length = 282
Score = 552 bits (1423), Expect = e-156
Identities = 271/271 (100%), Positives = 271/271 (100%)
Query: 1 MAGTTIDIESMIDPHSLAVEIANRWTSWNNARSEKVKEWKELRNYIYATDTRTTSNNKLP 60
MAGTTIDIESMIDPHSLAVEIANRWTSWNNARSEKVKEWKELRNYIYATDTRTTSNNKLP
Sbjct: 1 MAGTTIDIESMIDPHSLAVEIANRWTSWNNARSEKVKEWKELRNYIYATDTRTTSNNKLP 60
Query: 61 WSNSTTTPKLTQIADNLHANYFAALFPQKRWFRFEATDADSDTKIKRSIIQAYMQNKLRQ 120
WSNSTTTPKLTQIADNLHANYFAALFPQKRWFRFEATDADSDTKIKRSIIQAYMQNKLRQ
Sbjct: 61 WSNSTTTPKLTQIADNLHANYFAALFPQKRWFRFEATDADSDTKIKRSIIQAYMQNKLRQ 120
Query: 121 SDFVNTTSKLVNDYIQYGNCFATVDFERKVTKYEDGDRIVNYVGPKVVRISPFDICFNPL 180
SDFVNTTSKLVNDYIQYGNCFATVDFERKVTKYEDGDRIVNYVGPKVVRISPFDICFNPL
Sbjct: 121 SDFVNTTSKLVNDYIQYGNCFATVDFERKVTKYEDGDRIVNYVGPKVVRISPFDICFNPL 180
Query: 181 AANFSDTPKIVRSVLTLGEIQRMVENDSSKGYMADIFNKMLGNRGSARGNEVDINKSEGF 240
AANFSDTPKIVRSVLTLGEIQRMVENDSSKGYMADIFNKMLGNRGSARGNEVDINKSEGF
Sbjct: 181 AANFSDTPKIVRSVLTLGEIQRMVENDSSKGYMADIFNKMLGNRGSARGNEVDINKSEGF 240
Query: 241 VADGFASLTDYYESDYVEVLTFYGDIYDKVR 271
VADGFASLTDYYESDYVEVLTFYGDIYDKVR
Sbjct: 241 VADGFASLTDYYESDYVEVLTFYGDIYDKVR 271
>ref|NP_640321.1| (NC_003907) hypothetical protein [Vibriophage VpV262]
gb|AAM28399.1| (AY095314) hypothetical protein [Vibriophage VpV262]
Length = 599
Score = 144 bits (362), Expect = 1e-033
Identities = 86/266 (32%), Positives = 134/266 (50%), Gaps = 4/266 (1%)
Query: 8 IESMIDPHSLAVEIANRWTSWNNARSEKVKEWKELRNYIYATDTRTTSNNKLPWSNSTTT 67
+E D + E+ +T+ NAR++K +E KEL +YI ATDTR TSN+KLP+ NSTT
Sbjct: 12 LEGRDDDRAFIDELVVLFTNMENARAQKDREDKELMDYIDATDTRKTSNSKLPFKNSTTI 71
Query: 68 PKLTQIADNLHANYFAALFPQKRWFRFEATDADSDTKIKRSIIQAYMQNKLRQSDFVNTT 127
KL + + +Y L P + W F D DS KR I ++Y++ K+ S+
Sbjct: 72 NKLAHLHLMITTSYMEHLLPNRNWVDFVGFDNDSVNAEKREIARSYVRGKVEASNLEGVI 131
Query: 128 SKLVNDYIQYGNCFATVDFERKVTKYEDGDRIVNYVGPKVVRISPFDICFNPLAANFSDT 187
++V+D+ G C A +++T + I NY G R+SP D+ ++ A +
Sbjct: 132 ERMVDDFAVRGFCVAHTRHVKRMTVTAENQVIKNYSGTVTERLSPSDVFWDVTADSLPKA 191
Query: 188 PKIVRSVLTLGEIQRMVENDSSKGYMADIFNKMLGNRGSARGNEVD----INKSEGFVAD 243
K +R + TLG ++R +E + + F K+ R + R D K +
Sbjct: 192 AKCIRQLYTLGSLKREIEEGTFPLMSMEDFQKLREERRTIREALADGYNGRRKFDSLHKK 251
Query: 244 GFASLTDYYESDYVEVLTFYGDIYDK 269
G+ S+ +Y VEVLTF GD YD+
Sbjct: 252 GYGSMMNYINEGVVEVLTFMGDFYDE 277
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.317 0.133 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 175,248,416
Number of Sequences: 1038431
Number of extensions: 7018587
Number of successful extensions: 18455
Number of sequences better than 5.0e-02: 2
Number of HSP's better than 0.1 without gapping: 2
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 18452
Number of HSP's gapped (non-prelim): 2
length of query: 282
length of database: 328,470,697
effective HSP length: 120
effective length of query: 162
effective length of database: 203,858,977
effective search space: 33025154274
effective search space used: 33025154274
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 90 (39.3 bits)
Query= orf3 [433-705]
(271 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064741.1| (NC_002519) gp5 [Roseophage SIO1] >gi|9944... 537 e-152
ref|NP_640321.1| (NC_003907) hypothetical protein [Vibrioph... 151 8e-036
ref|NP_597932.1| (NC_002519) gp5-like protein [Roseophage S... 123 2e-027
>ref|NP_064741.1| (NC_002519) gp5 [Roseophage SIO1]
gb|AAG02587.1|AF189021_5 (AF189021) Roseophage SIO1 complete genome
Length = 271
Score = 537 bits (1384), Expect = e-152
Identities = 271/271 (100%), Positives = 271/271 (100%)
Query: 1 MGPLDNLVGMQYRIDHLENLKADVFDQIAYPVLKIRGDVEDFDFEPNARIYLGDEGDVGY 60
MGPLDNLVGMQYRIDHLENLKADVFDQIAYPVLKIRGDVEDFDFEPNARIYLGDEGDVGY
Sbjct: 1 MGPLDNLVGMQYRIDHLENLKADVFDQIAYPVLKIRGDVEDFDFEPNARIYLGDEGDVGY 60
Query: 61 LVPDSTALNADFQIQNIEAKMEMMAGAPREAMGIRSAGEKTAFEVGQLMTAAGRIFQHKT 120
LVPDSTALNADFQIQNIEAKMEMMAGAPREAMGIRSAGEKTAFEVGQLMTAAGRIFQHKT
Sbjct: 61 LVPDSTALNADFQIQNIEAKMEMMAGAPREAMGIRSAGEKTAFEVGQLMTAAGRIFQHKT 120
Query: 121 AHFERVFLEPILNAMLETARRNMDYEDTAKVLNEDTGLYFFTQITRDDIKANGKIVPMGA 180
AHFERVFLEPILNAMLETARRNMDYEDTAKVLNEDTGLYFFTQITRDDIKANGKIVPMGA
Sbjct: 121 AHFERVFLEPILNAMLETARRNMDYEDTAKVLNEDTGLYFFTQITRDDIKANGKIVPMGA 180
Query: 181 RHFAERAQRVQNLTTMYQIKASDPTVAAHLSGKEFARLLADELGEPALFKENVSVSEQMG 240
RHFAERAQRVQNLTTMYQIKASDPTVAAHLSGKEFARLLADELGEPALFKENVSVSEQMG
Sbjct: 181 RHFAERAQRVQNLTTMYQIKASDPTVAAHLSGKEFARLLADELGEPALFKENVSVSEQMG 240
Query: 241 KTVFTVVMRWVVGNKFLFQNLTVTTTVVIHL 271
KTVFTVVMRWVVGNKFLFQNLTVTTTVVIHL
Sbjct: 241 KTVFTVVMRWVVGNKFLFQNLTVTTTVVIHL 271
>ref|NP_640321.1| (NC_003907) hypothetical protein [Vibriophage VpV262]
gb|AAM28399.1| (AY095314) hypothetical protein [Vibriophage VpV262]
Length = 599
Score = 151 bits (381), Expect = 8e-036
Identities = 81/198 (40%), Positives = 113/198 (56%), Gaps = 2/198 (1%)
Query: 1 MGPLDNLVGMQYRIDHLENLKADVFDQIAYPVLKIRGDVEDFDFE--PNARIYLGDEGDV 58
+GPL L GMQY++D EN + D+ D+ +P LK GDV + PN + + GDV
Sbjct: 326 IGPLHRLTGMQYKLDKRENFREDLHDRFLHPSLKKVGDVREKGMRGGPNHVFEVEETGDV 385
Query: 59 GYLVPDSTALNADFQIQNIEAKMEMMAGAPREAMGIRSAGEKTAFEVGQLMTAAGRIFQH 118
Y+ P + L D Q+ ME ++GAP+E++G R+AGEKT FEV L ++F+
Sbjct: 386 QYMTPPAEVLQPDNQLSITLQLMEDLSGAPKESIGQRTAGEKTKFEVQLLDQGQNKVFRR 445
Query: 119 KTAHFERVFLEPILNAMLETARRNMDYEDTAKVLNEDTGLYFFTQITRDDIKANGKIVPM 178
K FER L P+LN LE R ++D DT K N + G F IT DD+ NG++V
Sbjct: 446 KVKKFERELLTPVLNDYLEQGRNHLDASDTIKTFNSELGTATFLDITADDLNLNGQMVAQ 505
Query: 179 GARHFAERAQRVQNLTTM 196
GA FAE+A +QNL +
Sbjct: 506 GATLFAEKANTLQNLNAI 523
>ref|NP_597932.1| (NC_002519) gp5-like protein [Roseophage SIO1]
Length = 99
Score = 123 bits (308), Expect = 2e-027
Identities = 62/62 (100%), Positives = 62/62 (100%)
Query: 178 MGARHFAERAQRVQNLTTMYQIKASDPTVAAHLSGKEFARLLADELGEPALFKENVSVSE 237
MGARHFAERAQRVQNLTTMYQIKASDPTVAAHLSGKEFARLLADELGEPALFKENVSVSE
Sbjct: 1 MGARHFAERAQRVQNLTTMYQIKASDPTVAAHLSGKEFARLLADELGEPALFKENVSVSE 60
Query: 238 QM 239
QM
Sbjct: 61 QM 62
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.321 0.137 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 156,973,050
Number of Sequences: 1038431
Number of extensions: 5982633
Number of successful extensions: 15236
Number of sequences better than 5.0e-02: 3
Number of HSP's better than 0.1 without gapping: 3
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 15232
Number of HSP's gapped (non-prelim): 3
length of query: 271
length of database: 328,470,697
effective HSP length: 119
effective length of query: 152
effective length of database: 204,897,408
effective search space: 31144406016
effective search space used: 31144406016
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 90 (39.3 bits)
Query= orf4 [643-1092]
(448 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064742.1| (NC_002519) gp4 [Roseophage SIO1] >gi|9944... 732 0.0
ref|NP_064739.1| (NC_002519) gp2 [Roseophage SIO1] >gi|9944... 396 e-109
ref|NP_640322.1| (NC_003907) hypothetical protein [Vibrioph... 343 3e-093
ref|NP_064738.1| (NC_002519) gp1 [Roseophage SIO1] >gi|9944... 279 6e-074
sp|P10310|VDMB_BPT3 DNA MATURASE B (DNA PACKAGING PROTEIN B... 54 5e-006
ref|NP_042010.1| (NC_001604) gene 19, DNA maturation protei... 53 7e-006
ref|NP_641408.1| (NC_003919) phage-related DNA maturase [Xa... 50 7e-005
ref|NP_298858.1| (NC_002488) hypothetical protein [Xylella ... 46 8e-004
>ref|NP_064742.1| (NC_002519) gp4 [Roseophage SIO1]
gb|AAG02586.1|AF189021_4 (AF189021) Roseophage SIO1 complete genome
Length = 353
Score = 732 bits (1889), Expect = 0.0
Identities = 353/353 (100%), Positives = 353/353 (100%)
Query: 96 MHCDIAVLDDVVVFENAYTNEGRNKVKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLM 155
MHCDIAVLDDVVVFENAYTNEGRNKVKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLM
Sbjct: 1 MHCDIAVLDDVVVFENAYTNEGRNKVKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLM 60
Query: 156 GMEEDIYSKEGELVGKENIYEVMEKAVEDNGDGTGEFLWPRQLRKDGKFFGFDVQILAKK 215
GMEEDIYSKEGELVGKENIYEVMEKAVEDNGDGTGEFLWPRQLRKDGKFFGFDVQILAKK
Sbjct: 61 GMEEDIYSKEGELVGKENIYEVMEKAVEDNGDGTGEFLWPRQLRKDGKFFGFDVQILAKK 120
Query: 216 RGQYLDRVQFRAQYYNDPTDPDSQPIAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVD 275
RGQYLDRVQFRAQYYNDPTDPDSQPIAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVD
Sbjct: 121 RGQYLDRVQFRAQYYNDPTDPDSQPIAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVD 180
Query: 276 FAYSVSKRADYTAIVVIGVDSENNVYVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAE 335
FAYSVSKRADYTAIVVIGVDSENNVYVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAE
Sbjct: 181 FAYSVSKRADYTAIVVIGVDSENNVYVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAE 240
Query: 336 CTAAQSAIVSELKDNYIKPNGLALKIDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRG 395
CTAAQSAIVSELKDNYIKPNGLALKIDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRG
Sbjct: 241 CTAAQSAIVSELKDNYIKPNGLALKIDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRG 300
Query: 396 GNCQVLEEELVSYNPAHDDCKDCLAHLFTDRYVLFEECWLTKFVCKKTGKLFA 448
GNCQVLEEELVSYNPAHDDCKDCLAHLFTDRYVLFEECWLTKFVCKKTGKLFA
Sbjct: 301 GNCQVLEEELVSYNPAHDDCKDCLAHLFTDRYVLFEECWLTKFVCKKTGKLFA 353
>ref|NP_064739.1| (NC_002519) gp2 [Roseophage SIO1]
gb|AAG02584.1|AF189021_2 (AF189021) Roseophage SIO1 complete genome
Length = 247
Score = 396 bits (1018), Expect = e-109
Identities = 190/190 (100%), Positives = 190/190 (100%)
Query: 231 NDPTDPDSQPIAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVDFAYSVSKRADYTAIV 290
NDPTDPDSQPIAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVDFAYSVSKRADYTAIV
Sbjct: 24 NDPTDPDSQPIAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVDFAYSVSKRADYTAIV 83
Query: 291 VIGVDSENNVYVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAECTAAQSAIVSELKDN 350
VIGVDSENNVYVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAECTAAQSAIVSELKDN
Sbjct: 84 VIGVDSENNVYVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAECTAAQSAIVSELKDN 143
Query: 351 YIKPNGLALKIDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRGGNCQVLEEELVSYNP 410
YIKPNGLALKIDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRGGNCQVLEEELVSYNP
Sbjct: 144 YIKPNGLALKIDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRGGNCQVLEEELVSYNP 203
Query: 411 AHDDCKDCLA 420
AHDDCKDCLA
Sbjct: 204 AHDDCKDCLA 213
>ref|NP_640322.1| (NC_003907) hypothetical protein [Vibriophage VpV262]
gb|AAM28398.1| (AY095314) hypothetical protein [Vibriophage VpV262]
Length = 535
Score = 343 bits (879), Expect = 3e-093
Identities = 178/420 (42%), Positives = 256/420 (60%), Gaps = 8/420 (1%)
Query: 2 VAYRVAWELTKNPTLRVLYISATSNLAQKQLSFIKNIFESDIHQKYWPEHLHKDESKREK 61
+A V W++ KNP + + Y+ AT +LA QL IK I SD + P+ + E KR+K
Sbjct: 79 IAVWVCWQIFKNPAVTIAYVCATESLAILQLYDIKQILTSDEFTRLSPDMIEPMEKKRQK 138
Query: 62 WTTSEIALDHPDRKKEAIRDPSIFTGGLTTSLTGMHCDIAVLDDVVVFENAYTNEGRNKV 121
W + I +DHP RKKE RDP++ GL ++ G HC+I V DDVV+ +N+ T R KV
Sbjct: 139 WAETAIIVDHPIRKKERPRDPTVLATGLDSNNIGAHCNIMVKDDVVIDKNSLTETARQKV 198
Query: 122 KSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLMGMEEDIYSKEGELVGKENIYEVMEKA 181
+++ LSSI ++ E+ VGTRYHPKD Y L+ M E+++ + +LVG+ +Y V +
Sbjct: 199 EAKAGHLSSILTTDGMEFCVGTRYHPKDHYQTLIDMTEEVWEGD-QLVGERPVYAVHTRV 257
Query: 182 VEDNGDGTGEFLWPRQLRK-DGKFFGFDVQILAKKRGQY-LDRVQFRAQYYNDPTDPDSQ 239
VE G FLWPR R+ DGK FGFD L++K+ +Y D F QYYNDP +
Sbjct: 258 VEVE----GVFLWPRMARESDGKMFGFDRAQLSRKKAKYRKDMRNFYCQYYNDPNAINEG 313
Query: 240 PIAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVDFAYSVSKRADYTAIVVIGVDSENN 299
I F YYDK+ + R G W+ +K+N+ A DFAYSV +D+TA+ ++G+D +
Sbjct: 314 GIEKSMFLYYDKEKVVRRKGSWWVGKRKVNIIACQDFAYSVQTGSDWTALFILGMDKDKR 373
Query: 300 VYVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAECTAAQSAIVSELKDNYIKPNGLAL 359
+Y+LDI R++T K S Y++++ N+W F+ +RAE AAQ I+ L++ Y + +
Sbjct: 374 IYILDIVRYQTKKPSVYWKNLETAYNKWKFKWVRAEAVAAQEVIIEALRE-YAEKEQCPI 432
Query: 360 KIDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRGGNCQVLEEELVSYNPAHDDCKDCL 419
+I ++PN G KE RI+ LEP Y+N +YHYRGG C +LEEEL P HDD KDCL
Sbjct: 433 RIKAYKPNSQSGDKETRISQTLEPLYENGDIYHYRGGLCDMLEEELTQDRPPHDDLKDCL 492
>ref|NP_064738.1| (NC_002519) gp1 [Roseophage SIO1]
gb|AAG02583.1|AF189021_1 (AF189021) Roseophage SIO1 complete genome
Length = 136
Score = 279 bits (713), Expect = 6e-074
Identities = 135/135 (100%), Positives = 135/135 (100%)
Query: 96 MHCDIAVLDDVVVFENAYTNEGRNKVKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLM 155
MHCDIAVLDDVVVFENAYTNEGRNKVKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLM
Sbjct: 1 MHCDIAVLDDVVVFENAYTNEGRNKVKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLM 60
Query: 156 GMEEDIYSKEGELVGKENIYEVMEKAVEDNGDGTGEFLWPRQLRKDGKFFGFDVQILAKK 215
GMEEDIYSKEGELVGKENIYEVMEKAVEDNGDGTGEFLWPRQLRKDGKFFGFDVQILAKK
Sbjct: 61 GMEEDIYSKEGELVGKENIYEVMEKAVEDNGDGTGEFLWPRQLRKDGKFFGFDVQILAKK 120
Query: 216 RGQYLDRVQFRAQYY 230
RGQYLDRVQFRAQYY
Sbjct: 121 RGQYLDRVQFRAQYY 135
>sp|P10310|VDMB_BPT3 DNA MATURASE B (DNA PACKAGING PROTEIN B) (GP19)
pir||JVBPB3 DNA maturase B - phage T3
gb|AAA92528.1| (M14784) DNA-packaging protein B [Bacteriophage T3]
Length = 586
Score = 53.5 bits (127), Expect = 5e-006
Identities = 35/116 (30%), Positives = 55/116 (47%), Gaps = 10/116 (8%)
Query: 6 VAWELTKNPTLRVLYISATSNLAQKQLSFIKNIFESDIHQKYWPEHLHKDESKREKWTTS 65
V W L ++P L++L +SA+ A FIKNI I + + L +R+ +
Sbjct: 72 VVWTLWRDPQLKILIVSASKERADLNSIFIKNI----IDLLPFLDELKPSPGQRDSVISF 127
Query: 66 EIALDHPDRKKEAIRDPSIFTGGLTTSLTGMHCDIAVLDDVVVFENAYTNEGRNKV 121
++ PD PS+ + G+T LTG DI + DDV + N+ T R K+
Sbjct: 128 DVGPAKPDHS------PSVKSVGITGQLTGSRADIIIADDVEIPSNSATQGAREKL 177
>ref|NP_042010.1| (NC_001604) gene 19, DNA maturation protein [Enterobacteria phage
T7]
sp|P03694|VDMB_BPT7 DNA MATURASE B (DNA PACKAGING PROTEIN B) (GP19)
pir||JVBPB7 DNA maturase B - phage T7
emb|CAA24440.1| (V01146) gene 19, DNA maturation protein [Enterobacteria phage T7]
Length = 586
Score = 53.1 bits (126), Expect = 7e-006
Identities = 44/153 (28%), Positives = 68/153 (43%), Gaps = 16/153 (10%)
Query: 6 VAWELTKNPTLRVLYISATSNLAQKQLSFIKNIFESDIHQKYWPEHLHKDESKREKWTTS 65
V W L ++P L++L +SA+ A FIKNI I + L +R+ +
Sbjct: 72 VVWSLWRDPQLKILIVSASKERADANSIFIKNI----IDLLPFLSELKPRPGQRDSVISF 127
Query: 66 EIALDHPDRKKEAIRDPSIFTGGLTTSLTGMHCDIAVLDDVVVFENAYTNEGRNK----V 121
++ +PD PS+ + G+T LTG DI + DDV + N+ T R K V
Sbjct: 128 DVGPANPDHS------PSVKSVGITGQLTGSRADIIIADDVEIPSNSATMGAREKLWTLV 181
Query: 122 KSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDL 154
+ +LL + S +GT LY +L
Sbjct: 182 QEFAALLKPLPSSRV--IYLGTPQTEMTLYKEL 212
>ref|NP_641408.1| (NC_003919) phage-related DNA maturase [Xanthomonas axonopodis pv.
citri str. 306]
gb|AAM35944.1| (AE011736) phage-related DNA maturase [Xanthomonas axonopodis pv.
citri str. 306]
Length = 526
Score = 49.7 bits (117), Expect = 7e-005
Identities = 41/162 (25%), Positives = 69/162 (42%), Gaps = 12/162 (7%)
Query: 1 MVAYRVAWELTKNPTLRVLYISATSNLAQKQLSFIKNIFESDIHQKYWP-EHLHKDESKR 59
+ A V W L K+P +++ +SA+ A F+K I E+ + P HL +R
Sbjct: 65 LTAGYVCWLLWKDPQHKIMVVSASKERADAFSVFVKQIIET-----FEPLSHLRPRSDQR 119
Query: 60 EKWTTSEIALDHPDRKKEAIRDPSIFTGGLTTSLTGMHCDIAVLDDVVVFENAYTNEGRN 119
++ PD+ PS+ + G+T LTG + + DD+ V +N+ T R
Sbjct: 120 NSNLAFDVGPALPDQS------PSVKSVGITGQLTGSRANTIIPDDIEVVKNSQTIVQRE 173
Query: 120 KVKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLMGMEEDI 161
K+ ++ Q +GT + LY+ L DI
Sbjct: 174 KLAELIKEFDAVLKPGGQVIYLGTPQTEESLYNKLPERGYDI 215
>ref|NP_298858.1| (NC_002488) hypothetical protein [Xylella fastidiosa 9a5c]
ref|NP_298963.1| (NC_002488) hypothetical protein [Xylella fastidiosa 9a5c]
sp|Q9P9T5|YF69_XYLFA Hypothetical protein Xf1569/Xf1674
pir||H82649 hypothetical protein XF1674 XF1569 [imported] - Xylella fastidiosa
(strain 9a5c)
gb|AAF84378.1|AE003986_8 (AE003986) hypothetical protein [Xylella fastidiosa 9a5c]
gb|AAF84483.1|AE003993_2 (AE003993) hypothetical protein [Xylella fastidiosa 9a5c]
Length = 316
Score = 46.2 bits (108), Expect = 8e-004
Identities = 34/132 (25%), Positives = 62/132 (46%), Gaps = 14/132 (10%)
Query: 10 LTKNPTLRVLYISATSNLAQKQLSFIKNIFESDIHQKYWPEHLHKDESKREKWTTSEIAL 69
L NP R L+IS + +LA + I +SD ++ WP + D +++W + +
Sbjct: 85 LALNPYARFLHISYSDDLALLNSETAREIVQSDEYRALWPLEIADDAKSKKRW---NVVV 141
Query: 70 DHPDRKKEAIRDPSIFTGGLTTSLTGMHC-----DIAVLDDVVVFENAYTNEGRNKVKSQ 124
D K+A ++ GG T H ++DD + E+AY+ GR+K +
Sbjct: 142 D----GKKAGGVYAVSLGGQVTGFRAGHMAPGWQGAIIIDDPLKVEDAYSKTGRSKANRK 197
Query: 125 YSLLSSIEGSEA 136
L+S+++ +A
Sbjct: 198 --LVSTVKSRKA 207
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.318 0.136 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 309,503,897
Number of Sequences: 1038431
Number of extensions: 13787790
Number of successful extensions: 30521
Number of sequences better than 5.0e-02: 11
Number of HSP's better than 0.1 without gapping: 4
Number of HSP's successfully gapped in prelim test: 7
Number of HSP's that attempted gapping in prelim test: 30506
Number of HSP's gapped (non-prelim): 11
length of query: 448
length of database: 328,470,697
effective HSP length: 124
effective length of query: 324
effective length of database: 199,705,253
effective search space: 64704501972
effective search space used: 64704501972
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 93 (40.4 bits)
Query= orf5 [1152-1341]
(188 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064743.1| (NC_002519) gp5.1 [Roseophage SIO1] >gi|99... 396 e-110
>ref|NP_064743.1| (NC_002519) gp5.1 [Roseophage SIO1]
gb|AAG02588.1|AF189021_6 (AF189021) Roseophage SIO1 complete genome
Length = 188
Score = 396 bits (1017), Expect = e-110
Identities = 188/188 (100%), Positives = 188/188 (100%)
Query: 1 MTLHLENISFVAVPKTATIAIEKAFSPYARGYTPHTHEPVNVVKRDGGQDCLGLIRNPHD 60
MTLHLENISFVAVPKTATIAIEKAFSPYARGYTPHTHEPVNVVKRDGGQDCLGLIRNPHD
Sbjct: 1 MTLHLENISFVAVPKTATIAIEKAFSPYARGYTPHTHEPVNVVKRDGGQDCLGLIRNPHD 60
Query: 61 WIKSYYLYLKHSPYFYSASSAWGIGEKTFEEFVGRFCDGHRLWPEPMRLQSMYLLRNGVS 120
WIKSYYLYLKHSPYFYSASSAWGIGEKTFEEFVGRFCDGHRLWPEPMRLQSMYLLRNGVS
Sbjct: 61 WIKSYYLYLKHSPYFYSASSAWGIGEKTFEEFVGRFCDGHRLWPEPMRLQSMYLLRNGVS 120
Query: 121 TDFIYRFENLLDAVAHLGAACGQMPKMSRHNVSPKCDVILSKKMTYLFEDAASEDFDLYE 180
TDFIYRFENLLDAVAHLGAACGQMPKMSRHNVSPKCDVILSKKMTYLFEDAASEDFDLYE
Sbjct: 121 TDFIYRFENLLDAVAHLGAACGQMPKMSRHNVSPKCDVILSKKMTYLFEDAASEDFDLYE 180
Query: 181 SGADGVMT 188
SGADGVMT
Sbjct: 181 SGADGVMT 188
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.322 0.138 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 130,209,886
Number of Sequences: 1038431
Number of extensions: 5267172
Number of successful extensions: 8796
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 8795
Number of HSP's gapped (non-prelim): 1
length of query: 188
length of database: 328,470,697
effective HSP length: 114
effective length of query: 74
effective length of database: 210,089,563
effective search space: 15546627662
effective search space used: 15546627662
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 88 (38.5 bits)
Query= orf6 [1340-1477]
(136 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_102239.1| (NC_002678) unknown protein [Mesorhizobium... 44 2e-004
>ref|NP_102239.1| (NC_002678) unknown protein [Mesorhizobium loti]
dbj|BAB48025.1| (AP002994) unknown protein [Mesorhizobium loti]
Length = 130
Score = 44.3 bits (103), Expect = 2e-004
Identities = 21/57 (36%), Positives = 32/57 (55%), Gaps = 3/57 (5%)
Query: 1 MKYKSARYINSTKIDCEIEHPVHGWIPFTCDPSDA---GSEFDVAALHAEMAANPET 54
M+Y++A + +D EI HP +GWIPFT P D+ G E +A + +P+T
Sbjct: 2 MEYRNAAFNQFGTVDLEINHPEYGWIPFTASPDDSEELGREIFASASPIAASYDPKT 58
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.316 0.133 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 91,963,603
Number of Sequences: 1038431
Number of extensions: 3379852
Number of successful extensions: 8424
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 8423
Number of HSP's gapped (non-prelim): 1
length of query: 136
length of database: 328,470,697
effective HSP length: 112
effective length of query: 24
effective length of database: 212,166,425
effective search space: 5091994200
effective search space used: 5091994200
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 83 (36.6 bits)
Query= orf7 [1477-1675]
(197 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064744.1| (NC_002519) gp6 [Roseophage SIO1] >gi|9944... 362 1e-099
>ref|NP_064744.1| (NC_002519) gp6 [Roseophage SIO1]
gb|AAG02589.1|AF189021_7 (AF189021) Roseophage SIO1 complete genome
Length = 249
Score = 362 bits (929), Expect = 1e-099
Identities = 170/197 (86%), Positives = 170/197 (86%)
Query: 1 MLEVTSDGSMVYQRYTTYNSVHTVYVRTYYSGTWYNWAKQWDNLNDGSGSGLDADLLDGQ 60
MLEVTSDGSMVYQRYTTYNSVHTVYVRTYYSGTWYNWAKQWDNLNDGSGSGLDADLLDGQ
Sbjct: 53 MLEVTSDGSMVYQRYTTYNSVHTVYVRTYYSGTWYNWAKQWDNLNDGSGSGLDADLLDGQ 112
Query: 61 QGSYYYPASNPNGYTAYTNSDVDTHLNTGTATTDQVLSWNGTDYDWIDGGGAPPTDLHAV 120
QGSYYYPASNPNGYTAYTNSDVDTHLNTGTATTDQVLSWNGTDYDWIDGGGAPPTDLHAV
Sbjct: 113 QGSYYYPASNPNGYTAYTNSDVDTHLNTGTATTDQVLSWNGTDYDWIDGGGAPPTDLHAV 172
Query: 121 GTYTVACTGVNVAQGGTTAGSNLRYRNIYALDRFXXXXXXXXXXXXXGTWRNVGGPVXXX 180
GTYTVACTGVNVAQGGTTAGSNLRYRNIYALDRF GTWRNVGGPV
Sbjct: 173 GTYTVACTGVNVAQGGTTAGSNLRYRNIYALDRFTVLTTTSSTVTLTGTWRNVGGPVQYV 232
Query: 181 XXXXXXXXXXXLWVRIS 197
LWVRIS
Sbjct: 233 SGRGGGQRGVSLWVRIS 249
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.314 0.133 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 138,118,973
Number of Sequences: 1038431
Number of extensions: 6353711
Number of successful extensions: 9494
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 9493
Number of HSP's gapped (non-prelim): 1
length of query: 197
length of database: 328,470,697
effective HSP length: 115
effective length of query: 82
effective length of database: 209,051,132
effective search space: 17142192824
effective search space used: 17142192824
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 88 (38.5 bits)
Query= orf8 [1705-2246]
(540 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064745.1| (NC_002519) gp7 [Roseophage SIO1] >gi|9944... 531 e-150
ref|NP_570387.1| (NC_003390) unknown [Cyanophage P60] >gi|1... 97 4e-019
ref|NP_519008.1| (NC_003295) PROBABLE HEMAGGLUTININ-RELATED... 44 0.005
gb|AAC04580.1| (AF048699) surface layer protein [Campylobac... 41 0.042
>ref|NP_064745.1| (NC_002519) gp7 [Roseophage SIO1]
gb|AAG02590.1|AF189021_8 (AF189021) Roseophage SIO1 complete genome
Length = 540
Score = 531 bits (1368), Expect = e-150
Identities = 283/540 (52%), Positives = 283/540 (52%)
Query: 1 MAKKPTVTTLQSGFNSTEVLNENFENIRDAFDNTLSLDGSTPNAMQADLDLNGNNLIGAT 60
MAKKPTVTTLQSGFNSTEVLNENFENIRDAFDNTLSLDGSTPNAMQADLDLNGNNLIGAT
Sbjct: 1 MAKKPTVTTLQSGFNSTEVLNENFENIRDAFDNTLSLDGSTPNAMQADLDLNGNNLIGAT 60
Query: 61 GLLINGTDYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120
GLLINGTDY
Sbjct: 61 GLLINGTDYLADVEAAKAAALVAQAAAELAENNAETAEVNAEASETAAGLSATAAATSAT 120
Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
Sbjct: 121 NAGASETAASASATASATSATNSATSASQAATSASAAAVSEGNAATSETNAANSATSASG 180
Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
Sbjct: 181 SASTATTQASAASVSATNAATSASNAATSATNAAASQAAAATSETNAAASESTVTTSATN 240
Query: 241 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300
Sbjct: 241 AATSEANAATSASTATTQATNAATSASTASTSATNAATSETNAASSASSAASSATSAQAS 300
Query: 301 KXXXXXXXXXXXXRYLGQKVSDPTLDNDGNALVAGALYFNTTDGIMKVYDGSVWLAAYAS 360
K RYLGQKVSDPTLDNDGNALVAGALYFNTTDGIMKVYDGSVWLAAYAS
Sbjct: 301 KDAALAALDSFDDRYLGQKVSDPTLDNDGNALVAGALYFNTTDGIMKVYDGSVWLAAYAS 360
Query: 361 LSGALIATNNLSDVLDVTASRTNLGLGXXXXXXXXXXXXXXQGALAVSAVQPNDSPSFGS 420
LSGALIATNNLSDVLDVTASRTNLGLG QGALAVSAVQPNDSPSFGS
Sbjct: 361 LSGALIATNNLSDVLDVTASRTNLGLGTAATTASTDYATAAQGALAVSAVQPNDSPSFGS 420
Query: 421 VTVTGTVDGRNVAADGSKLDGIEAGATADQTASEILTSIKTVDGSGSGLDADLLDGIHAS 480
VTVTGTVDGRNVAADGSKLDGIEAGATADQTASEILTSIKTVDGSGSGLDADLLDGIHAS
Sbjct: 421 VTVTGTVDGRNVAADGSKLDGIEAGATADQTASEILTSIKTVDGSGSGLDADLLDGIHAS 480
Query: 481 SFLQGNQTITLSGDVSGSGTTSILVTVADDSHNHIIANVDGLQTALDGKWSKGADIGGGC 540
SFLQGNQTITLSGDVSGSGTTSILVTVADDSHNHIIANVDGLQTALDGKWSKGADIGGGC
Sbjct: 481 SFLQGNQTITLSGDVSGSGTTSILVTVADDSHNHIIANVDGLQTALDGKWSKGADIGGGC 540
>ref|NP_570387.1| (NC_003390) unknown [Cyanophage P60]
gb|AAL73325.1| (AF338467) unknown [Cyanophage P60]
Length = 126
Score = 97.4 bits (241), Expect = 4e-019
Identities = 50/80 (62%), Positives = 62/80 (77%), Gaps = 1/80 (1%)
Query: 419 GSVTVTGTVDGRNVAADGSKLDGIEAGATADQTASEILTSIKTVDGSGSGLDADLLDGIH 478
G++TVTGTVDGR+V+ DG+KLDG+EA AT DQTASEILT + TVDG+GSGLDAD LDG +
Sbjct: 17 GNITVTGTVDGRDVSGDGTKLDGMEANATGDQTASEILTLLLTVDGTGSGLDADKLDGKN 76
Query: 479 ASSFLQGNQTITLSGDVSGS 498
S +L N T D+ G+
Sbjct: 77 GSHYLNYNNA-TNKPDIRGN 95
>ref|NP_519008.1| (NC_003295) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD14589.1| (AL646061) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 3501
Score = 43.9 bits (102), Expect = 0.005
Identities = 48/200 (24%), Positives = 78/200 (39%), Gaps = 13/200 (6%)
Query: 320 VSDPTLDNDGNALVAGALYFNTTDGIMKVYDGSVWLAAYASLSGALIATNNLSDVLDVTA 379
VS ++DN G +VAG + ++ G V A+++ A TN + A
Sbjct: 851 VSGGSVDNSGGTVVAGGTAVVQSGSTLRNAAGRVQAKGSAAVTAAGAITNTGGQIEADGA 910
Query: 380 SRTNLGLGXXXXXXXXXXXXXXQGALAVSAVQPNDSPSFGSVTVTGTVDGR-NVAADGSK 438
S T G GA V A ++ + G V GT+ G +V G
Sbjct: 911 SATLQVTGASLDNTNGRIANAGSGATTVGAASITNA-NTGGVAGAGTIGGNGDVTVSGQT 969
Query: 439 LDGIEAG---ATADQTASEILTSIKTVDGSGSGLDADLLDGIHASSFLQGNQTITLSGDV 495
L + G A D T L + ++V+ SG L A + ++ G + +G +
Sbjct: 970 LSNTQGGQIVAGHDLT----LATTRSVNNSGGTLSA----ANNVTANAAGAAVVNQAGSI 1021
Query: 496 SGSGTTSILVTVADDSHNHI 515
G+G S+ V D++ I
Sbjct: 1022 RGNGAVSLNVASLDNTSGKI 1041
>gb|AAC04580.1| (AF048699) surface layer protein [Campylobacter fetus]
Length = 1112
Score = 40.8 bits (94), Expect = 0.042
Identities = 55/211 (26%), Positives = 85/211 (40%), Gaps = 18/211 (8%)
Query: 328 DGNALVAGALYFNTTDGIMKVYDGSVWLAAYASLSGALIATNNLSDVLDVTASRTNLGLG 387
+ N V+G TT V D ++ +L +T N S ++ LG
Sbjct: 669 EANINVSGEFKLGTTTAKGDV-DFNIENVGALTLGAITSSTGNASVIISSATGNVTLGAV 727
Query: 388 XXXXXXXXXXXXXXQGALAVSAVQPNDSPSFGSVTVTGTVDGRNVAADGSKLDGIEAGAT 447
G + + A+ D S V GT++ +A G+K++ T
Sbjct: 728 SATQGNLTLNAGNTLGNITIGALA-GDIVSVDLGGVLGTIN----SASGNKVEITSNEVT 782
Query: 448 ADQTASEILTSIK--TVDGSGSGLDADLLDGIHASSFLQ-----GNQTITLSGDVSGSGT 500
SEI ++ T G+ L+A ++ G A L QTIT SGD+SG GT
Sbjct: 783 --YVGSEISKNVVEITAAAGGTDLNAQVIGGAAADDALTIIGKGDTQTITASGDLSG-GT 839
Query: 501 TSILVTVADDSHNHIIANVDGLQ--TALDGK 529
++ +T A + I+ V GL TA+D K
Sbjct: 840 LTLTLTDATKLSSLDISGVKGLSAATAIDLK 870
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.310 0.130 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 180,332,444
Number of Sequences: 1038431
Number of extensions: 6600334
Number of successful extensions: 15805
Number of sequences better than 5.0e-02: 4
Number of HSP's better than 0.1 without gapping: 2
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 15795
Number of HSP's gapped (non-prelim): 9
length of query: 540
length of database: 328,470,697
effective HSP length: 126
effective length of query: 414
effective length of database: 197,628,391
effective search space: 81818153874
effective search space used: 81818153874
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 94 (40.8 bits)
Query= orf9 [2247-2411]
(163 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_640324.1| (NC_003907) unknown [Vibriophage VpV262] >... 43 0.001
>ref|NP_640324.1| (NC_003907) unknown [Vibriophage VpV262]
gb|AAM28396.1| (AY095314) unknown [Vibriophage VpV262]
Length = 185
Score = 42.7 bits (99), Expect = 0.001
Identities = 40/150 (26%), Positives = 62/150 (40%), Gaps = 27/150 (18%)
Query: 19 RNEFTNYSLKKKDHKVGDKTYLSLHKIYIEMEDPTEYEFALSVFGDYSVWENLCNLSWFK 78
R ++ ++L +DH D T LS ++Y + + ++YE A+ + G + W+ LC WF
Sbjct: 46 RGKYPVFNLSMRDH---DDT-LSFPQLYFQCD--SDYEAAMVILGSWGHWQRLCEAKWFA 99
Query: 79 THHVQMQKELMLKLKARTIRNMINDLEEGKASYNAQKYLADAGYL-------DNGSKKRG 131
Q+E KA+ D+ G+A A LADAG L G +
Sbjct: 100 EKLALWQEE-----KAQ------RDIAHGRAKIKA---LADAGNLSACKFLASGGLSDKP 145
Query: 132 RPSKDELDGALKQAALEKAETEDDATRIGL 161
+ K K EDD + GL
Sbjct: 146 KQEKPSNPKPAKNPEPAPTAEEDDFLKHGL 175
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.314 0.131 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 108,933,295
Number of Sequences: 1038431
Number of extensions: 4364355
Number of successful extensions: 7985
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 7985
Number of HSP's gapped (non-prelim): 1
length of query: 163
length of database: 328,470,697
effective HSP length: 112
effective length of query: 51
effective length of database: 212,166,425
effective search space: 10820487675
effective search space used: 10820487675
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 86 (37.7 bits)
Query= orf10 [2701-2753]
(51 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.313 0.136 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 30,367,118
Number of Sequences: 1038431
Number of extensions: 779413
Number of successful extensions: 1468
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 1468
Number of HSP's gapped (non-prelim): 0
length of query: 51
length of database: 328,470,697
effective HSP length: 27
effective length of query: 24
effective length of database: 300,433,060
effective search space: 7210393440
effective search space used: 7210393440
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 85 (37.4 bits)
Query= orf11 [2751-2948]
(196 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064747.1| (NC_002519) gp9 [Roseophage SIO1] >gi|9944... 375 e-103
>ref|NP_064747.1| (NC_002519) gp9 [Roseophage SIO1]
gb|AAG02592.1|AF189021_10 (AF189021) Roseophage SIO1 complete genome
Length = 196
Score = 375 bits (963), Expect = e-103
Identities = 183/196 (93%), Positives = 183/196 (93%)
Query: 1 MTNHVRNILKLYRQATQEDTINGVEWYARAERMAKAIASDAGLPLPTVIGVMAALSPNNR 60
MTNHVRNILKLYRQATQEDTINGVEWYARAERMAKAIASDAGLPLPTVIGVMAALSPNNR
Sbjct: 1 MTNHVRNILKLYRQATQEDTINGVEWYARAERMAKAIASDAGLPLPTVIGVMAALSPNNR 60
Query: 61 WERNCRDAATMCKAWQNGDSMDSFKVSCYNTMKAKAWAILDLGLTDDEDILSHLNGQKIR 120
WERNCRDAATMCKAWQNGDSMDSFKVSCYNTMKAKAWAILDLGLTDDEDILSHLNGQKIR
Sbjct: 61 WERNCRDAATMCKAWQNGDSMDSFKVSCYNTMKAKAWAILDLGLTDDEDILSHLNGQKIR 120
Query: 121 SFYSNIRGLDEVTIDGHALNIARGKREGLTSDKTNMGKREYRELQXXXXXXXXXXXXXPH 180
SFYSNIRGLDEVTIDGHALNIARGKREGLTSDKTNMGKREYRELQ PH
Sbjct: 121 SFYSNIRGLDEVTIDGHALNIARGKREGLTSDKTNMGKREYRELQAAYVRAAKRVKVKPH 180
Query: 181 VLQAITWTTWKRVHNI 196
VLQAITWTTWKRVHNI
Sbjct: 181 VLQAITWTTWKRVHNI 196
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.318 0.132 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 118,760,735
Number of Sequences: 1038431
Number of extensions: 4380212
Number of successful extensions: 8453
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 8452
Number of HSP's gapped (non-prelim): 1
length of query: 196
length of database: 328,470,697
effective HSP length: 115
effective length of query: 81
effective length of database: 209,051,132
effective search space: 16933141692
effective search space used: 16933141692
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 88 (38.5 bits)
Query= orf12 [2956-3099]
(142 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064748.1| (NC_002519) gp10 [Roseophage SIO1] >gi|994... 277 1e-074
>ref|NP_064748.1| (NC_002519) gp10 [Roseophage SIO1]
gb|AAG02593.1|AF189021_11 (AF189021) Roseophage SIO1 complete genome
Length = 142
Score = 277 bits (709), Expect = 1e-074
Identities = 142/142 (100%), Positives = 142/142 (100%)
Query: 1 MTNTQHKNILKHLKTAKGLTVREALIEYSISSLTKRVHELRGLGYDIESVRKKHPCNRSK 60
MTNTQHKNILKHLKTAKGLTVREALIEYSISSLTKRVHELRGLGYDIESVRKKHPCNRSK
Sbjct: 1 MTNTQHKNILKHLKTAKGLTVREALIEYSISSLTKRVHELRGLGYDIESVRKKHPCNRSK 60
Query: 61 IYTLLSTRGARAMSMCGEIENTENAIKSLRAKEFNIQFNSIGLTDKERLTIVNTIREELL 120
IYTLLSTRGARAMSMCGEIENTENAIKSLRAKEFNIQFNSIGLTDKERLTIVNTIREELL
Sbjct: 61 IYTLLSTRGARAMSMCGEIENTENAIKSLRAKEFNIQFNSIGLTDKERLTIVNTIREELL 120
Query: 121 EETVRLNKLLELANKAEGSHRA 142
EETVRLNKLLELANKAEGSHRA
Sbjct: 121 EETVRLNKLLELANKAEGSHRA 142
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.315 0.130 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 72,910,001
Number of Sequences: 1038431
Number of extensions: 2372231
Number of successful extensions: 7420
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 7419
Number of HSP's gapped (non-prelim): 1
length of query: 142
length of database: 328,470,697
effective HSP length: 118
effective length of query: 24
effective length of database: 205,935,839
effective search space: 4942460136
effective search space used: 4942460136
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 83 (36.6 bits)
Query= orf13 [3200-3250]
(49 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.312 0.134 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 33,743,079
Number of Sequences: 1038431
Number of extensions: 1003505
Number of successful extensions: 1375
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 1375
Number of HSP's gapped (non-prelim): 0
length of query: 49
length of database: 328,470,697
effective HSP length: 25
effective length of query: 24
effective length of database: 302,509,922
effective search space: 7260238128
effective search space used: 7260238128
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 85 (37.4 bits)
Query= orf14 [3249-3472]
(222 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064749.1| (NC_002519) RP thymidylate synthase [Roseo... 461 e-129
ref|NP_228259.1| (NC_000853) thy1 protein [Thermotoga marit... 112 3e-024
pdb|1KQ4|C Chain C, Crystal Structure Of Thy1-Complementing... 110 9e-024
sp|P15808|THY1_DICDI Thy1 protein >gi|279686|pir||YXDOTC th... 104 8e-022
ref|NP_360042.1| (NC_003103) thy1 protein [Rickettsia conor... 103 1e-021
ref|NP_219444.1| (NC_000919) thymidylate synthase-complemen... 102 3e-021
ref|NP_045749.1| (NC_001857) thymidylate synthase-complemen... 98 8e-020
ref|NP_570363.1| (NC_003390) unknown [Cyanophage P60] >gi|1... 87 1e-016
ref|NP_629868.1| (NC_003888) conserved hypothetical protein... 71 8e-012
ref|NP_049030.1| (NC_000852) similar to Synechocystis ORF s... 65 4e-010
ref|NP_280185.1| (NC_002607) Vng1325c [Halobacterium sp. NR... 45 8e-004
sp|Q9HQ52|YD25_HALN1 Hypothetical protein Vng1325c 45 8e-004
ref|NP_578269.1| (NC_003413) hypothetical protein [Pyrococc... 42 0.004
ref|NP_542556.1| (NC_003345) putative phage C31 protein gp1... 40 0.025
>ref|NP_064749.1| (NC_002519) RP thymidylate synthase [Roseophage SIO1]
gb|AAG02594.1|AF189021_12 (AF189021) Roseophage SIO1 complete genome
Length = 222
Score = 461 bits (1185), Expect = e-129
Identities = 222/222 (100%), Positives = 222/222 (100%)
Query: 1 MTQIEATYIDHMGSDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSGRDTKLIKYLAKHK 60
MTQIEATYIDHMGSDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSGRDTKLIKYLAKHK
Sbjct: 1 MTQIEATYIDHMGSDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSGRDTKLIKYLAKHK 60
Query: 61 HISPFGHAFASFHVKAPIFVARQLVKHKFLRWNEISRRYVDDEPEFYTPDVWRGRSADKK 120
HISPFGHAFASFHVKAPIFVARQLVKHKFLRWNEISRRYVDDEPEFYTPDVWRGRSADKK
Sbjct: 61 HISPFGHAFASFHVKAPIFVARQLVKHKFLRWNEISRRYVDDEPEFYTPDVWRGRSADKK 120
Query: 121 QGSDGVVNPEYNPQYLDNKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFAD 180
QGSDGVVNPEYNPQYLDNKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFAD
Sbjct: 121 QGSDGVVNPEYNPQYLDNKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFAD 180
Query: 181 MCRLRCKEDTQYESRVVADQISEKMADLYPVSWAALMEGEKQ 222
MCRLRCKEDTQYESRVVADQISEKMADLYPVSWAALMEGEKQ
Sbjct: 181 MCRLRCKEDTQYESRVVADQISEKMADLYPVSWAALMEGEKQ 222
>ref|NP_228259.1| (NC_000853) thy1 protein [Thermotoga maritima]
sp|Q9WYT0|Y449_THEMA Hypothetical protein TM0449
pir||B72375 thy1 protein - Thermotoga maritima (strain MSB8)
gb|AAD35532.1|AE001723_2 (AE001723) thy1 protein [Thermotoga maritima]
Length = 220
Score = 112 bits (280), Expect = 3e-024
Identities = 73/215 (33%), Positives = 110/215 (50%), Gaps = 26/215 (12%)
Query: 9 IDHMGSDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSG--RDTKLIKYLAKHKHISPFG 66
+D MG+DLS V AARVSF D GL RD LI+YL KH H +PF
Sbjct: 14 VDVMGNDLSAVRAARVSF---------------DMGLKDEERDRHLIEYLMKHGHETPFE 58
Query: 67 HAFASFHVKAPIFVARQLVKHKFLRWNEISRRYVDDEPEFYTPDVWRGRSADKKQGSDGV 126
H +FHVKAPIFVARQ +H+ +NE+S RY EFY P S ++ +G
Sbjct: 59 HIVFTFHVKAPIFVARQWFRHRIASYNELSGRYSKLSYEFYIP------SPERLEGYKTT 112
Query: 127 VNPEYNPQYLD---NKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFADMCR 183
+ PE + + +K YL+ ++ G+ E AR++LP + T ++W+ + + +
Sbjct: 113 IPPERVTEKISEIVDKAYRTYLELIESGVPREVARIVLPLNLYTRFFWTVNARSLMNFLN 172
Query: 184 LRCKEDTQYESRVVADQISEKMADLYPVSWAALME 218
LR Q+E + A I+ + P ++ A ++
Sbjct: 173 LRADSHAQWEIQQYALAIARIFKEKCPWTFEAFLK 207
>pdb|1KQ4|C Chain C, Crystal Structure Of Thy1-Complementing Protein (Tm0449)
From Thermotoga Maritima At 2.25 A Resolution
pdb|1KQ4|B Chain B, Crystal Structure Of Thy1-Complementing Protein (Tm0449)
From Thermotoga Maritima At 2.25 A Resolution
pdb|1KQ4|D Chain D, Crystal Structure Of Thy1-Complementing Protein (Tm0449)
From Thermotoga Maritima At 2.25 A Resolution
pdb|1KQ4|A Chain A, Crystal Structure Of Thy1-Complementing Protein (Tm0449)
From Thermotoga Maritima At 2.25 A Resolution
Length = 232
Score = 110 bits (276), Expect = 9e-024
Identities = 72/215 (33%), Positives = 109/215 (50%), Gaps = 26/215 (12%)
Query: 9 IDHMGSDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSG--RDTKLIKYLAKHKHISPFG 66
+D G+DLS V AARVSF D GL RD LI+YL KH H +PF
Sbjct: 26 VDVXGNDLSAVRAARVSF---------------DXGLKDEERDRHLIEYLXKHGHETPFE 70
Query: 67 HAFASFHVKAPIFVARQLVKHKFLRWNEISRRYVDDEPEFYTPDVWRGRSADKKQGSDGV 126
H +FHVKAPIFVARQ +H+ +NE+S RY EFY P S ++ +G
Sbjct: 71 HIVFTFHVKAPIFVARQWFRHRIASYNELSGRYSKLSYEFYIP------SPERLEGYKTT 124
Query: 127 VNPEYNPQYLD---NKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFADMCR 183
+ PE + + +K YL+ ++ G+ E AR++LP + T ++W+ + + +
Sbjct: 125 IPPERVTEKISEIVDKAYRTYLELIESGVPREVARIVLPLNLYTRFFWTVNARSLXNFLN 184
Query: 184 LRCKEDTQYESRVVADQISEKMADLYPVSWAALME 218
LR Q+E + A I+ + P ++ A ++
Sbjct: 185 LRADSHAQWEIQQYALAIARIFKEKCPWTFEAFLK 219
>sp|P15808|THY1_DICDI Thy1 protein
pir||YXDOTC thymidylate synthase-complementing protein - slime mold
(Dictyostelium discoideum)
gb|AAA33257.1| (M27713) Thy1 protein [Dictyostelium discoideum]
Length = 260
Score = 104 bits (259), Expect = 8e-022
Identities = 70/206 (33%), Positives = 101/206 (48%), Gaps = 20/206 (9%)
Query: 14 SDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSGRDTKLIKYLAKHKHISPFGHAFASFH 73
+D S+ AARVS+G ++ V DKGL I+YL +H+H SPF FH
Sbjct: 59 ADFSICQAARVSYGAGTKKVT------EDKGL-------IRYLYRHQHTSPFEMVEFKFH 105
Query: 74 VKAPIFVARQLVKHKFLRWNEISRRYVDDEPEFYTPDV--WRGRSADKKQGSDGVVNPEY 131
P+F+ARQ ++H+ NE S RY +FY P + R +S +QG + + P+
Sbjct: 106 CVMPVFIARQWIRHRTANVNEYSARYSVLPDKFYHPSIEEVRKQSTSNRQGGEEALEPKT 165
Query: 132 NPQYLDNKIKF-----AYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFADMCRLRC 186
++LD K Y + L+ G+S E R+ LP S TEWYW L RLR
Sbjct: 166 AQEFLDYLDKVEENYKTYNELLEKGLSRELGRIGLPVSIYTEWYWKIDLHNLFHFLRLRM 225
Query: 187 KEDTQYESRVVADQISEKMADLYPVS 212
+Q E R A+ I + + PV+
Sbjct: 226 DSHSQKEIRDYANTIFALIRPIVPVA 251
>ref|NP_360042.1| (NC_003103) thy1 protein [Rickettsia conorii]
gb|AAL02943.1| (AE008605) thy1 protein [Rickettsia conorii]
Length = 294
Score = 103 bits (257), Expect = 1e-021
Identities = 77/231 (33%), Positives = 109/231 (46%), Gaps = 34/231 (14%)
Query: 9 IDHMGSDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSGRDTKLIKYLAKHKHISPFGHA 68
ID+MG D ++V AARVS+GK ++ + +DKGL I YL +H H +PF
Sbjct: 32 IDYMGDDSAIVQAARVSYGKGTKQL------NQDKGL-------INYLLRHYHTTPFEMC 78
Query: 69 FASFHVKAPIFVARQLVKHKFLRWNEISRRYVDDEPEFYTPDVWR--GRSADKKQGSDGV 126
FH+K PIF+ARQ ++H+ NE S RY EFY P+ +SA KQ +G
Sbjct: 79 DIKFHIKLPIFIARQWIRHRTASVNEYSARYSILGNEFYLPEPANIASQSAVNKQCREGD 138
Query: 127 VNPEYNPQ----YLDNKIKFAYLQALDI---------------GISPEQARMLLPQSTMT 167
P+ + L+ + Y ++ GI+ E ARM L + T
Sbjct: 139 SLPKEVAEKVLAILEEDARQCYGHYKELMNADEEGNIIDENATGIARELARMNLTLNYYT 198
Query: 168 EWYWSGSLDAFADMCRLRCKEDTQYESRVVADQISEKMADLYPVSWAALME 218
EWYW +L RLR QYE RV A+++ E + P + A E
Sbjct: 199 EWYWKINLHNLLHFLRLRADPHAQYEIRVYAEKMLEIVKAWVPFVYEAFEE 249
>ref|NP_219444.1| (NC_000919) thymidylate synthase-complementing protein (thy1)
[Treponema pallidum]
sp|O83971|YA07_TREPA Hypothetical protein TP1007
pir||A71255 probable thymidylate synthase-complementing protein (thy1) -
syphilis spirochete
gb|AAC65955.1| (AE001268) thymidylate synthase-complementing protein (thy1)
[Treponema pallidum]
Length = 305
Score = 102 bits (254), Expect = 3e-021
Identities = 77/223 (34%), Positives = 105/223 (46%), Gaps = 24/223 (10%)
Query: 9 IDHMGSDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSGRDTKLIKYLAKHKHISPFGHA 68
ID++G D +V AARVS+G G RD D LI +L ++KH SPF
Sbjct: 55 IDYLGGDARIVQAARVSYG-------AGTRTARD------DAALIDFLLRNKHTSPFEQV 101
Query: 69 FASFHVKAPIFVARQLVKHKFLRWNEISRRYVDDEPEFYTPDVW--RGRSADKKQG--SD 124
+FHV+APIFVARQ ++H+ R +E+S RY + Y P +S KQG S+
Sbjct: 102 VLTFHVRAPIFVARQWMRHRTARISEVSSRYSLLSHDCYVPQETSVAVQSTRNKQGRASE 161
Query: 125 GVVNPEYNPQY------LDNKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAF 178
G ++PE + AY + I+ E AR+ +P S TEWYW L
Sbjct: 162 G-ISPEQQQEVRAAFEAQQKAACAAYDALIQKNIARELARINVPLSLYTEWYWQIDLHNL 220
Query: 179 ADMCRLRCKEDTQYESRVVADQISEKMADLYPVSWAALMEGEK 221
RLR Q E R A+ I E + P + A+ EK
Sbjct: 221 FHFLRLRASAHAQAEIRAYAEVIIEITRAVAPCATASFENHEK 263
>ref|NP_045749.1| (NC_001857) thymidylate synthase-complementing protein (thy1)
[Borrelia burgdorferi]
sp|O50965|YP76_BORBU Hypothetical protein BBA76
pir||D70216 thymidylate synthase-complementing protein (thy1) homolog - Lyme
disease spirochete plasmid A/lp54
gb|AAC66242.1| (AE000790) thymidylate synthase-complementing protein (thy1)
[Borrelia burgdorferi]
Length = 265
Score = 97.8 bits (242), Expect = 8e-020
Identities = 71/222 (31%), Positives = 105/222 (46%), Gaps = 36/222 (16%)
Query: 9 IDHMGSDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSGRDTKLIKYLAKHKHISPFGHA 68
ID MG D +V AAR+S+ R++ + +D +LI YL ++ H SP
Sbjct: 16 IDFMGDDRRIVKAARISY--------------REESVKRKDAELIDYLIRNGHTSPLEQV 61
Query: 69 FASFHVKAPIFVARQLVKHKFLRWNEISRRYVDDEPEFYTPDVWRGRSADKK-QGSDGVV 127
+FHVKAPIFVARQ ++H+ R NE+S Y EFY P D K Q S
Sbjct: 62 VFTFHVKAPIFVARQWMRHRTARINEVSGCYSLAREEFYVP-----LEEDLKCQTSSNSS 116
Query: 128 NPEYNP-QYLDNKIKF-------AYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFA 179
E+ + L +KIK Y ++ I E +R++LP S TEWYW L+
Sbjct: 117 EKEFKSLEKLSDKIKHHQKHSYELYQDMINANIPKELSRIVLPLSLYTEWYWQIDLNNLF 176
Query: 180 DMCRLR--------CKEDTQYESRVVADQISEKMADLYPVSW 213
+LR KE++ E R A + + ++ P+++
Sbjct: 177 HFIKLRLALDSPKEIKENSPKEMREYAKALISIVREIVPIAF 218
>ref|NP_570363.1| (NC_003390) unknown [Cyanophage P60]
gb|AAL73301.1| (AF338467) unknown [Cyanophage P60]
Length = 235
Score = 87.0 bits (214), Expect = 1e-016
Identities = 60/210 (28%), Positives = 108/210 (50%), Gaps = 7/210 (3%)
Query: 15 DLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSGRDTKLIKYLAKHKHISPFGHAFASFHV 74
++S V+ V+ + Y + +D L +LI+YL KH H SPF A +
Sbjct: 25 NMSKVSLISVTPDAEELIAYLARVSNKDAKLGDPADRLIRYLIKHGHWSPFEMAHMVVEI 84
Query: 75 KAPIFVARQLVKHKFLRWNEISRRY-VDDEPEFYTPDVWRGRSADKKQGSDGV---VNPE 130
+ +A Q+++H+ + E S+RY V D EF P++ R + +++ D + + E
Sbjct: 85 ETTRAIAPQILRHRSFSFQEFSQRYAVADLGEFEVPELRRQDTKNRQNSIDDLPAEIVAE 144
Query: 131 YNPQYLD--NKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFADMCRLRCKE 188
+ + + + Y + L G++ E AR +LP +T T+ Y SG++ ++ C+LRC
Sbjct: 145 FQSEIDEVFETTRVLYQRMLSYGVAKECARGILPLNTPTKLYMSGTIRSWIHYCQLRCGN 204
Query: 189 DTQYESRVVADQISEKM-ADLYPVSWAALM 217
TQ E R++A++ + A+L V+ AA M
Sbjct: 205 GTQEEHRLIAEECKNLLYANLPNVAKAAFM 234
>ref|NP_629868.1| (NC_003888) conserved hypothetical protein SC9A10.07 [Streptomyces
coelicolor A3(2)]
pir||T35843 hypothetical protein SC9A10.07 SC9A10.07 - Streptomyces coelicolor
emb|CAA20294.1| (AL031260) conserved hypothetical protein SC9A10.07 [Streptomyces
coelicolor A3(2)]
Length = 246
Score = 71.2 bits (173), Expect = 8e-012
Identities = 64/230 (27%), Positives = 102/230 (43%), Gaps = 23/230 (10%)
Query: 2 TQIEATYIDHMGSDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSGRDTKLIKYLAKHKH 61
+ I + +D V+ AARVS + + R KGL I YL + +H
Sbjct: 15 SDITVELVKSAATDSDVLFAARVSTAGEQSLDELKKDPERSKGL-------INYLMRDRH 67
Query: 62 ISPFGHAFASFHVKAPIFVARQLVKHKF-LRWNEISRRYVDDEPEFYTPDVWRGRSADKK 120
SPF H +F V APIFV R+ ++H+ +NE S RY + +P FY PD R +
Sbjct: 68 GSPFEHNSMTFFVSAPIFVFREFMRHRVGWSYNEESGRYRELQPVFYAPDASRKLVQQGR 127
Query: 121 QGSDGVV--NPEYNP---QYLDNKIKFAYL---QALDIGISPEQARMLLPQSTMTEWYWS 172
G V PE + +++ + AY Q L G++ E AR +LP + Y +
Sbjct: 128 PGKYVFVEGTPEQHELVGSAMEDSYRQAYATYQQMLAAGVAREVARAVLPVGLYSSMYAT 187
Query: 173 GSLDAFADMCRLRCKED-------TQYESRVVADQISEKMADLYPVSWAA 215
+ + LR + + Q E + +++ + A L P++ AA
Sbjct: 188 CNARSLMHFLGLRTQHELAKVPSFPQREIEMAGEKMEAEWARLMPLTHAA 237
>ref|NP_049030.1| (NC_000852) similar to Synechocystis ORF s111635, corresponds to
GenBank Accession Number D90903 [Paramecium bursaria
Chlorella virus 1]
sp|O41156|A674_CHVP1 Putative protein A674R
pir||T18176 hypothetical protein A674R - Chlorella virus PBCV-1
gb|AAC96983.1| (U42580) similar to Synechocystis ORF s111635, corresponds to
GenBank Accession Number D90903 [Paramecium bursaria
Chlorella virus 1]
Length = 216
Score = 65.5 bits (158), Expect = 4e-010
Identities = 50/167 (29%), Positives = 77/167 (45%), Gaps = 11/167 (6%)
Query: 52 LIKYLAKHKHISPFGHAFASFHVKAPIFVARQLVKHKFLRWNEISRRYVDDEPEFYTPDV 111
L+KY +HKH S F AF + +K +A Q+++H+ + E S+RY TP
Sbjct: 44 LLKYCIRHKHWSIFETAFMTLELKTSRGIAAQVLRHRSFHFQEFSQRYAS---VMETPPP 100
Query: 112 WRGRSADKKQGSDGV-VNPEYNPQYLDNKIKFAYLQ-------ALDIGISPEQARMLLPQ 163
+ R D K + + PE + + + + Y Q AL+ GI+ E AR +LP
Sbjct: 101 HQARFQDHKNRQNSLDTVPEDDQTWWATEQEKLYAQSMELYNKALEKGIAKECARFILPL 160
Query: 164 STMTEWYWSGSLDAFADMCRLRCKEDTQYESRVVADQISEKMADLYP 210
ST T Y SG++ + LR TQ E +A+ E +P
Sbjct: 161 STPTTIYMSGTIRDWIHYIELRTSNGTQREHIDLANACKEIFIKEFP 207
>ref|NP_280185.1| (NC_002607) Vng1325c [Halobacterium sp. NRC-1]
gb|AAG19665.1| (AE005054) Vng1325c [Halobacterium sp. NRC-1]
Length = 266
Score = 44.7 bits (104), Expect = 8e-004
Identities = 53/207 (25%), Positives = 81/207 (38%), Gaps = 30/207 (14%)
Query: 30 SEWVYCGQSDGRDKGLSGRDT-----KLIKYLAKHKHISPFGHAFASFHVKAPIFVAR-- 82
S+WV D + G T LI L H PF H A+F ++ V+R
Sbjct: 45 SDWVGDTPLDTAMASVDGDTTDEKLSNLIAQLLTRGHYGPFEHPSATFAIEG---VSRSC 101
Query: 83 --QLVKHKFLRWNEISRRYV--DD--------------EPEFYTPDVWRGRSADKKQ-GS 123
QL +H+ ++ S RYV DD P PD W GR+ D
Sbjct: 102 MAQLTRHRHASFDVQSMRYVAFDDVDPAAVAEGELVVTPPSATDPD-WVGRNQDAGDIDE 160
Query: 124 DGVVNPEYNPQYLDNKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFADMCR 183
+ + E Q + Y + L +G+ PE AR +LP T + + + +
Sbjct: 161 ETMAEREAVFQASVRRAVEDYQELLGLGMPPEDARFVLPIGTEVNVVITLNPRSLMHVAD 220
Query: 184 LRCKEDTQYESRVVADQISEKMADLYP 210
+R D Q+E R + +Q+ + A P
Sbjct: 221 MRAAADAQWEIRELTEQLLDAAAQWCP 247
>sp|Q9HQ52|YD25_HALN1 Hypothetical protein Vng1325c
Length = 247
Score = 44.7 bits (104), Expect = 8e-004
Identities = 53/207 (25%), Positives = 81/207 (38%), Gaps = 30/207 (14%)
Query: 30 SEWVYCGQSDGRDKGLSGRDT-----KLIKYLAKHKHISPFGHAFASFHVKAPIFVAR-- 82
S+WV D + G T LI L H PF H A+F ++ V+R
Sbjct: 26 SDWVGDTPLDTAMASVDGDTTDEKLSNLIAQLLTRGHYGPFEHPSATFAIEG---VSRSC 82
Query: 83 --QLVKHKFLRWNEISRRYV--DD--------------EPEFYTPDVWRGRSADKKQ-GS 123
QL +H+ ++ S RYV DD P PD W GR+ D
Sbjct: 83 MAQLTRHRHASFDVQSMRYVAFDDVDPAAVAEGELVVTPPSATDPD-WVGRNQDAGDIDE 141
Query: 124 DGVVNPEYNPQYLDNKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFADMCR 183
+ + E Q + Y + L +G+ PE AR +LP T + + + +
Sbjct: 142 ETMAEREAVFQASVRRAVEDYQELLGLGMPPEDARFVLPIGTEVNVVITLNPRSLMHVAD 201
Query: 184 LRCKEDTQYESRVVADQISEKMADLYP 210
+R D Q+E R + +Q+ + A P
Sbjct: 202 MRAAADAQWEIRELTEQLLDAAAQWCP 228
>ref|NP_578269.1| (NC_003413) hypothetical protein [Pyrococcus furiosus DSM 3638]
gb|AAL80664.1| (AE010177) hypothetical protein [Pyrococcus furiosus DSM 3638]
Length = 244
Score = 42.4 bits (98), Expect = 0.004
Identities = 45/167 (26%), Positives = 77/167 (45%), Gaps = 23/167 (13%)
Query: 61 HISPFGHAFASFHVKAPIFVA-RQLVKHKFLRWNEISRRYV-----DDEPEFYTPD-VWR 113
H S HA +F ++ V QLV+H+ + + S+RY+ + E F P+ + +
Sbjct: 54 HESILEHATFTFSIEGCSRVCTHQLVRHRIASYTQQSQRYIVLDEENVEETFVIPESIKK 113
Query: 114 GRSADKKQGSDGVVNPEYNPQYLDNKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSG 173
R +K + + I Y ++++ G+ E AR +LPQ+ T+ +
Sbjct: 114 DRELYEKW-----------KKVMAETISL-YKESINRGVHQEDARFILPQAVKTKIIVTM 161
Query: 174 SLDAFADMCRLRCKEDTQYESRVVADQISEKMA---DLYP-VSWAAL 216
+L LR E Q+E R VA ++ E+MA D+ P + WA L
Sbjct: 162 NLRELKHFFGLRLCERAQWEIREVAWKMLEEMAKRDDIRPIIKWAKL 208
>ref|NP_542556.1| (NC_003345) putative phage C31 protein gp16-like protein [Halovirus
HF2]
gb|AAL54978.1| (AF222060) putative phage C31 protein gp16-like protein [Halovirus
HF2]
Length = 234
Score = 39.7 bits (91), Expect = 0.025
Identities = 41/180 (22%), Positives = 76/180 (41%), Gaps = 18/180 (10%)
Query: 52 LIKYLAKHKHISPFGHAFASFHVKAPIFVAR-QLVKHKFLRWNEISRRYVDDEPEFYTPD 110
L+ L + H PF H A F V+ A Q+ +H+ + ++ S+RY D F +
Sbjct: 50 LMGELLRRGHFGPFEHIQAFFAVEGLSRSAMAQVTRHRHMSFDVQSQRYCD----FSDKN 105
Query: 111 VWRGRSADKKQGSDGVVNPEYNPQYLD------------NKIKFAYLQALDIGISPEQAR 158
+ K ++ VV +Y+ D N ++ Y + ++ G++ E AR
Sbjct: 106 IVIPPGDKKNPAAEDVVLGQYDGPAEDFTGADALEGHFVNSVEL-YEKLIEQGMAKEDAR 164
Query: 159 MLLPQSTMTEWYWSGSLDAFADMCRLRCKEDTQYESRVVADQISEKMADLYPVSWAALME 218
+LP + +S + LR Q+E+R A ++ E+ + P+ + A E
Sbjct: 165 FVLPIGVEVDMTFSANARTLMHFFDLRKNMKAQWEAREFATKVLEECKEWSPLVFNAYEE 224
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.319 0.134 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 148,891,095
Number of Sequences: 1038431
Number of extensions: 6066257
Number of successful extensions: 11539
Number of sequences better than 5.0e-02: 27
Number of HSP's better than 0.1 without gapping: 15
Number of HSP's successfully gapped in prelim test: 12
Number of HSP's that attempted gapping in prelim test: 11496
Number of HSP's gapped (non-prelim): 30
length of query: 222
length of database: 328,470,697
effective HSP length: 117
effective length of query: 105
effective length of database: 206,974,270
effective search space: 21732298350
effective search space used: 21732298350
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 89 (38.9 bits)
Query= orf15 [3471-3576]
(103 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064750.1| (NC_002519) gp12 [Roseophage SIO1] >gi|994... 227 2e-059
>ref|NP_064750.1| (NC_002519) gp12 [Roseophage SIO1]
gb|AAG02595.1|AF189021_14 (AF189021) Roseophage SIO1 complete genome
Length = 114
Score = 227 bits (579), Expect = 2e-059
Identities = 103/104 (99%), Positives = 103/104 (99%), Gaps = 1/104 (0%)
Query: 1 MTNRKFTPPTEFPAEYVDGFGCKVTILGRSYYNKERPLVGFDDEGCACNYAENGAYWPDD 60
MTNRKFTPPTEFPAEYVDGFGCKVTILGRSYYNKERPLVGFDDEGCACNYAENGAYWPDD
Sbjct: 11 MTNRKFTPPTEFPAEYVDGFGCKVTILGRSYYNKERPLVGFDDEGCACNYAENGAYWPDD 70
Query: 61 GGKYDLHDIQKRITTWHNVYEGWVGASNKVN-GVTENRLCVIPH 103
GGKYDLHDIQKRITTWHNVYEGWVGASNKVN GVTENRLCVIPH
Sbjct: 71 GGKYDLHDIQKRITTWHNVYEGWVGASNKVNXGVTENRLCVIPH 114
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.319 0.141 6.50e-038
Gapped
Lambda K H
0.267 0.0410 6.96e+252
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 83,486,452
Number of Sequences: 1038431
Number of extensions: 3580346
Number of successful extensions: 6273
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 6271
Number of HSP's gapped (non-prelim): 1
length of query: 103
length of database: 328,470,697
effective HSP length: 79
effective length of query: 24
effective length of database: 246,434,648
effective search space: 5914431552
effective search space used: 5914431552
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 84 (37.0 bits)
Query= orf16 [3592-3670]
(77 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.313 0.130 6.96e+252
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 33,095,211
Number of Sequences: 1038431
Number of extensions: 556314
Number of successful extensions: 635
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 635
Number of HSP's gapped (non-prelim): 0
length of query: 77
length of database: 328,470,697
effective HSP length: 53
effective length of query: 24
effective length of database: 273,433,854
effective search space: 6562412496
effective search space used: 6562412496
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf17 [3667-3790]
(122 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064751.1| (NC_002519) gp13 [Roseophage SIO1] >gi|994... 188 8e-048
>ref|NP_064751.1| (NC_002519) gp13 [Roseophage SIO1]
gb|AAG02596.1|AF189021_15 (AF189021) Roseophage SIO1 complete genome
Length = 122
Score = 188 bits (478), Expect = 8e-048
Identities = 93/122 (76%), Positives = 93/122 (76%)
Query: 1 MSEAPERIWAWWDDAYDVGLLNKHGDKRYTPNDAKEYVRADRIEELETQLSEARQVGKEW 60
MSEAPERIWAWWDDAYDVGLLNKHGDKRYTPNDAKEYVRADRIEELETQLSEARQVGKEW
Sbjct: 1 MSEAPERIWAWWDDAYDVGLLNKHGDKRYTPNDAKEYVRADRIEELETQLSEARQVGKEW 60
Query: 61 FEEQXXXXXXXXXXXXXXXXXIDXXXXXXXXXXXXESLVDWANLHVLKARTLLAELKGET 120
FEEQ ID ESLVDWANLHVLKARTLLAELKGET
Sbjct: 61 FEEQKKARKDANAAEAKLAKAIDFVAGVAGGFKFGESLVDWANLHVLKARTLLAELKGET 120
Query: 121 DE 122
DE
Sbjct: 121 DE 122
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.312 0.131 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 60,023,664
Number of Sequences: 1038431
Number of extensions: 1573670
Number of successful extensions: 3090
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3089
Number of HSP's gapped (non-prelim): 1
length of query: 122
length of database: 328,470,697
effective HSP length: 98
effective length of query: 24
effective length of database: 226,704,459
effective search space: 5440907016
effective search space used: 5440907016
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf18 [3788-3871]
(82 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.320 0.137 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 55,307,291
Number of Sequences: 1038431
Number of extensions: 1861419
Number of successful extensions: 4846
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4846
Number of HSP's gapped (non-prelim): 0
length of query: 82
length of database: 328,470,697
effective HSP length: 58
effective length of query: 24
effective length of database: 268,241,699
effective search space: 6437800776
effective search space used: 6437800776
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf19 [3866-3945]
(78 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.322 0.139 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 51,319,670
Number of Sequences: 1038431
Number of extensions: 1725130
Number of successful extensions: 3208
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3208
Number of HSP's gapped (non-prelim): 0
length of query: 78
length of database: 328,470,697
effective HSP length: 54
effective length of query: 24
effective length of database: 272,395,423
effective search space: 6537490152
effective search space used: 6537490152
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf20 [3946-4009]
(62 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.330 0.150 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 20,764,316
Number of Sequences: 1038431
Number of extensions: 212493
Number of successful extensions: 236
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 236
Number of HSP's gapped (non-prelim): 0
length of query: 62
length of database: 328,470,697
effective HSP length: 38
effective length of query: 24
effective length of database: 289,010,319
effective search space: 6936247656
effective search space used: 6936247656
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.8 bits)
S2: 85 (37.4 bits)
Query= orf21 [4052-4570]
(517 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064752.1| (NC_002519) RPprimase/helicase [Roseophage... 1040 0.0
ref|NP_570328.1| (NC_003390) RP primase/helicase [Cyanophag... 335 1e-090
ref|NP_523315.1| (NC_003298) DNA primase/helicase [Bacterio... 276 4e-073
ref|NP_052087.1| (NC_001271) DNA primase/helicase [Bacterio... 274 2e-072
ref|NP_041975.1| (NC_001604) gene 4A, primase/helicase [14,... 273 4e-072
ref|NP_523316.1| (NC_003298) helicase [Bacteriophage T3] >g... 255 1e-066
pdb|1CR1|A Chain A, Crystal Structure Of The Helicase Domai... 156 5e-037
gb|AAD41902.1|AF159357_1 (AF159357) DNA primase [Enterobact... 154 2e-036
ref|NP_602175.1| (NC_003450) COG0305:Replicative DNA helica... 76 1e-012
gb|EAA07012.1| (AAAB01008848) ebiP580 [Anopheles gambiae st... 71 4e-011
gb|AAF52820.1| (AE003626) CG5924 gene product [Drosophila m... 66 1e-009
ref|NP_391924.1| (NC_000964) replicative DNA helicase [Baci... 63 1e-008
ref|NP_214015.1| (NC_000918) replicative DNA helicase [Aqui... 62 1e-008
ref|NP_563550.1| (NC_003366) replicative DNA helicase [Clos... 56 9e-007
ref|NP_072756.1| (NC_000908) replicative DNA helicase (dnaB... 54 5e-006
ref|NP_108071.1| (NC_002678) replicative DNA helicase (dnaB... 49 1e-004
ref|NP_439720.1| (NC_000907) replicative DNA helicase (dnaB... 49 1e-004
ref|NP_543071.1| (NC_003356) putative helicase [Bacteriopha... 46 0.001
ref|NP_458617.1| (NC_003198) DNA helicase [Salmonella enter... 45 0.002
>ref|NP_064752.1| (NC_002519) RPprimase/helicase [Roseophage SIO1]
gb|AAG02597.1|AF189021_16 (AF189021) Roseophage SIO1 complete genome
Length = 522
Score = 1040 bits (2689), Expect = 0.0
Identities = 517/517 (100%), Positives = 517/517 (100%)
Query: 1 MDYEHKKHQPCQSCGSSDGAYPYEDGIYCHVCKTKTFNDDGDTQVTEQKTLPPIRGKIMA 60
MDYEHKKHQPCQSCGSSDGAYPYEDGIYCHVCKTKTFNDDGDTQVTEQKTLPPIRGKIMA
Sbjct: 6 MDYEHKKHQPCQSCGSSDGAYPYEDGIYCHVCKTKTFNDDGDTQVTEQKTLPPIRGKIMA 65
Query: 61 VPSRGLNKATAEKYRALTSDDKISLLYTTEGKVTSFKERGLSEKTFKFNGPAQTDLFGQS 120
VPSRGLNKATAEKYRALTSDDKISLLYTTEGKVTSFKERGLSEKTFKFNGPAQTDLFGQS
Sbjct: 66 VPSRGLNKATAEKYRALTSDDKISLLYTTEGKVTSFKERGLSEKTFKFNGPAQTDLFGQS 125
Query: 121 AFSKGGKSVTITEGEFDAMAAYQMLYMSEPCVSVINGSSGAVKDCKRNYEWLDSFEKINI 180
AFSKGGKSVTITEGEFDAMAAYQMLYMSEPCVSVINGSSGAVKDCKRNYEWLDSFEKINI
Sbjct: 126 AFSKGGKSVTITEGEFDAMAAYQMLYMSEPCVSVINGSSGAVKDCKRNYEWLDSFEKINI 185
Query: 181 AFDNDKAGQDAAMAVAELFDPRKVRLVTMTLNDPNDYIKQGRERDFIDAHKKAAPFTPDG 240
AFDNDKAGQDAAMAVAELFDPRKVRLVTMTLNDPNDYIKQGRERDFIDAHKKAAPFTPDG
Sbjct: 186 AFDNDKAGQDAAMAVAELFDPRKVRLVTMTLNDPNDYIKQGRERDFIDAHKKAAPFTPDG 245
Query: 241 IIAGSSLYDLVSTPPEYDCVPYPFSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYS 300
IIAGSSLYDLVSTPPEYDCVPYPFSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYS
Sbjct: 246 IIAGSSLYDLVSTPPEYDCVPYPFSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYS 305
Query: 301 LIQQDKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTDTIYTKEEFDNAYENTLGTGRVF 360
LIQQDKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTDTIYTKEEFDNAYENTLGTGRVF
Sbjct: 306 LIQQDKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTDTIYTKEEFDNAYENTLGTGRVF 365
Query: 361 LYDSFGSNSVERIVSMVRYLARSCDCKFIILDHISIVVSDHAKDERKALDEIATKLKTLT 420
LYDSFGSNSVERIVSMVRYLARSCDCKFIILDHISIVVSDHAKDERKALDEIATKLKTLT
Sbjct: 366 LYDSFGSNSVERIVSMVRYLARSCDCKFIILDHISIVVSDHAKDERKALDEIATKLKTLT 425
Query: 421 VELDICLLMVSHLNRDKNRKPPEEGGTINLQDIRGTAGIGQLSNIIIALERNTQADDELE 480
VELDICLLMVSHLNRDKNRKPPEEGGTINLQDIRGTAGIGQLSNIIIALERNTQADDELE
Sbjct: 426 VELDICLLMVSHLNRDKNRKPPEEGGTINLQDIRGTAGIGQLSNIIIALERNTQADDELE 485
Query: 481 RNTTRVRVIKNRFTGETGVADSLSYSRHTGRLKSYEG 517
RNTTRVRVIKNRFTGETGVADSLSYSRHTGRLKSYEG
Sbjct: 486 RNTTRVRVIKNRFTGETGVADSLSYSRHTGRLKSYEG 522
>ref|NP_570328.1| (NC_003390) RP primase/helicase [Cyanophage P60]
gb|AAL73266.1| (AF338467) RP primase/helicase [Cyanophage P60]
Length = 531
Score = 335 bits (858), Expect = 1e-090
Identities = 204/519 (39%), Positives = 306/519 (58%), Gaps = 25/519 (4%)
Query: 4 EHKKHQPCQSCGSSDGAYPYEDG-IYCHVCKTKTFNDDGDTQVTEQKTLP--PIRGKIMA 60
E +H+PC +C SSD Y DG YC C +++ G Q T P P+ G
Sbjct: 8 EFTRHEPCPNCPSSDALARYSDGGAYCFSCG---YSERGTGQSTHTSRAPMIPLEGDFHR 64
Query: 61 VPSRGLNKATAEKYRALTSDDKISL-LYTTEGKVTSFKERGLSEKTFKFNGP-AQTDLFG 118
+PSR L + T K+ +D +I Y ++ +K RG EK F++ G LFG
Sbjct: 65 IPSRNLTEETCRKFNVRVNDGRIQFPYYDNSRQLVGYKHRG-KEKDFRWQGKNPDNQLFG 123
Query: 119 QSAFSKGGKSVTITEGEFDAMAAYQMLYMSEPCVSVINGSSGAVKDCKRNYEWLDSFEKI 178
Q + KG K++ +TEGEFDA++ +Q + P VSV NG+ A K +L +FE+I
Sbjct: 124 QQLWGKG-KAIVVTEGEFDALSVFQAR-KNWPVVSVPNGAQSAKKALAAQLTYLLNFEEI 181
Query: 179 NIAFDNDKAGQDAAMAVAELFDPRKVRLVTMTLNDPNDYIKQGRERDFIDAHKKA----A 234
+ FDND AG AA LF +V T ++ D + +D+ DA +A +
Sbjct: 182 ILMFDNDDAGIQAAEECVSLFPADRV--FTAFIDGYKDASEAVSAKDY-DAVTQAIWNKS 238
Query: 235 PFTPDGIIAGSSLYDLVSTPPEYDCVPYPFSGLNRMTKGLRTGELITFVAGTGVGKTQVM 294
+TP II G +L+D+V+TP YPF+ LN T GLR GEL+T AG+G GK+ +
Sbjct: 239 TYTPKSIIDGRTLFDVVTTPLHGRDAEYPFAALNSTTGGLRLGELVTITAGSGTGKSTLC 298
Query: 295 REILYSLIQQDKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTDTIYTKEEFDNAYENTL 354
EI SLI QD+ VG + LEE V+ TGL +M++ A KPLHL + + T +E A+++TL
Sbjct: 299 GEIAVSLINQDQ-RVGYIALEESVKRTGLRLMTVAANKPLHLNNELPT-DELRTAFDSTL 356
Query: 355 GTGRVFLYDSFGSNSVERIVSMVRYLARSCDCKFIILDHISIVVS-DHAKDERKALDEIA 413
GTGRVFL D FGS + +++ +R+L ++ + ++I+LDH+SI++S + + DERK +D +
Sbjct: 357 GTGRVFLRDGFGSVDPDSLLNDIRFLTKAHEVQWIVLDHLSILLSGNESTDERKMIDIVM 416
Query: 414 TKLKTLTVELDICLLMVSHLNRDKNRKPPEEGGTINLQDIRGTAGIGQLSNIIIALERNT 473
TKL++ E I ++++SHL R++ K E+G ++L +RG+ I QLS+++IALER+
Sbjct: 417 TKLRSFVEETGIGMILISHLRRNQGDKGHEDGAQVSLSQLRGSHSIAQLSDLVIALERDI 476
Query: 474 QADDELERNTTRVRVIKNRFTGETGVADSLSYSRHTGRL 512
D N +++RV+KNRF G+TG A LSYS TGR+
Sbjct: 477 SKGD----NRSQLRVLKNRFNGQTGPAGGLSYSVETGRM 511
>ref|NP_523315.1| (NC_003298) DNA primase/helicase [Bacteriophage T3]
sp|P20315|PRIM_BPT3 DNA PRIMASE/HELICASE
pir||S07508 DNA primase - phage T3
emb|CAA35135.1| (X17255) 4A [Bacteriophage T3]
emb|CAC86278.1| (AJ318471) DNA primase/helicase [Bacteriophage T3]
Length = 566
Score = 276 bits (706), Expect = 4e-073
Identities = 180/541 (33%), Positives = 275/541 (50%), Gaps = 35/541 (6%)
Query: 8 HQPCQSCGSSDGAYPYEDGI-YCHVCKTKT-FNDDGDTQVTEQK-------------TLP 52
H PC++CGSSDG Y DG +C VC+ + N++ + +++ ++
Sbjct: 12 HAPCENCGSSDGNSVYSDGHEWCFVCEHRVPANEEREAKLSTRRRTGGSKPMSYDVWNFG 71
Query: 53 PIRGKIMAVPSRGLNKATAEK--YRALTSDDK---ISLLYTTEGKVTSFKERGLSEKTFK 107
G+ + +RG++K T +K Y D++ ++ G + S K R +K FK
Sbjct: 72 DSNGRYSDLTARGISKETCQKAGYWLAKVDNRMYQVADYRDQNGSIVSQKVRD-KDKNFK 130
Query: 108 FNGPAQTDLFGQSAFSKGGKSVTITEGEFDAMAAYQMLYMSEPCVSVINGSSGAVKDCKR 167
G ++D GGK + +TEGE DA+ ++ P VS+ +G+S A K C
Sbjct: 131 TTGSHKSDALFLKHLWSGGKKIVVTEGEIDALTVMELQDCKYPVVSLGHGASAAKKTCAA 190
Query: 168 NYEWLDSFEKINIAFDNDKAGQDAAMAVAELFDPRKVRLVTMTLNDPNDYIKQGRERDFI 227
NYE+ D FE+I + FD D AG+ A A++ KVR+ + D N+ G ++ +
Sbjct: 191 NYEYFDQFEQIILMFDMDDAGRKAVEEAAQVLPAGKVRVAVLPCKDANECHIMGEDKAIL 250
Query: 228 DAHKKAAPFTPDGIIAGSSLYDLV-STPPEYDCVPYPF---SGLNRMTKGLRTGELITFV 283
+ A P+ PDG+++ SL D V D V F GLN T G R GE++
Sbjct: 251 EQVWNANPWVPDGVVSALSLKDRVKEAMTSEDAVGLLFDGCQGLNDRTLGARGGEVVMVT 310
Query: 284 AGTGVGKTQVMREILYSLIQQDKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTD----T 339
+G+G+GK+ +R+ + ++ VG LEE V DT MM + + L +D
Sbjct: 311 SGSGMGKSTFVRQQALAWGKRMGKRVGLAMLEESVEDTIQDMMGLNNKVRLRQSDEVKKA 370
Query: 340 IYTKEEFDNAYENTLGTGRVFLYDSFGSNSVERIVSMVRYLARSCDCKFIILDHISIVVS 399
I FD Y+ G LYDSF +R+++ + Y+ C I+LDHISIVVS
Sbjct: 371 IAEDGRFDEWYDELFGDDTFHLYDSFAEAEADRLLAKLAYMRTGLGCDVIVLDHISIVVS 430
Query: 400 -DHAKDERKALDEIATKLKTLTVELDICLLMVSHLNRDKNRKPPEEGGTINLQDIRGTAG 458
DERK +D + TKLK + L+++ HL + K EEG +++ D+RG+
Sbjct: 431 ASEESDERKMIDRLMTKLKGFAKSTGVVLVVICHLKNPEKGKAHEEGRAVSITDLRGSGA 490
Query: 459 IGQLSNIIIALERNTQADDELERNTTRVRVIKNRFTGETGVADSLSYSRHTGRLK--SYE 516
+ QLS+ IIALERN Q D N VR++K RFTG+TG+A + Y+R TG L+ SY
Sbjct: 491 LRQLSDTIIALERNQQGD---MPNLVLVRLLKCRFTGDTGIAGYMEYNRETGWLEPSSYT 547
Query: 517 G 517
G
Sbjct: 548 G 548
>ref|NP_052087.1| (NC_001271) DNA primase/helicase [Bacteriophage phiYeO3-12]
emb|CAB63608.1| (AJ251805) DNA primase/helicase [Bacteriophage phiYeO3-12]
Length = 566
Score = 274 bits (700), Expect = 2e-072
Identities = 184/544 (33%), Positives = 273/544 (49%), Gaps = 41/544 (7%)
Query: 8 HQPCQSCGSSDGAYPYEDG-IYCHVCKTKTFNDDGDTQVTEQKTL-------PPI----- 54
H PC++CGSSD + DG YC+ C+ T GD Q EQ + P+
Sbjct: 12 HIPCENCGSSDANSLFSDGHTYCYACENWT---PGDEQKAEQLSTRRRTGGSKPMSYDVW 68
Query: 55 -----RGKIMAVPSRGLNKATAEK--YRALTSDDK---ISLLYTTEGKVTSFKERGLSEK 104
G+ + +RG++K T +K Y D++ ++ G + S K R +K
Sbjct: 69 NFGDSNGRYSDLTARGISKETCQKAGYWLAKVDNRMYQVADYRDQNGSIVSQKVRD-KDK 127
Query: 105 TFKFNGPAQTDLFGQSAFSKGGKSVTITEGEFDAMAAYQMLYMSEPCVSVINGSSGAVKD 164
FK G ++D GGK + +TEGE DA+ ++ P VS+ +G+S A K
Sbjct: 128 NFKTTGSHKSDALFLKHLWSGGKKIVVTEGEIDALTVMELQDCKYPVVSLGHGASAAKKT 187
Query: 165 CKRNYEWLDSFEKINIAFDNDKAGQDAAMAVAELFDPRKVRLVTMTLNDPNDYIKQGRER 224
C NYE+ D FE+I + FD D AG+ A A++ KVR+ + D N+ G ++
Sbjct: 188 CAANYEYFDQFEQIILMFDMDDAGRKAVEEAAQVLPAGKVRVAVLPYKDANECHIMGEDK 247
Query: 225 DFIDAHKKAAPFTPDGIIAGSSLYDLV-STPPEYDCVPYPF---SGLNRMTKGLRTGELI 280
++ A P+ PDG+++ SL D V D V F GLN T G R GE+I
Sbjct: 248 AILEQVWNANPWVPDGVVSALSLKDRVKEAMTSEDAVGLLFDGCQGLNDRTLGARGGEVI 307
Query: 281 TFVAGTGVGKTQVMREILYSLIQQDKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTD-- 338
+G+G+GK+ +R+ + ++ VG LEE V DT MM + + L +D
Sbjct: 308 MVTSGSGMGKSTFVRQQALAWGKRMGKRVGLAMLEESVEDTIQDMMGLNNKVRLRQSDEV 367
Query: 339 --TIYTKEEFDNAYENTLGTGRVFLYDSFGSNSVERIVSMVRYLARSCDCKFIILDHISI 396
I FD Y+ G LYDSF +R+++ + Y+ C I+LDHISI
Sbjct: 368 KKAIAEDGRFDEWYDELFGDDTFHLYDSFAEAEADRLLAKLAYMRTGLGCDVIVLDHISI 427
Query: 397 VVS-DHAKDERKALDEIATKLKTLTVELDICLLMVSHLNRDKNRKPPEEGGTINLQDIRG 455
VVS DERK +D + TKLK + L+++ HL + K EEG +++ D+RG
Sbjct: 428 VVSASEESDERKMIDRLMTKLKGFAKSTGVVLVVICHLKNPEKGKAHEEGRAVSITDLRG 487
Query: 456 TAGIGQLSNIIIALERNTQADDELERNTTRVRVIKNRFTGETGVADSLSYSRHTGRLK-- 513
+ + QLS+ IIALERN Q D + VR++K RFTG+TG+A + Y+R TG L+
Sbjct: 488 SGALRQLSDTIIALERNQQGD---MPHLVLVRLLKCRFTGDTGIAGYMEYNRETGWLEPS 544
Query: 514 SYEG 517
SY G
Sbjct: 545 SYTG 548
>ref|NP_041975.1| (NC_001604) gene 4A, primase/helicase [14,15] [Enterobacteria phage
T7]
sp|P03692|PRIM_BPT7 DNA primase/helicase
pir||YDBPA7 DNA primase - phage T7
emb|CAA24405.1| (V01146) gene 4A, primase/helicase [14,15] [Enterobacteria phage
T7]
Length = 566
Score = 273 bits (698), Expect = 4e-072
Identities = 179/542 (33%), Positives = 279/542 (51%), Gaps = 38/542 (7%)
Query: 8 HQPCQSCGSSDGAYPYEDG-IYCHVCKTKTFNDDGDTQVTEQKTLPP------------- 53
H PC +CGSSDG + DG +C+VC+ T ++ DT+ K P
Sbjct: 14 HIPCDNCGSSDGNSLFSDGHTFCYVCEKWTAGNE-DTKERASKRKPSGGKPMTYNVWNFG 72
Query: 54 -IRGKIMAVPSRGLNKATAEK--YRALTSDD---KISLLYTTEGKVTSFKERGLSEKTFK 107
G+ A+ +RG++K T +K Y D +++ G + S K R +K FK
Sbjct: 73 ESNGRYSALTARGISKETCQKAGYWIAKVDGVMYQVADYRDQNGNIVSQKVRD-KDKNFK 131
Query: 108 FNGPAQTD-LFGQSAFSKGGKSVTITEGEFDAMAAYQMLYMSEPCVSVINGSSGAVKDCK 166
G ++D LFG+ ++ GGK + +TEGE D + ++ P VS+ +G+S A K C
Sbjct: 132 TTGSHKSDALFGKHLWN-GGKKIVVTEGEIDMLTVMELQDCKYPVVSLGHGASAAKKTCA 190
Query: 167 RNYEWLDSFEKINIAFDNDKAGQDAAMAVAELFDPRKVRLVTMTLNDPNDYIKQGRERDF 226
NYE+ D FE+I + FD D+AG+ A A++ KVR+ + D N+ G +R+
Sbjct: 191 ANYEYFDQFEQIILMFDMDEAGRKAVEEAAQVLPAGKVRVAVLPCKDANECHLNGHDREI 250
Query: 227 IDAHKKAAPFTPDGIIAGSSLYDLV----STPPEYDCVPYPFSGLNRMTKGLRTGELITF 282
++ A P+ PDG+++ SL + + S+ + +G+N T G R GE+I
Sbjct: 251 MEQVWNAGPWIPDGVVSALSLRERIREHLSSEESVGLLFSGCTGINDKTLGARGGEVIMV 310
Query: 283 VAGTGVGKTQVMREILYSLIQQDKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTDT--- 339
+G+G+GK+ +R+ VG LEE V +T ++ + L +D+
Sbjct: 311 TSGSGMGKSTFVRQQALQWGTAMGKKVGLAMLEESVEETAEDLIGLHNRVRLRQSDSLKR 370
Query: 340 -IYTKEEFDNAYENTLGTGRVFLYDSFGSNSVERIVSMVRYLARSCDCKFIILDHISIVV 398
I +FD ++ G LYDSF +R+++ + Y+ C IILDHISIVV
Sbjct: 371 EIIENGKFDQWFDELFGNDTFHLYDSFAEAETDRLLAKLAYMRSGLGCDVIILDHISIVV 430
Query: 399 SDHAK-DERKALDEIATKLKTLTVELDICLLMVSHLNRDKNRKPPEEGGTINLQDIRGTA 457
S + DERK +D + TKLK + L+++ HL K EEG +++ D+RG+
Sbjct: 431 SASGESDERKMIDNLMTKLKGFAKSTGVVLVVICHLKNPDKGKAHEEGRPVSITDLRGSG 490
Query: 458 GIGQLSNIIIALERNTQADDELERNTTRVRVIKNRFTGETGVADSLSYSRHTGRLK--SY 515
+ QLS+ IIALERN Q D N VR++K RFTG+TG+A + Y++ TG L+ SY
Sbjct: 491 ALRQLSDTIIALERNQQGD---MPNLVLVRILKCRFTGDTGIAGYMEYNKETGWLEPSSY 547
Query: 516 EG 517
G
Sbjct: 548 SG 549
>ref|NP_523316.1| (NC_003298) helicase [Bacteriophage T3]
emb|CAA35136.1| (X17255) 4B [Bacteriophage T3]
emb|CAC86279.1| (AJ318471) helicase [Bacteriophage T3]
Length = 504
Score = 255 bits (651), Expect = 1e-066
Identities = 164/478 (34%), Positives = 246/478 (51%), Gaps = 20/478 (4%)
Query: 56 GKIMAVPSRGLNKATAEK--YRALTSDDK---ISLLYTTEGKVTSFKERGLSEKTFKFNG 110
G+ + +RG++K T +K Y D++ ++ G + S K R +K FK G
Sbjct: 13 GRYSDLTARGISKETCQKAGYWLAKVDNRMYQVADYRDQNGSIVSQKVRD-KDKNFKTTG 71
Query: 111 PAQTDLFGQSAFSKGGKSVTITEGEFDAMAAYQMLYMSEPCVSVINGSSGAVKDCKRNYE 170
++D GGK + +TEGE DA+ ++ P VS+ +G+S A K C NYE
Sbjct: 72 SHKSDALFLKHLWSGGKKIVVTEGEIDALTVMELQDCKYPVVSLGHGASAAKKTCAANYE 131
Query: 171 WLDSFEKINIAFDNDKAGQDAAMAVAELFDPRKVRLVTMTLNDPNDYIKQGRERDFIDAH 230
+ D FE+I + FD D AG+ A A++ KVR+ + D N+ G ++ ++
Sbjct: 132 YFDQFEQIILMFDMDDAGRKAVEEAAQVLPAGKVRVAVLPCKDANECHIMGEDKAILEQV 191
Query: 231 KKAAPFTPDGIIAGSSLYDLV-STPPEYDCVPYPF---SGLNRMTKGLRTGELITFVAGT 286
A P+ PDG+++ SL D V D V F GLN T G R GE++ +G+
Sbjct: 192 WNANPWVPDGVVSALSLKDRVKEAMTSEDAVGLLFDGCQGLNDRTLGARGGEVVMVTSGS 251
Query: 287 GVGKTQVMREILYSLIQQDKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTD----TIYT 342
G+GK+ +R+ + ++ VG LEE V DT MM + + L +D I
Sbjct: 252 GMGKSTFVRQQALAWGKRMGKRVGLAMLEESVEDTIQDMMGLNNKVRLRQSDEVKKAIAE 311
Query: 343 KEEFDNAYENTLGTGRVFLYDSFGSNSVERIVSMVRYLARSCDCKFIILDHISIVVS-DH 401
FD Y+ G LYDSF +R+++ + Y+ C I+LDHISIVVS
Sbjct: 312 DGRFDEWYDELFGDDTFHLYDSFAEAEADRLLAKLAYMRTGLGCDVIVLDHISIVVSASE 371
Query: 402 AKDERKALDEIATKLKTLTVELDICLLMVSHLNRDKNRKPPEEGGTINLQDIRGTAGIGQ 461
DERK +D + TKLK + L+++ HL + K EEG +++ D+RG+ + Q
Sbjct: 372 ESDERKMIDRLMTKLKGFAKSTGVVLVVICHLKNPEKGKAHEEGRAVSITDLRGSGALRQ 431
Query: 462 LSNIIIALERNTQADDELERNTTRVRVIKNRFTGETGVADSLSYSRHTGRLK--SYEG 517
LS+ IIALERN Q D N VR++K RFTG+TG+A + Y+R TG L+ SY G
Sbjct: 432 LSDTIIALERNQQGD---MPNLVLVRLLKCRFTGDTGIAGYMEYNRETGWLEPSSYTG 486
>pdb|1CR1|A Chain A, Crystal Structure Of The Helicase Domain Of The Gene 4
Protein Of Bacteriophage T7: Complex With Dttp
pdb|1CR4|A Chain A, Crystal Structure Of The Helicase Domain Of The Gene 4
Protein Of Bacteriophage T7: Complex With Dtdp
pdb|1CR2|A Chain A, Crystal Structure Of The Helicase Domain Of The Gene 4
Protein Of Bacteriophage T7: Complex With Datp
pdb|1CR0|A Chain A, Crystal Structure Of The Helicase Domain Of The Gene4
Protein Of Bacteriophage T7
Length = 296
Score = 156 bits (395), Expect = 5e-037
Identities = 94/260 (36%), Positives = 140/260 (53%), Gaps = 10/260 (3%)
Query: 265 SGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQQDKGNVGTLFLEEPVRDTGLG 324
+G+N T G R GE+I +G+G+GK+ +R+ VG LEE V +T
Sbjct: 23 TGINDKTLGARGGEVIMVTSGSGMGKSTFVRQQALQWGTAMGKKVGLAMLEESVEETAED 82
Query: 325 MMSIQAEKPLHLTDT----IYTKEEFDNAYENTLGTGRVFLYDSFGSNSVERIVSMVRYL 380
++ + L +D+ I +FD ++ G LYDSF +R+++ + Y+
Sbjct: 83 LIGLHNRVRLRQSDSLKREIIENGKFDQWFDELFGNDTFHLYDSFAEAETDRLLAKLAYM 142
Query: 381 ARSCDCKFIILDHISIVVSDHAK-DERKALDEIATKLKTLTVELDICLLMVSHLNRDKNR 439
C IILDHISIVVS + DERK +D + TKLK + L+++ HL
Sbjct: 143 RSGLGCDVIILDHISIVVSASGESDERKMIDNLMTKLKGFAKSTGVVLVVICHLKNPDKG 202
Query: 440 KPPEEGGTINLQDIRGTAGIGQLSNIIIALERNTQADDELERNTTRVRVIKNRFTGETGV 499
K EEG +++ D+RG+ + QLS+ IIALERN Q D N VR++K RFTG+TG+
Sbjct: 203 KAHEEGRPVSITDLRGSGALRQLSDTIIALERNQQGD---MPNLVLVRILKCRFTGDTGI 259
Query: 500 ADSLSYSRHTGRLK--SYEG 517
A + Y++ TG L+ SY G
Sbjct: 260 AGYMEYNKETGWLEPSSYSG 279
>gb|AAD41902.1|AF159357_1 (AF159357) DNA primase [Enterobacteria phage SP6]
Length = 661
Score = 154 bits (390), Expect = 2e-036
Identities = 140/465 (30%), Positives = 217/465 (46%), Gaps = 49/465 (10%)
Query: 91 GKVTSFKERGLSEKTFKFNGPAQT----DLFGQSAFS----KGGKSVT--ITEGEFDAMA 140
GK+ K R L K FKF + D+FG + S KG + T I GE DA+A
Sbjct: 166 GKLVGAKCRTLP-KDFKFGHLGKLFGNQDMFGMNTMSNVLDKGRRKDTLLIVGGELDALA 224
Query: 141 AYQMLYMS--------EPC-VSVINGSSGAVKDCKRNYEWLDSFEKINIAFDNDKAGQDA 191
A QML S +P V IN +++ +N E + F+KI FD D+ GQ
Sbjct: 225 AQQMLLDSAKGTKWEGQPYHVWSINKGEACLEEIVQNREHISQFKKIMWGFDGDEIGQKL 284
Query: 192 AMAVAELFDPRKVRLVTMTLN--DPNDYIKQGRERDFIDAHKKAAPFTPDGIIAGSSLYD 249
A LF P K ++ D N + G+ ++F+DA A + GS +
Sbjct: 285 NQQAARLF-PGKSYIIEYPAGCKDANKALMAGKSKEFVDAWFNAKSSDE---VFGSQIKS 340
Query: 250 LVSTPPEYDC------VPYPFSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQ 303
+ S + + +P+ LN++T G+R +LI AG+GVGKT+ +RE++ LI+
Sbjct: 341 IASQRDKLKAARPEPGLSWPWPRLNKITLGIRKHQLIIVGAGSGVGKTEFLREVVKHLIE 400
Query: 304 QDKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTDT------------IYTKEEFDNAYE 351
+ +VG + E+P+ + +K + L T YT+EE + A +
Sbjct: 401 EHGESVGIISTEDPMVKVSRAFIGKWIDKRIELPPTNDPREDGYREVFDYTEEEANAAID 460
Query: 352 NTLGTGRVFLYDSFGSNSVERIVSMVRYLARSCDCKFIILDHISIVVSDHAKDERK--AL 409
TG++F+ D G S+E+ V + II+D+++ + D K AL
Sbjct: 461 YVADTGKLFVADLEGDYSMEK-VEQTCLEFEAMGISNIIIDNLTGIKLDERNFGGKVGAL 519
Query: 410 DEIATKLKTLTVELDICLLMVSHLNRDK-NRKPPEEGGTINLQDIRGTAGIGQLSNIIIA 468
DE ++ T+ + + +VSHL R R EEGG + L D RG+ IG ++ +
Sbjct: 520 DECVKRIGTIKDRHPVTIFLVSHLTRPSGQRTSHEEGGEVILSDFRGSGAIGFWASYALG 579
Query: 469 LERNTQADDELERNTTRVRVIKNRFTG-ETGVADSLSYSRHTGRL 512
+ERNT+A+ ER TT + +K+R G TG L TGRL
Sbjct: 580 IERNTRAETLDERTTTYISCVKDRDQGIYTGTKVMLKGDVSTGRL 624
>ref|NP_602175.1| (NC_003450) COG0305:Replicative DNA helicase [Corynebacterium
glutamicum] [Corynebacterium glutamicum ATCC 13032]
dbj|BAC00374.1| (AP005283) Replicative DNA helicase [Corynebacterium glutamicum
ATCC 13032]
Length = 510
Score = 75.9 bits (185), Expect = 1e-012
Identities = 68/258 (26%), Positives = 120/258 (46%), Gaps = 17/258 (6%)
Query: 260 VPYPFSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQQDKGNVGTLFLEEPVR 319
+P F L+ +T GLR G++I A GVGK+ + + + S K N+ ++ +
Sbjct: 253 IPTGFKDLDDLTNGLRGGQMIIVAARPGVGKSTIALDFMRS--ASIKNNMASVIFSLEMS 310
Query: 320 DTGLGMMSIQAEKPLHLTDTIYTKEEFDNAYE------NTLGTGRVFLYDSFGSNSVERI 373
+ + M + AE + L D K + + A+E + + +F+ DS +E I
Sbjct: 311 KSEIVMRLLSAETEIRLADMRGGKMD-ETAWEKMVQKLDKVAQAPLFIDDSANLTMME-I 368
Query: 374 VSMVRYLARSCDCKFIILDHISIVVSDHAKDER-KALDEIATKLKTLTVELDICLLMVSH 432
S R L + D K I++D++ ++ S + R + + E + +LK L ELD+ L+ +S
Sbjct: 369 RSKARKLKQKHDLKMIVVDYLQLMSSGKRVESRQQEVSEFSRQLKLLAKELDVPLIAISQ 428
Query: 433 LNRDKNRKPPEEGGTINLQDIRGTAGIGQLSNIIIALERNTQADDELERNTTRVRVIKNR 492
LNR + + L D+R + + Q ++I++ L R D + ER ++
Sbjct: 429 LNRGPESRTDKRP---QLADLRESGSLEQDADIVMLLYRPDSQDKDDERAGEADIILAKH 485
Query: 493 FTG---ETGVADSLSYSR 507
G VA L YSR
Sbjct: 486 RGGPIDTVQVAHQLHYSR 503
>gb|EAA07012.1| (AAAB01008848) ebiP580 [Anopheles gambiae str. PEST]
Length = 350
Score = 70.9 bits (172), Expect = 4e-011
Identities = 84/357 (23%), Positives = 153/357 (42%), Gaps = 44/357 (12%)
Query: 172 LDSFEKINIAFDNDKAGQDAAMAVAELFDPRKVRLVTMTLNDPNDY---IKQGRERDFID 228
L+ F+++ + F+ D AG D A A+ D R+ V T P + ++ E I
Sbjct: 16 LERFQRLTLWFNYDTAGWDTARNYAKKLDERRCLFVRPTDQHPTPFRALLEGTLEPKAIL 75
Query: 229 AHKKAAPFTPDGIIAGSSL-YDLVSTPPEYDCVP------YPFSGLNRMTKGLRTGELIT 281
+ KA P I SL D++S D V YP LN++ KG R GEL
Sbjct: 76 S--KAQPILHQSITTFRSLRQDVLSDLQNIDKVQGVKWKRYPT--LNKLLKGHRKGELTV 131
Query: 282 FVAGTGVGKTQVMREILYSLIQQDKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTDTIY 341
TG GKT M + L QQ V TL+ +R+T L + ++ L + +
Sbjct: 132 LTGPTGCGKTTFMSDYSLDLAQQ---GVSTLWGSFEIRNTRLAVTLLRQMVGRPLDENLS 188
Query: 342 TKEEFDNAYENTLGTGRVFLYDSFGSNSVERIVSMVRYLARSCDCKFIILDHISIVVSDH 401
E++ +A+E ++ G ++ ++ + + D + +I+D++ ++
Sbjct: 189 EFEQWADAFERL----PIYFMTFHGQQPIKIVMEAIEHAQYVHDIQHVIIDNLQFMMG-- 242
Query: 402 AKDERKALDE------IATKLKTLTVELDICLLMVSHLNRDKNRKPPEEGGTINLQDIRG 455
DE K LD I +T + + + +V H ++++ +E T ++ G
Sbjct: 243 VLDESKHLDRYWKQDAIIAAFRTFATKRNCHVTLVIHPRKERD---TDELTTSSI--FGG 297
Query: 456 TAGIGQLSNIIIALERNTQADDELERNTTRVRVIKNRFTGETGVAD------SLSYS 506
+ N++I ++ + R ++V KNR++G+ G+ SLSY+
Sbjct: 298 AKASQEADNVLIIQDKRLTS----VRGKKYLQVAKNRYSGDLGIMPLDFDKASLSYA 350
>gb|AAF52820.1| (AE003626) CG5924 gene product [Drosophila melanogaster]
Length = 613
Score = 65.9 bits (159), Expect = 1e-009
Identities = 74/331 (22%), Positives = 135/331 (40%), Gaps = 31/331 (9%)
Query: 190 DAAMAVAELFDPRKVRLVTMTLNDPNDYIKQGRERDFIDAHKKAAPFTPDGIIA-GSSLY 248
DAA A A D R+ L+ T +P ++ R + KA P I G+
Sbjct: 283 DAARAFALKLDERRCLLIRPTETEPAPHLALRRRLNLRHILAKATPVQHKAITTFGAMRN 342
Query: 249 DLVSTPPEYDCVP----YPFSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQQ 304
D++S + V F LN++ KG R GEL TG GKT M E L Q
Sbjct: 343 DILSELQNIEKVNGVKWKRFPVLNKLLKGHRRGELTILTGPTGSGKTTFMSEYSLDLAMQ 402
Query: 305 DKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTDTIYTKEEFDNAYENTLGTGRVFLYDS 364
V TL+ +R+T L ++ L D ++ + +E ++
Sbjct: 403 ---GVNTLWGSFEIRNTRLAATLLRQYVGYPLDDRLHEFNHWAAEFERL----PLYFMTF 455
Query: 365 FGSNSVERIVSMVRYLARSCDCKFIILDHISIV--VSDHAKDERKALDEIATKLKTLTVE 422
G ++ ++ + + + D +I+D++ + VS D+ D I ++ +
Sbjct: 456 HGQQPLKPVLEAIEHASYVHDVMHVIIDNLQFMMGVSTFRGDKFFEQDSIIAAFRSFATK 515
Query: 423 LDICLLMVSHLNRDKNRKPPEEGGTINLQDIRGTAGIGQ-LSNIIIALERNTQADDELER 481
++ + +V H +++ + + + GTA Q N++I ++ + R
Sbjct: 516 HNVHVTLVMHPRKER------QEDELTTSSVFGTAKATQEADNVLIIQDKRLTS----VR 565
Query: 482 NTTRVRVIKNRFTGETGV------ADSLSYS 506
+++ KNR++G+ G+ D LSYS
Sbjct: 566 GKKYLQIAKNRYSGDLGIMPLEFDKDGLSYS 596
>ref|NP_391924.1| (NC_000964) replicative DNA helicase [Bacillus subtilis]
sp|P37469|DNAC_BACSU Replicative DNA helicase
pir||S65970 replicative DNA helicase dnaC - Bacillus subtilis
dbj|BAA05176.1| (D26185) replicative DNA helicase [Bacillus subtilis]
emb|CAB16081.1| (Z99124) replicative DNA helicase [Bacillus subtilis]
Length = 454
Score = 62.8 bits (151), Expect = 1e-008
Identities = 61/262 (23%), Positives = 118/262 (44%), Gaps = 10/262 (3%)
Query: 260 VPYPFSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQQDKGNVGTLFLEEPVR 319
+P F+ L+RMT G + +LI A VGKT I ++ + +V LE
Sbjct: 186 IPTGFTELDRMTAGFQRNDLIIVAARPSVGKTAFALNIAQNVATKTDESVAIFSLEMGAE 245
Query: 320 DTGLGMMSIQAE-KPLHLTDTIYTKEEFD--NAYENTLGTGRVFLYDSFGSNSVERIVSM 376
+ M+ + +L T+E++ +L +++ D+ G V I +
Sbjct: 246 QLVMRMLCAEGNINAQNLRTGNLTEEDWGKLTMAMGSLSNSGIYIDDTPGIR-VSEIRAK 304
Query: 377 VRYLARSCDCKFIILDHISIVV-SDHAKDER-KALDEIATKLKTLTVELDICLLMVSHLN 434
R L + I++D++ ++ S +KD R + + EI+ +LK++ EL + ++ +S L+
Sbjct: 305 CRRLKQESGLGMILIDYLQLIQGSGRSKDNRQQEVSEISRELKSIARELQVPVIALSQLS 364
Query: 435 RDKNRKPPEEGGTINLQDIRGTAGIGQLSNIIIALERNTQADDELERNTTRVRVIKNRFT 494
R ++ + + DIR + I Q ++I+ L R+ D E E +I +
Sbjct: 365 RGVEQRQDKRP---MMSDIRESGSIEQDADIVAFLYRDDYYDKETENKNIIEIIIAKQRN 421
Query: 495 GETGVADSLSYSRHTGRLKSYE 516
G G SL++ + + + E
Sbjct: 422 GPVGTV-SLAFVKEYNKFVNLE 442
>ref|NP_214015.1| (NC_000918) replicative DNA helicase [Aquifex aeolicus]
pir||H70427 replicative DNA helicase - Aquifex aeolicus
gb|AAC07414.1| (AE000742) replicative DNA helicase [Aquifex aeolicus]
Length = 468
Score = 62.4 bits (150), Expect = 1e-008
Identities = 52/216 (24%), Positives = 105/216 (48%), Gaps = 8/216 (3%)
Query: 260 VPYPFSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQQDKGNVGTLF-LEEPV 318
+P F+ L+ T G G+LI A G+GKT M I+Y+L +D+G +F LE
Sbjct: 182 LPSGFTELDLKTTGFHPGDLIILAARPGMGKTAFMLSIIYNL-AKDEGKPSAVFSLEMSK 240
Query: 319 RDTGLGMMSIQAEKPL-HLTDTIYTKEEFDNAYENTLGTGRVFLY-DSFGSNSVERIVSM 376
+ ++S+ +E PL + + E+ + + + +Y D + + +
Sbjct: 241 EQLVMRLLSMMSEVPLFKIRSGSISNEDLKKLEASAIELAKYDIYLDDTPALTTTDLRIR 300
Query: 377 VRYLARSCDCKFIILDHISIVVSDHAKDER-KALDEIATKLKTLTVELDICLLMVSHLNR 435
R L + + +F+ +D++ ++ K R + + E++ LK L EL I ++ ++ L+R
Sbjct: 301 ARKLRKEKEVEFVAVDYLQLLRPPVRKSSRQEEVAEVSRNLKALAKELHIPVMALAQLSR 360
Query: 436 DKNRKPPEEGGTINLQDIRGTAGIGQLSNIIIALER 471
+ ++ + L D+R + I Q +++I+ L R
Sbjct: 361 EVEKRSDKRP---QLADLRESGQIEQDADLILFLHR 393
>ref|NP_563550.1| (NC_003366) replicative DNA helicase [Clostridium perfringens]
dbj|BAB82340.1| (AP003194) replicative DNA helicase [Clostridium perfringens str.
13]
Length = 444
Score = 56.2 bits (134), Expect = 9e-007
Identities = 54/256 (21%), Positives = 110/256 (42%), Gaps = 9/256 (3%)
Query: 264 FSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQQDKGNVGTLFLEEPVRDTGL 323
F L+ MT G + G+++ A +GKT I ++ +V LE
Sbjct: 189 FKDLDEMTSGFQKGDMVLIAARPSMGKTTFALNIAEHAALREGKSVVIFSLEMSKEQLAY 248
Query: 324 GMMSIQAE-KPLHLTDTIYTKEEFDN--AYENTLGTGRVFLYDSFGSNSVERIVSMVRYL 380
++ +A L L T E++DN L + ++++ D+ G SV + S R L
Sbjct: 249 KLLCSEANVDMLSLRTGKLTPEDWDNIARASGPLASAKIYIDDTAGV-SVMEMRSKCRRL 307
Query: 381 ARSCDCKFIILDHISIVVSDHAKDERKALDEIATKLKTLTVELDICLLMVSHLNRDKNRK 440
I++D++ + +S + ++ + EI+ +K + E++ ++ +S L+R ++
Sbjct: 308 KMEYGIDLILIDYLQL-MSGSGESRQQEVAEISRNIKAIAKEMECPVIALSQLSRAPEQR 366
Query: 441 PPEEGGTINLQDIRGTAGIGQLSNIIIALERNTQADDELERNTTRVRVIKNRFTGETGVA 500
L D+R + I Q +++++ L R+ + E E +I + G G
Sbjct: 367 ADHRP---MLSDLRESGSIEQDADLVMFLYRDEYYNKETEDKNVGECIIAKQRNGPVGTV 423
Query: 501 DSLSYSRHTGRLKSYE 516
L++ + YE
Sbjct: 424 -KLAWLGQYSKFADYE 438
>ref|NP_072756.1| (NC_000908) replicative DNA helicase (dnaB) [Mycoplasma genitalium]
pir||D64210 replicative DNA helicase (dnaB) homolog - Mycoplasma genitalium
gb|AAC71312.1| (U39689) replicative DNA helicase (dnaB) [Mycoplasma genitalium]
Length = 446
Score = 53.9 bits (128), Expect = 5e-006
Identities = 49/236 (20%), Positives = 107/236 (44%), Gaps = 22/236 (9%)
Query: 264 FSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQQ-DKGNVGTLFLEEPVRDTG 322
F L+++T G + GELI A G GKT ++ + S +Q D LF +R+
Sbjct: 178 FLKLDQLTSGWKPGELIVIAARPGRGKTALLINFMASAAKQIDPKTDVVLFFSLEMRN-- 235
Query: 323 LGMMSIQAEKPLHLTDTIYT---KEEFDNAYENTLGTGR------VFLYDSFGSNSVERI 373
I +H + T YT ++ +N +E + + L+D + S +++ I
Sbjct: 236 ---REIYQRHLMHESQTSYTLTNRQRINNVFEELMEASSRIKNLPIKLFD-YSSLTLQEI 291
Query: 374 VSMVRYLARSCDCKFIILDHISIV---VSDHAKDERKALDEIATKLKTLTVELDICLLMV 430
+ + ++++ + + +I+D++ +V +++ ++ + I+ LK E + ++
Sbjct: 292 RNQITEVSKTSNVRLVIIDYLQLVNALKNNYGLTRQQEVTMISQSLKAFAKEFNTPIIAA 351
Query: 431 SHLNRDKNRKPPEEGGTINLQDIRGTAGIGQLSNIIIALERNTQADDELERNTTRV 486
+ L+R R + L D+R + I Q +++++ + R E E T +
Sbjct: 352 AQLSR---RIEERKDSRPILSDLRESGSIEQDADMVLFIHRTNDDKKEQEEENTNL 404
>ref|NP_108071.1| (NC_002678) replicative DNA helicase (dnaB) [Mesorhizobium loti]
dbj|BAB54216.1| (AP003012) replicative DNA helicase; DnaB [Mesorhizobium loti]
Length = 487
Score = 49.3 bits (116), Expect = 1e-004
Identities = 51/225 (22%), Positives = 99/225 (43%), Gaps = 24/225 (10%)
Query: 267 LNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQ--------------QDKGNVGTL 312
L+R GL++ +LI G+GKT + I +++ + + G VG
Sbjct: 196 LDRRMGGLQSSDLIVLAGRPGMGKTSLATNIAFNIAEAYVPAQQADGSFKAANGGVVGFF 255
Query: 313 FLEEPVRDTGLGMMSIQAE-KPLHLTDTIYTKEEFDN--AYENTLGTGRVFLYDSFGSNS 369
LE ++S Q E + ++ +F+ A T+ +F+ D G S
Sbjct: 256 SLEMSSEQLATRIISEQTEISSSKIRRGEISEMDFEKLVACSQTMQKIPLFI-DQTGGIS 314
Query: 370 VERIVSMVRYLARSCDCKFIILDHISIVVSDHAKDER---KALDEIATKLKTLTVELDIC 426
+ ++ + R L R I++D+I ++ A+ + + + EI T LK L EL +
Sbjct: 315 IAQLSARARRLKRQRGLDLIVIDYIQLMQGSSARASQNRVQEITEITTGLKALAKELAVP 374
Query: 427 LLMVSHLNRDKNRKPPEEGGTINLQDIRGTAGIGQLSNIIIALER 471
++ +S L+R + E L D+R + I Q +++++ + R
Sbjct: 375 IIALSQLSRQVESR---EDKRPQLSDLRESGSIEQDADVVMFVYR 416
>ref|NP_439720.1| (NC_000907) replicative DNA helicase (dnaB) [Haemophilus influenzae
Rd]
pir||D64130 dnaB protein - Haemophilus influenzae (strain Rd KW20)
gb|AAC23217.1| (U32831) replicative DNA helicase (dnaB) [Haemophilus influenzae
Rd]
Length = 504
Score = 48.9 bits (115), Expect = 1e-004
Identities = 53/241 (21%), Positives = 101/241 (40%), Gaps = 9/241 (3%)
Query: 264 FSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQQDKGNVGTLFLEEPVRDTGL 323
F+ L++ T GL+ +LI A +GKT + + + V LE P +
Sbjct: 241 FTDLDKKTAGLQPSDLIIVAARPSMGKTTFAMNLCENAAMASEKPVLVFSLEMPAEQIMM 300
Query: 324 GMMSIQAEKPLHLTDTIYTKEEFD-NAYENTLG----TGRVFLYDSFGSNSVERIVSMVR 378
M++ A T +E + N + +G +F+ DS G + R
Sbjct: 301 RMIASLARVDQTKIRTGQNLDEIEWNKIASVVGMFKQKNNLFIDDSSGLTPTDVRSRARR 360
Query: 379 YLARSCDCKFIILDHISIVVSDHAKDERK-ALDEIATKLKTLTVELDICLLMVSHLNRDK 437
+ I++D++ ++ + D R + EI+ LK L EL + ++ +S LNR
Sbjct: 361 VYRENGGLSMIMVDYLQLMRAPAFSDNRTLEIAEISRSLKALAKELQVPVVALSQLNRTL 420
Query: 438 NRKPPEEGGTINLQDIRGTAGIGQLSNIIIALERNTQADDELERNTTRVRVIKNRFTGET 497
++ + +N D+R + I Q +++I+ + R+ +D E +I + G
Sbjct: 421 EQRGDKR--PVN-SDLRESGSIEQDADLIMFIYRDEVYNDNSEDKGVAEIIIGKQRNGPI 477
Query: 498 G 498
G
Sbjct: 478 G 478
>ref|NP_543071.1| (NC_003356) putative helicase [Bacteriophage P27]
emb|CAC83537.1| (AJ298298) putative helicase [Bacteriophage P27]
Length = 463
Score = 45.8 bits (107), Expect = 0.001
Identities = 53/247 (21%), Positives = 105/247 (42%), Gaps = 10/247 (4%)
Query: 267 LNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQQDKGNVGTLFLEEPVRDTGLGMM 326
L+ T G+ G+L+ A +GKT++ +I+ + +Q G V +E G M+
Sbjct: 184 LDAKTGGMEPGDLVFIAARPSMGKTELALDIIDKVTEQGHG-VLLFTMEMANIQIGERMV 242
Query: 327 SIQAEKPLHLTDTIYTKEEFDNA-YENTLG--TGR-VFLYDSFGSNSVERIVSMVRYLAR 382
S P+ ++ E+ D A + +G TGR +++ D E + ++ +
Sbjct: 243 SAAGGMPVSRLKSVAHFEDEDWARFSQGVGRMTGRNIWMVDQANLTIDEICATTKHHMIK 302
Query: 383 SCDCKFIILDHISIVVSDHAKDERKALDEIATKLKTLTVELDICLLMVSHLNRDKNRKPP 442
+ +++D++ ++ + A+ EI+ LK L L+ +S L+R +P
Sbjct: 303 HPETALVVVDYLGLIKTRTTGRHDLAVGEISKGLKGLAKSGGFPLIALSQLSRGVESRPN 362
Query: 443 EEGGTINLQDIRGTAGIGQLSNIIIALERN-TQADDELERNTTRVRVIKNRFTGETGVAD 501
+ D++ + I ++II+ L R+ D R + + K R G G
Sbjct: 363 KRPMN---SDLKNSGEIEADADIILMLYRDEVYNPDTQARGIAEINITKQR-NGSLGTIY 418
Query: 502 SLSYSRH 508
Y+ H
Sbjct: 419 RRFYNGH 425
>ref|NP_458617.1| (NC_003198) DNA helicase [Salmonella enterica subsp. enterica
serovar Typhi]
emb|CAD09305.1| (AL627282) DNA helicase [Salmonella enterica subsp. enterica
serovar Typhi]
Length = 454
Score = 45.1 bits (105), Expect = 0.002
Identities = 43/215 (20%), Positives = 99/215 (46%), Gaps = 9/215 (4%)
Query: 264 FSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQQDKGNVGTLF-LEEPVRDTG 322
F+ L+ T G + G+LI A +GKT + + ++ G + +E P +
Sbjct: 191 FAELDEKTCGWQDGDLILLGARPSMGKTAEALKHAVAALEGSPGKTVQFYSIEMPTQQLI 250
Query: 323 LGMMSIQAEKPL-HLTDTIYTKEEF----DNAYENTLGTGRVFLYDSFGSNSVERIVSMV 377
+ ++S+ A P +L + ++ ++ D + T GR+ + D+ +S
Sbjct: 251 MRLLSMLARVPFDNLRKGLLSEGQWSLLADAVAKLTSWEGRLLIDDTSYQTPSTLRISAR 310
Query: 378 RYLARSCDCKFIILDHISIVVSDHAKDERKALDEIATKLKTLTVELDICLLMVSHLNRDK 437
R + + I++D++ ++ ++ + + EI+ LK+L E+ + +S LNR
Sbjct: 311 RNVRKYGQPSLILVDYLQLMSCPGRENRTQEIQEISRSLKSLAKEIRCPVAALSQLNRSV 370
Query: 438 NRKPPEEGGTINLQDIRGTAGIGQLSNIIIALERN 472
++ + + D+R + + Q +++I+ L R+
Sbjct: 371 EQRQDKRP---MVSDLRDSGSLEQDADVIMFLYRD 402
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.316 0.135 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 330,934,130
Number of Sequences: 1038431
Number of extensions: 13855045
Number of successful extensions: 33097
Number of sequences better than 5.0e-02: 117
Number of HSP's better than 0.1 without gapping: 14
Number of HSP's successfully gapped in prelim test: 103
Number of HSP's that attempted gapping in prelim test: 32995
Number of HSP's gapped (non-prelim): 128
length of query: 517
length of database: 328,470,697
effective HSP length: 126
effective length of query: 391
effective length of database: 197,628,391
effective search space: 77272700881
effective search space used: 77272700881
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 94 (40.8 bits)
Query= orf22 [4571-4645]
(73 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.315 0.140 1.40e-076
Gapped
Lambda K H
0.267 0.0410 1.31e+214
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 51,195,436
Number of Sequences: 1038431
Number of extensions: 1864250
Number of successful extensions: 4127
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4127
Number of HSP's gapped (non-prelim): 0
length of query: 73
length of database: 328,470,697
effective HSP length: 49
effective length of query: 24
effective length of database: 277,587,578
effective search space: 6662101872
effective search space used: 6662101872
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf23 [4646-4762]
(115 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.321 0.139 1.31e+214
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 85,859,441
Number of Sequences: 1038431
Number of extensions: 3460041
Number of successful extensions: 7747
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 7747
Number of HSP's gapped (non-prelim): 0
length of query: 115
length of database: 328,470,697
effective HSP length: 91
effective length of query: 24
effective length of database: 233,973,476
effective search space: 5615363424
effective search space used: 5615363424
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf24 [4757-5338]
(580 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064753.1| (NC_002519) RP DNA pol [Roseophage SIO1] >... 1180 0.0
ref|NP_570330.1| (NC_003390) DNA polymerase [Cyanophage P60... 209 7e-053
ref|NP_052093.1| (NC_001271) DNA polymerase [Bacteriophage ... 206 6e-052
ref|NP_523320.1| (NC_003298) DNA polymerase [Bacteriophage ... 202 1e-050
ref|NP_041982.1| (NC_001604) gene 5, DNA polymerase [Entero... 197 2e-049
pdb|1T7P|A Chain A, T7 Dna Polymerase Complexed To Dna Prim... 196 5e-049
ref|NP_052094.1| (NC_001271) 5B protein [Bacteriophage phiY... 172 8e-042
ref|NP_523322.1| (NC_003298) gene 5B protein [Bacteriophage... 171 3e-041
ref|NP_229419.1| (NC_000853) DNA-directed DNA polymerase I ... 113 7e-024
gb|AAD28505.1|AF121780_1 (AF121780) DNA polymerase I [Rhodo... 110 6e-023
ref|NP_254180.1| (NC_002516) DNA polymerase I [Pseudomonas ... 109 1e-022
gb|AAG43104.1|AF038544_1 (AF038544) DNA polymerase I [Brach... 108 2e-022
gb|AAG43102.1|AF038542_1 (AF038542) DNA polymerase I [Pseud... 107 4e-022
sp|Q9RAA9|DPO1_RICFE DNA polymerase I (POL I) >gi|5912337|e... 104 3e-021
ref|NP_283286.1| (NC_003116) DNA polymerase I [Neisseria me... 103 4e-021
gb|AAG43148.1|AF063186_1 (AF063186) DNA polymerase I [Rhodo... 103 6e-021
ref|NP_274975.1| (NC_003112) DNA polymerase I [Neisseria me... 102 1e-020
sp|O52225|DPO1_THEFI DNA polymerase I, thermostable (TFI po... 101 2e-020
gb|AAF24859.2|AF050678_1 (AF050678) DNA polymerase I [Therm... 99 1e-019
pdb|1KFD| Dna Polymerase I (Klenow Fragment) (E.C.2.7.7.7)... 98 3e-019
ref|NP_418300.1| (NC_000913) DNA polymerase I, 3' --> 5' po... 98 3e-019
ref|NP_290488.1| (NC_002655) DNA polymerase I, 3' --> 5' po... 97 4e-019
emb|CAB56093.1| (AJ238761) DNA polymerase I [Rickettsia rhi... 97 5e-019
ref|NP_458049.1| (NC_003198) DNA polymerase I [Salmonella e... 95 2e-018
ref|NP_268299.1| (NC_002662) DNA polymerase I (EC 2.7.7.7) ... 94 6e-018
ref|NP_229767.1| (NC_002505) DNA polymerase I [Vibrio chole... 91 3e-017
ref|NP_644409.1| (NC_003919) DNA polymerase I [Xanthomonas ... 91 3e-017
ref|NP_295430.1| (NC_001263) DNA-directed DNA polymerase [D... 91 3e-017
pdb|5KTQ|A Chain A, Large Fragment Of Taq Dna Polymerase Bo... 90 7e-017
sp|P19821|DPO1_THEAQ DNA polymerase I, thermostable (Taq po... 90 7e-017
pdb|4KTQ|A Chain A, Binary Complex Of The Large Fragment Of... 90 7e-017
ref|NP_372214.1| (NC_002758) DNA polymerase I [Staphylococc... 87 7e-016
ref|NP_188690.1| (NM_112946) DNA polymerase, putative; prot... 45 0.002
>ref|NP_064753.1| (NC_002519) RP DNA pol [Roseophage SIO1]
gb|AAG02598.1|AF189021_17 (AF189021) Roseophage SIO1 complete genome
Length = 580
Score = 1180 bits (3053), Expect = 0.0
Identities = 580/580 (100%), Positives = 580/580 (100%)
Query: 1 MEVVFDIETDALDATVIHVLVAKRVGQKGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIP 60
MEVVFDIETDALDATVIHVLVAKRVGQKGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIP
Sbjct: 1 MEVVFDIETDALDATVIHVLVAKRVGQKGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIP 60
Query: 61 QIKKLWGYGIPLKDVADTLVMSRLLDPTRKGGHSLDALSGNEKIDFHDFSTYTPEMLAYC 120
QIKKLWGYGIPLKDVADTLVMSRLLDPTRKGGHSLDALSGNEKIDFHDFSTYTPEMLAYC
Sbjct: 61 QIKKLWGYGIPLKDVADTLVMSRLLDPTRKGGHSLDALSGNEKIDFHDFSTYTPEMLAYC 120
Query: 121 KQDVAINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLR 180
KQDVAINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLR
Sbjct: 121 KQDVAINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLR 180
Query: 181 ETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCNVQGDYTKIAWEEFNLGS 240
ETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCNVQGDYTKIAWEEFNLGS
Sbjct: 181 ETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCNVQGDYTKIAWEEFNLGS 240
Query: 241 PAQVNKRLDRLGWKPTVKTKSGNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQ 300
PAQVNKRLDRLGWKPTVKTKSGNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQ
Sbjct: 241 PAQVNKRLDRLGWKPTVKTKSGNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQ 300
Query: 301 EWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQ 360
EWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQ
Sbjct: 301 EWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQ 360
Query: 361 AFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVIDGDIHTFNKEALGRYCKDRPTAK 420
AFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVIDGDIHTFNKEALGRYCKDRPTAK
Sbjct: 361 AFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVIDGDIHTFNKEALGRYCKDRPTAK 420
Query: 421 TFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKSQASQAASMGWMKGLD 480
TFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKSQASQAASMGWMKGLD
Sbjct: 421 TFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKSQASQAASMGWMKGLD 480
Query: 481 GRVLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDED 540
GRVLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDED
Sbjct: 481 GRVLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDED 540
Query: 541 QAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
QAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH
Sbjct: 541 QAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
>ref|NP_570330.1| (NC_003390) DNA polymerase [Cyanophage P60]
gb|AAL73268.1| (AF338467) DNA polymerase [Cyanophage P60]
Length = 587
Score = 209 bits (532), Expect = 7e-053
Identities = 182/628 (28%), Positives = 282/628 (43%), Gaps = 89/628 (14%)
Query: 1 MEVVFDIETDAL--DATVIHVLVAKRVG---QKGFYVVRDAETFKR----LAKQVTLWIG 51
M++ FDIETD L D T+IH +VA+ + + F D + K L+K LW G
Sbjct: 1 MKLAFDIETDGLIPDLTIIHCIVARDIDTDEEFRFDGTGDYPSIKEGLELLSKADELW-G 59
Query: 52 HNVIGFDIPQIKKLWGYGIPLKDVADTLVMSRL----------------LDPTRKGGHSL 95
HN++ +D P I+KL P DTL++SRL + G HSL
Sbjct: 60 HNIVNYDYPAIQKLHPDWTPPSCTRDTLILSRLFFTDLLDRDFRSRPALMPGNLYGRHSL 119
Query: 96 DALS---GNEKIDFH-----DFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASIQ 147
+A G+ K +F D+STY+PEML YC +DV ++ + ++ + + S+
Sbjct: 120 EAWGHRLGHHKSEFGKSLEGDWSTYSPEMLEYCARDVEVSVALAKTFVPKIPEY-QWSVD 178
Query: 148 LEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKR 207
EH++ I+ QE+ GF D A+ + E + + +++E + V + KR
Sbjct: 179 TEHEIARIMSWQEQMGFPFDVRAAQALEGKLRLELDTLSDDMRETFHFVDGGV--MTPKR 236
Query: 208 KKDGSIYSNQLLEGCNVQGDYTKIAWEEFNLGSPAQVNKRLDR-LGWKPTVKTKSGNSYK 266
Y + K+ EFN S + + GW+P +T G K
Sbjct: 237 SNKVRHYFEN--------APFCKL--REFNPTSRHHIAWAFEHHRGWEPKERTAGGQP-K 285
Query: 267 ICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQ------EWLQKSQETGRVHGRVILTG 320
I E L I + L ++LE + L Q WL K + GR+H +L
Sbjct: 286 IDDEILREI-----NTKESLAFARILELQKHLGQLSEGKNAWL-KLERKGRLHHSCVLN- 338
Query: 321 AVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQL 380
T R AH PN+A +PS E R FK + L VG+DA+G++L
Sbjct: 339 TNTGRQAHMRPNLAQVPSAH----------------EYRSLFKPSDNHLQVGSDASGLEL 382
Query: 381 RVLAHYMN---DPIYTEQVIDGDIHTFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIAS 437
R L HY++ + E+V++GDIHT E G DR + K Y + G G +
Sbjct: 383 RCLGHYLSRYDGGKFAEEVVNGDIHTALAEIYGT---DRKSGKGVTYCLIYGGGNHKLGL 439
Query: 438 ILGCN----NRQANEAMANFYEAIPSLKKLKSQASQAASMGWMKGLDGRVLRI-GSDHLA 492
G + +R+ E + + L + + A G +KGLDGR +R+ G +H A
Sbjct: 440 TAGASKSSASRKGQEIRGKIMQGLSGFADLNAAIQERAKSGVLKGLDGRPIRLQGKNHAA 499
Query: 493 LSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQA 552
L+ LQ +I +L + ++ GI++ A+VHDE Q V DQA+ ++ A
Sbjct: 500 LNYLLQSAGAIICKLWVIRTHELLQEAGIDYYPLAFVHDEQQLSVRADQAEMAAQLTTLA 559
Query: 553 IKDAGTFFKLNCPMDGEAKIGKNWLETH 580
+KD K C +D E +IG +W + H
Sbjct: 560 MKDVEHQVKFRCALDSEYQIGNSWADCH 587
>ref|NP_052093.1| (NC_001271) DNA polymerase [Bacteriophage phiYeO3-12]
emb|CAB63614.1| (AJ251805) DNA polymerase [Bacteriophage phiYeO3-12]
Length = 704
Score = 206 bits (524), Expect = 6e-052
Identities = 184/688 (26%), Positives = 292/688 (41%), Gaps = 149/688 (21%)
Query: 24 RVGQKGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIPQIKKLW------GYGIPLKDVAD 77
R G G Y+ + + K+ L + HN +D+P ++KL + +P ++ D
Sbjct: 35 RPGDFGAYL----DALEAEVKRGGLIVFHNGHKYDVPALEKLAKLQLNRDFKLPRENCID 90
Query: 78 TLVMSRLLDPTRK-----------------GGHSLDALS--------------------- 99
TLV+SRLL K G H+L+A
Sbjct: 91 TLVLSRLLHANLKDTDMGLLRSGKLPGRRFGSHALEAWGYRLGEMKGEYKDDFKAMLEEQ 150
Query: 100 GNEKIDFHDFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKA--------------- 144
G E +D ++ ++ M+AY QDV + K+ + +L F
Sbjct: 151 GEEYVDGAEWVSFNEPMMAYNVQDVVVTVKLLEKFLTDLHYFPAGMDFTKYDADLFWREA 210
Query: 145 --SIQLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPV-- 200
S+ +EH+ ++ +QE+NGF +T EE+Y + + ++ E P
Sbjct: 211 GESVDIEHRAAWLLAKQERNGFPFNTQAIEELYVELAAKRAELLRKLTETFGSWYQPKGG 270
Query: 201 KEVI--------------IKRKKDGSIYSN-------QLLEGCNVQ-------GDYTKIA 232
KE+ + K G +Y + LE C++ YT +
Sbjct: 271 KELFKHPRTGKPLPSYPRVVYPKQGGVYKKPRNKAQREGLEPCDLDTRDYMEGAPYTPVE 330
Query: 233 WEEFNLGSPAQVNKRLDRLGWKPTVKTKSGNSYKICPENLATIPDTAPE---AVKGLKAW 289
+ F S + K+L GW P T+ G + E L + PE + +K +
Sbjct: 331 FVTFKPSSRDHIQKKLQEAGWVPEKFTEKGAPV-VDDEVLEHVRVDDPEKQACIDLIKEY 389
Query: 290 KVLETRWKLAQE----WLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDG 345
+++ R A E WL+ Q+ G++HG V GAVT RA H PN+A +P V
Sbjct: 390 LMIQKRIGQAAEGDKAWLRYVQDDGKIHGAVNPNGAVTGRATHSFPNLAQVPGV------ 443
Query: 346 ILWKMEGMYGAECRQAFKVPE------GK--LLVGTDAAGIQLRVLAHYM---NDPIYTE 394
YG +CR AF GK + G DA+G++LR LAH+M ++ Y
Sbjct: 444 -----RSPYGEQCRSAFGAEHHLDGITGKPWVQAGIDASGLELRCLAHFMARFDNGEYAN 498
Query: 395 QVIDGDIHTFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFY 454
++++GDIHT N+ A +D AKTFIY FL GAG I I+G + E F
Sbjct: 499 EILNGDIHTKNQNAAELPTRD--NAKTFIYGFLYGAGDEKIGQIVGAGKERGKELKKKFL 556
Query: 455 EAIPSLKKLK-----SQASQAASMG----------WMKGLDGRVLRIGSDHLALSVYLQG 499
E P++ L+ S + +A +G W+KGLDGR + + S H AL+ LQ
Sbjct: 557 ENTPAIAALREAIQQSLVNSSAWIGGEQKVQWKRRWIKGLDGRKVHVRSPHAALNTLLQS 616
Query: 500 GETVIMRLANVFWQRQAKKEGI------NFKQCAWVHDEWQTEV-DEDQAQRLGEIQVQA 552
+I +L + + ++G+ +F Q W+HDE Q E+ A+ + E+ +A
Sbjct: 617 AGALICKLWIIKTEEMLIEKGLKHGWDGDFAQMGWIHDEVQIACRTEEVAKVVVEVAQEA 676
Query: 553 IKDAGTFFKLNCPMDGEAKIGKNWLETH 580
++ G + C +D E KIG NW H
Sbjct: 677 MRWVGEHWNFRCRLDTEGKIGPNWAVCH 704
>ref|NP_523320.1| (NC_003298) DNA polymerase [Bacteriophage T3]
sp|P20311|DPOL_BPT3 DNA polymerase
pir||S07512 DNA-directed DNA polymerase (EC 2.7.7.7) - phage T3
emb|CAA35140.1| (X17255) 5 [Bacteriophage T3]
emb|CAC86283.1| (AJ318471) DNA polymerase [Bacteriophage T3]
Length = 704
Score = 202 bits (513), Expect = 1e-050
Identities = 184/690 (26%), Positives = 288/690 (41%), Gaps = 153/690 (22%)
Query: 24 RVGQKGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIPQIKKLWG------YGIPLKDVAD 77
R G G Y+ + + K+ L + HN +D+P + KL + +P ++ D
Sbjct: 35 RPGDFGAYL----DALEAEVKRGGLIVFHNGHKYDVPALTKLAKLQLNREFHLPRENCID 90
Query: 78 TLVMSRLLDPTRK-----------------GGHSLDALS--------------------- 99
TLV+SRL+ K G H+L+A
Sbjct: 91 TLVLSRLIHSNLKDTDMGLLRSGKLPGKRFGSHALEAWGYRLGEMKGEYKDDFKRMLEEQ 150
Query: 100 GNEKIDFHDFSTYTPEMLAYCKQDVAI---------NEKVYLQLKEELSNFGKAS----- 145
G E +D ++ + EM+ Y QDV + ++K Y + + ++ G +
Sbjct: 151 GEEYVDGMEWWNFNEEMMDYNVQDVVVTKALLEKLLSDKHYFPPEIDFTDVGYTTFWSES 210
Query: 146 ---IQLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLRETNRIEAEIKE----------- 191
+ +EH+ ++ +QE+NGF DT EE+Y + + + E
Sbjct: 211 LEAVDVEHRAAWLLAKQERNGFPFDTKAIEELYVELAARRSELLRNLTETFGSWYQPKGG 270
Query: 192 ---FMVP-----------IAVPVKEVIIKRKKDGSIYSNQLLEGCNVQ-------GDYTK 230
F P I +P I K+ K+ + + E C + YT
Sbjct: 271 TEMFCHPRTGKPLPKYPRIKIPKVGGIFKKPKNKAQREGR--EPCELDTREYVAGAPYTP 328
Query: 231 IAWEEFNLGSPAQVNKRLDRLGWKPTVKTKSGNSYKICPENLATIPDTAPE---AVKGLK 287
+ FN S + K+L GW PT T G + E L + PE A+ +K
Sbjct: 329 VEHVVFNPSSRDHIQKKLQEAGWVPTKFTDKGAPV-VDDEVLEGVRVDDPEKQAAIDLIK 387
Query: 288 AWKVLETRWKLAQE----WLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGK 343
+ +++ R + E WL+ E G++HG V GAVT RA H PN+A IP V
Sbjct: 388 EYLMIQKRIGQSAEGDKAWLRYVAEDGKIHGSVNPNGAVTGRATHAFPNLAQIPGV---- 443
Query: 344 DGILWKMEGMYGAECRQAFKVPE------GK--LLVGTDAAGIQLRVLAHYM---NDPIY 392
YG +CR AF GK + G DA+G++LR LAH+M ++ Y
Sbjct: 444 -------RSPYGEQCRAAFGAEHHLDGITGKPWVQAGIDASGLELRCLAHFMARFDNGEY 496
Query: 393 TEQVIDGDIHTFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMAN 452
++++GDIHT N+ A +D AKTFIY FL GAG I I+G + E
Sbjct: 497 AHEILNGDIHTKNQMAAELPTRD--NAKTFIYGFLYGAGDEKIGQIVGAGKERGKELKKK 554
Query: 453 FYEAIPSLKKLKSQASQAA---------------SMGWMKGLDGRVLRIGSDHLALSVYL 497
F E P++ L+ Q W+KGLDGR + + S H AL+ L
Sbjct: 555 FLENTPAIAALRESIQQTLVESSQWVAGEQQVKWKRRWIKGLDGRKVHVRSPHAALNTLL 614
Query: 498 QGGETVIMRLANVFWQRQAKKEGI------NFKQCAWVHDEWQTEV-DEDQAQRLGEIQV 550
Q +I +L + + ++G+ +F AW+HDE Q E+ A+ + E+
Sbjct: 615 QSAGALICKLWIIKTEEMLVEKGLKHGWDGDFAYMAWIHDEIQVACRTEEIAKTVIEVAQ 674
Query: 551 QAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
+A++ G + C +D E K+G NW E H
Sbjct: 675 EAMRWVGEHWNFRCLLDTEGKMGANWKECH 704
>ref|NP_041982.1| (NC_001604) gene 5, DNA polymerase [Enterobacteria phage T7]
sp|P00581|DPOL_BPT7 DNA polymerase (T7 DNA polymerase)
pir||DJBPT7 DNA-directed DNA polymerase (EC 2.7.7.7) phage chain - phage T7
emb|CAA24412.1| (V01146) gene 5, DNA polymerase [Enterobacteria phage T7]
Length = 704
Score = 197 bits (502), Expect = 2e-049
Identities = 179/664 (26%), Positives = 273/664 (40%), Gaps = 145/664 (21%)
Query: 48 LWIGHNVIGFDIPQIKKLWG------YGIPLKDVADTLVMSRLLDPTRK----------- 90
L + HN +D+P + KL + +P ++ DTLV+SRL+ K
Sbjct: 55 LIVFHNGHKYDVPALTKLAKLQLNREFHLPRENCIDTLVLSRLIHSNLKDTDMGLLRSGK 114
Query: 91 ------GGHSLDALS---------------------GNEKIDFHDFSTYTPEMLAYCKQD 123
G H+L+A G E +D ++ + EM+ Y QD
Sbjct: 115 LPGKRFGSHALEAWGYRLGEMKGEYKDDFKRMLEEQGEEYVDGMEWWNFNEEMMDYNVQD 174
Query: 124 VAI---------NEKVYLQLKEELSNFGKAS--------IQLEHQMQAIVCEQEKNGFML 166
V + ++K Y + + ++ G + + +EH+ ++ +QE+NGF
Sbjct: 175 VVVTKALLEKLLSDKHYFPPEIDFTDVGYTTFWSESLEAVDIEHRAAWLLAKQERNGFPF 234
Query: 167 DTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPV----------------KEVIIKRKKD 210
DT EE+Y + + ++ E P K IK K
Sbjct: 235 DTKAIEELYVELAARRSELLRKLTETFGSWYQPKGGTEMFCHPRTGKPLPKYPRIKTPKV 294
Query: 211 GSIYSN-------QLLEGCNVQ-------GDYTKIAWEEFNLGSPAQVNKRLDRLGWKPT 256
G I+ + E C + YT + FN S + K+L GW PT
Sbjct: 295 GGIFKKPKNKAQREGREPCELDTREYVAGAPYTPVEHVVFNPSSRDHIQKKLQEAGWVPT 354
Query: 257 VKTKSGNSYKICPENLATIPDTAPE---AVKGLKAWKVLETRWKLAQE----WLQKSQET 309
T G + E L + PE A+ +K + +++ R + E WL+ E
Sbjct: 355 KYTDKGAPV-VDDEVLEGVRVDDPEKQAAIDLIKEYLMIQKRIGQSAEGDKAWLRYVAED 413
Query: 310 GRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPE--- 366
G++HG V GAVT RA H PN+A IP V YG +CR AF
Sbjct: 414 GKIHGSVNPNGAVTGRATHAFPNLAQIPGV-----------RSPYGEQCRAAFGAEHHLD 462
Query: 367 ---GK--LLVGTDAAGIQLRVLAHYM---NDPIYTEQVIDGDIHTFNKEALGRYCKDRPT 418
GK + G DA+G++LR LAH+M ++ Y ++++GDIHT K + R
Sbjct: 463 GITGKPWVQAGIDASGLELRCLAHFMARFDNGEYAHEILNGDIHT--KNQIAAELPTRDN 520
Query: 419 AKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKSQASQAA------- 471
AKTFIY FL GAG I I+G + E F E P++ L+ Q
Sbjct: 521 AKTFIYGFLYGAGDEKIGQIVGAGKERGKELKKKFLENTPAIAALRESIQQTLVESSQWV 580
Query: 472 --------SMGWMKGLDGRVLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGI-- 521
W+KGLDGR + + S H AL+ LQ +I +L + + ++G+
Sbjct: 581 AGEQQVKWKRRWIKGLDGRKVHVRSPHAALNTLLQSAGALICKLWIIKTEEMLVEKGLKH 640
Query: 522 ----NFKQCAWVHDEWQTEV-DEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNW 576
+F AWVHDE Q E+ AQ + E +A++ G + C +D E K+G NW
Sbjct: 641 GWDGDFAYMAWVHDEIQVGCRTEEIAQVVIETAQEAMRWVGDHWNFRCLLDTEGKMGPNW 700
Query: 577 LETH 580
H
Sbjct: 701 AICH 704
>pdb|1T7P|A Chain A, T7 Dna Polymerase Complexed To Dna PrimerTEMPLATE,A
Nucleoside Triphosphate, And Its Processivity Factor
Thioredoxin
Length = 698
Score = 196 bits (499), Expect = 5e-049
Identities = 178/658 (27%), Positives = 271/658 (41%), Gaps = 139/658 (21%)
Query: 48 LWIGHNVIGFDIPQIKKLWG------YGIPLKDVADTLVMSRLLDPTRKG---------- 91
L + HN +D+P + KL + +P ++ DTLV+SRL+ K
Sbjct: 55 LIVFHNGHKYDVPALTKLAKLQLNREFHLPRENCIDTLVLSRLIHSNLKDTDMGLLRSGK 114
Query: 92 --------GHSLDALSGNEKIDFH--------------DFSTYTPEMLAYCKQDVAI--- 126
G+ L + G K DF ++ + EM+ Y QDV +
Sbjct: 115 LPGALEAWGYRLGEMKGEYKDDFKRMLEEQGEEYVDGMEWWNFNEEMMDYNVQDVVVTKA 174
Query: 127 ------NEKVYLQLKEELSNFGKAS--------IQLEHQMQAIVCEQEKNGFMLDTDIAE 172
++K Y + + ++ G + + +EH+ ++ +QE+NGF DT E
Sbjct: 175 LLEKLLSDKHYFPPEIDFTDVGYTTFWSESLEAVDIEHRAAWLLAKQERNGFPFDTKAIE 234
Query: 173 EIYTTCLRETNRIEAEIKEFMVPIAVPV----------------KEVIIKRKKDGSIYSN 216
E+Y + + ++ E P K IK K G I+
Sbjct: 235 ELYVELAARRSELLRKLTETFGSWYQPKGGTEMFCHPRTGKPLPKYPRIKTPKVGGIFKK 294
Query: 217 -------QLLEGCNVQ-------GDYTKIAWEEFNLGSPAQVNKRLDRLGWKPTVKTKSG 262
+ E C + YT + FN S + K+L GW PT T G
Sbjct: 295 PKNKAQREGREPCELDTREYVAGAPYTPVEHVVFNPSSRDHIQKKLQEAGWVPTKYTDKG 354
Query: 263 NSYKICPENLATIPDTAPE---AVKGLKAWKVLETRWKLAQE----WLQKSQETGRVHGR 315
+ E L + PE A+ +K + +++ R + E WL+ E G++HG
Sbjct: 355 APV-VDDEVLEGVRVDDPEKQAAIDLIKEYLMIQKRIGQSAEGDKAWLRYVAEDGKIHGS 413
Query: 316 VILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPE------GK- 368
V GAVT RA H PN+A IP V YG +CR AF GK
Sbjct: 414 VNPNGAVTGRATHAFPNLAQIPGV-----------RSPYGEQCRAAFGAEHHLDGITGKP 462
Query: 369 -LLVGTDAAGIQLRVLAHYM---NDPIYTEQVIDGDIHTFNKEALGRYCKDRPTAKTFIY 424
+ G DA+G++LR LAH+M ++ Y ++++GDIHT K + R AKTFIY
Sbjct: 463 WVQAGIDASGLELRCLAHFMARFDNGEYAHEILNGDIHT--KNQIAAELPTRDNAKTFIY 520
Query: 425 AFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKSQASQAA------------- 471
FL GAG I I+G + E F E P++ L+ Q
Sbjct: 521 GFLYGAGDEKIGQIVGAGKERGKELKKKFLENTPAIAALRESIQQTLVESSQWVAGEQQV 580
Query: 472 --SMGWMKGLDGRVLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGI------NF 523
W+KGLDGR + + S H AL+ LQ +I +L + + ++G+ +F
Sbjct: 581 KWKRRWIKGLDGRKVHVRSPHAALNTLLQSAGALICKLWIIKTEEMLVEKGLKHGWDGDF 640
Query: 524 KQCAWVHDEWQTEV-DEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
AWVHDE Q E+ AQ + E +A++ G + C +D E K+G NW H
Sbjct: 641 AYMAWVHDEIQVGCRTEEIAQVVIETAQEAMRWVGDHWNFRCLLDTEGKMGPNWAICH 698
>ref|NP_052094.1| (NC_001271) 5B protein [Bacteriophage phiYeO3-12]
emb|CAB63615.1| (AJ251805) 5B protein [Bacteriophage phiYeO3-12]
Length = 384
Score = 172 bits (437), Expect = 8e-042
Identities = 124/393 (31%), Positives = 187/393 (47%), Gaps = 54/393 (13%)
Query: 228 YTKIAWEEFNLGSPAQVNKRLDRLGWKPTVKTKSGNSYKICPENLATIPDTAPE---AVK 284
YT + + F S + K+L GW P T+ G + E L + PE +
Sbjct: 6 YTPVEFVTFKPSSRDHIQKKLQEAGWVPEKFTEKGAPV-VDDEVLEHVRVDDPEKQACID 64
Query: 285 GLKAWKVLETRWKLAQE----WLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVP 340
+K + +++ R A E WL+ Q+ G++HG V GAVT RA H PN+A +P V
Sbjct: 65 LIKEYLMIQKRIGQAAEGDKAWLRYVQDDGKIHGAVNPNGAVTGRATHSFPNLAQVPGV- 123
Query: 341 HGKDGILWKMEGMYGAECRQAFKVPE------GK--LLVGTDAAGIQLRVLAHYM---ND 389
YG +CR AF GK + G DA+G++LR LAH+M ++
Sbjct: 124 ----------RSPYGEQCRSAFGAEHHLDGITGKPWVQAGIDASGLELRCLAHFMARFDN 173
Query: 390 PIYTEQVIDGDIHTFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEA 449
Y ++++GDIHT N+ A +D AKTFIY FL GAG I I+G + E
Sbjct: 174 GEYANEILNGDIHTKNQNAAELPTRD--NAKTFIYGFLYGAGDEKIGQIVGAGKERGKEL 231
Query: 450 MANFYEAIPSLKKLK-----SQASQAASMG----------WMKGLDGRVLRIGSDHLALS 494
F E P++ L+ S + +A +G W+KGLDGR + + S H AL+
Sbjct: 232 KKKFLENTPAIAALREAIQQSLVNSSAWIGGEQKVQWKRRWIKGLDGRKVHVRSPHAALN 291
Query: 495 VYLQGGETVIMRLANVFWQRQAKKEGI------NFKQCAWVHDEWQTEV-DEDQAQRLGE 547
LQ +I +L + + ++G+ +F Q W+HDE Q E+ A+ + E
Sbjct: 292 TLLQSAGALICKLWIIKTEEMLIEKGLKHGWDGDFAQMGWIHDEVQIACRTEEVAKVVVE 351
Query: 548 IQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
+ +A++ G + C +D E KIG NW H
Sbjct: 352 VAQEAMRWVGEHWNFRCRLDTEGKIGPNWAVCH 384
>ref|NP_523322.1| (NC_003298) gene 5B protein [Bacteriophage T3]
emb|CAA35141.1| (X17255) 5B [Bacteriophage T3]
emb|CAC86285.1| (AJ318471) gene 5B protein [Bacteriophage T3]
Length = 372
Score = 171 bits (432), Expect = 3e-041
Identities = 123/385 (31%), Positives = 180/385 (45%), Gaps = 54/385 (14%)
Query: 236 FNLGSPAQVNKRLDRLGWKPTVKTKSGNSYKICPENLATIPDTAPE---AVKGLKAWKVL 292
FN S + K+L GW PT T G + E L + PE A+ +K + ++
Sbjct: 2 FNPSSRDHIQKKLQEAGWVPTKFTDKGAPV-VDDEVLEGVRVDDPEKQAAIDLIKEYLMI 60
Query: 293 ETRWKLAQE----WLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILW 348
+ R + E WL+ E G++HG V GAVT RA H PN+A IP V
Sbjct: 61 QKRIGQSAEGDKAWLRYVAEDGKIHGSVNPNGAVTGRATHAFPNLAQIPGV--------- 111
Query: 349 KMEGMYGAECRQAFKVPE------GK--LLVGTDAAGIQLRVLAHYM---NDPIYTEQVI 397
YG +CR AF GK + G DA+G++LR LAH+M ++ Y +++
Sbjct: 112 --RSPYGEQCRAAFGAEHHLDGITGKPWVQAGIDASGLELRCLAHFMARFDNGEYAHEIL 169
Query: 398 DGDIHTFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAI 457
+GDIHT N+ A +D AKTFIY FL GAG I I+G + E F E
Sbjct: 170 NGDIHTKNQMAAELPTRD--NAKTFIYGFLYGAGDEKIGQIVGAGKERGKELKKKFLENT 227
Query: 458 PSLKKLKSQASQAA---------------SMGWMKGLDGRVLRIGSDHLALSVYLQGGET 502
P++ L+ Q W+KGLDGR + + S H AL+ LQ
Sbjct: 228 PAIAALRESIQQTLVESSQWVAGEQQVKWKRRWIKGLDGRKVHVRSPHAALNTLLQSAGA 287
Query: 503 VIMRLANVFWQRQAKKEGI------NFKQCAWVHDEWQTEV-DEDQAQRLGEIQVQAIKD 555
+I +L + + ++G+ +F AW+HDE Q E+ A+ + E+ +A++
Sbjct: 288 LICKLWIIKTEEMLVEKGLKHGWDGDFAYMAWIHDEIQVACRTEEIAKTVIEVAQEAMRW 347
Query: 556 AGTFFKLNCPMDGEAKIGKNWLETH 580
G + C +D E K+G NW E H
Sbjct: 348 VGEHWNFRCLLDTEGKMGANWKECH 372
>ref|NP_229419.1| (NC_000853) DNA-directed DNA polymerase I [Thermotoga maritima]
pir||E72232 DNA-directed DNA polymerase I - Thermotoga maritima (strain MSB8)
gb|AAD36686.1|AE001805_11 (AE001805) DNA-directed DNA polymerase I [Thermotoga maritima]
Length = 893
Score = 113 bits (282), Expect = 7e-024
Identities = 142/625 (22%), Positives = 267/625 (42%), Gaps = 109/625 (17%)
Query: 6 DIETDALD---ATVIHVLVAKRVGQKGFYV---------VRDAETFKRLAKQVTLWIGHN 53
D+ET +LD ++ + V+ + ++ +Y+ + + E K+L K++ G
Sbjct: 323 DLETSSLDPFDCDIVGISVSFKP-KEAYYIPLHHRNAQNLDEKEVLKKL-KEILEDPGAK 380
Query: 54 VIGFDIP-QIKKLWGYGI-PLKDVADTLVMSRLLDPTRKGGHSLDALS----GNEKIDFH 107
++G ++ K L G+ P+ DT++ + LL+P K +LD L+ G + +
Sbjct: 381 IVGQNLKFDYKVLMVKGVEPVPPYFDTMIAAYLLEPNEKK-FNLDDLALKFLGYKMTSYQ 439
Query: 108 DFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGFMLD 167
+ +++ + + DV + E+ +N+ CE D
Sbjct: 440 ELMSFSFPLFGFSFADVPV---------EKAANYS--------------CE--------D 468
Query: 168 TDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCNVQ-- 225
DI +Y T + + EA+++ I +P+ V+ + + +G + L+ + +
Sbjct: 469 ADITYRLYKTLSLKLH--EADLENVFYKIEMPLVNVLARMELNGVYVDTEFLKKLSEEYG 526
Query: 226 -------GDYTKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKSGNSYKI---CPENLAT 274
+ +IA E FN+ SP QV++ L ++LG KP KT Y E LA
Sbjct: 527 KKLEELAEEIYRIAGEPFNINSPKQVSRILFEKLGIKPRGKTTKTGDYSTRIEVLEELAG 586
Query: 275 IPDTAPEAVKGLKAWKVLETRWKLAQEWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMA 334
+ P ++ K K+ T + + + +TGR+H TG T R + PN+
Sbjct: 587 EHEIIPLILEYRKIQKLKSTYIDALPKMV--NPKTGRIHASFNQTGTATGRLSSSDPNLQ 644
Query: 335 NIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGK--LLVGTDAAGIQLRVLAHYMNDPIY 392
N+P+ + G E R+A VP+ +V D + I+LR+LAH D
Sbjct: 645 NLPT------------KSEEGKEIRKAI-VPQDPNWWIVSADYSQIELRILAHLSGDENL 691
Query: 393 TEQVIDG-DIHTFNKEALGRYCKDRPT------AKTFIYAFLLGAGTGMIASILGCNNRQ 445
+G D+HT + + T K ++ + G ++ LG ++
Sbjct: 692 LRAFEEGIDVHTLTASRIFNVKPEEVTEEMRRAGKMVNFSIIYGVTPYGLSVRLGVPVKE 751
Query: 446 ANEAMANFYEAIPSLKK-LKSQASQAASMGWMKGLDGRVLRIGS------------DHLA 492
A + + N++ P ++ ++ S+A G+++ L GR I + +A
Sbjct: 752 AEKMIVNYFVLYPKVRDYIQRVVSEAKEKGYVRTLFGRKRDIPQLMARDRNTQAEGERIA 811
Query: 493 LSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQA 552
++ +QG I++LA + R+ K+ + K VHDE EV ++ L V+
Sbjct: 812 INTPIQGTAADIIKLAMIEIDRELKERKMRSKMIIQVHDELVFEVPNEEKDAL----VEL 867
Query: 553 IKDAGT-FFKLNCPMDGEAKIGKNW 576
+KD T KL+ P++ + IGK W
Sbjct: 868 VKDRMTNVVKLSVPLEVDVTIGKTW 892
>gb|AAD28505.1|AF121780_1 (AF121780) DNA polymerase I [Rhodothermus marinus]
Length = 924
Score = 110 bits (274), Expect = 6e-023
Identities = 151/627 (24%), Positives = 256/627 (40%), Gaps = 121/627 (19%)
Query: 9 TDALDATVIHVLVAKRVGQKGFYVVRD------AET-FKRLA---KQVTLWIGHNVIGFD 58
T+A+ A+++ + + GQ G+YV ET +RLA ++ +G N + +D
Sbjct: 364 TEAMWASLVGIAFSWEKGQ-GYYVPTPLPDGTPTETVLERLAPILRRAQRKVGQN-LKYD 421
Query: 59 IPQIKKLWGYGIPLKDVADTLVMSRLLDPTRKGGHSLDAL-----------------SGN 101
+ + + G +P DT+V L+ P H+LD L SG
Sbjct: 422 LVVLAR-HGVQVP-PPYFDTMVAHYLIAPEEP--HNLDVLARQYLRYQMVSITELIGSGR 477
Query: 102 EKIDFHDFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASI--QLEHQMQAIVCEQ 159
++ D S E+ Y +D I ++ L EL G I ++E + ++ +
Sbjct: 478 DQKSMRDVSI--DEVGPYACEDTDIALQLADVLAAELDRHGLRHIAEEMEFPLIEVLADM 535
Query: 160 EKNGFMLDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLL 219
E+ G +D + EI ++EAE+ E V I
Sbjct: 536 ERTGICIDRAVLREI-------GKQLEAELHELEVKIY---------------------- 566
Query: 220 EGCNVQGDYTKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKSG--NSYKICPENLATIP 276
++A EFN+GSP Q+ L +LG KP +T +G ++ + + LAT
Sbjct: 567 ----------EVAGVEFNIGSPQQLADVLFKKLGLKPRARTSTGRPSTKESVLQELAT-- 614
Query: 277 DTAPEAVKGLKA-WKVLETRWKLAQEWLQK--SQETGRVHGRVILTGAVTHRAAHQGPNM 333
+ GL W+ L + L+ ETGR+H T T R + PN+
Sbjct: 615 ---QHPLPGLILDWRHLAKLKSTYVDGLEPLIHPETGRIHTTFNQTVTATGRLSSSNPNL 671
Query: 334 ANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYT 393
NIP + G E R+AF G L+ D I+LR+LA D
Sbjct: 672 QNIP------------VRTEMGREIRRAFVPRPGWKLLSADYVQIELRILAALSGDEALR 719
Query: 394 EQVIDG-DIHTFNKEALGRYCKDRPT------AKTFIYAFLLGAGTGMIASILGCNNRQA 446
++G DIHT + + ++ T AK Y G +A L C+ R+A
Sbjct: 720 RAFLEGQDIHTATAARVFKVPPEQVTPEQRRRAKMVNYGIPYGISAWGLAQRLRCSTREA 779
Query: 447 NEAMANFYEAIPSLKK-LKSQASQAASMGWMKGLDGR------------VLRIGSDHLAL 493
E + + A P + + L +A G+++ L GR R ++ +A+
Sbjct: 780 QELIEEYQRAFPGVTRYLHRVVEEARQKGYVETLLGRRRYVPNINSRNRAERSMAERIAV 839
Query: 494 SVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAI 553
++ +QG + +++LA V + K+EG K VHDE E+ ++ + + ++ Q +
Sbjct: 840 NMPIQGTQADMIKLAMVHIYHRLKREGYRAKMLLQVHDELVFEMPPEEVEPVRQLVEQEM 899
Query: 554 KDAGTFFKLNCPMDGEAKIGKNWLETH 580
K A P++ + +G NWL+ H
Sbjct: 900 KQALPL--EGVPIEVDIGVGDNWLDAH 924
>ref|NP_254180.1| (NC_002516) DNA polymerase I [Pseudomonas aeruginosa]
pir||F82958 DNA polymerase I PA5493 [imported] - Pseudomonas aeruginosa
(strain PAO1)
gb|AAG08878.1|AE004962_2 (AE004962) DNA polymerase I [Pseudomonas aeruginosa]
Length = 913
Score = 109 bits (272), Expect = 1e-022
Identities = 134/550 (24%), Positives = 230/550 (41%), Gaps = 92/550 (16%)
Query: 70 IPLKDVA-DTLVMSRLLDPTRKGGHSLDALS----GNEKIDFHDFSTYTPEMLAYCKQDV 124
I L+ VA DT++ S +LD T H +D+L+ G+ I F D +
Sbjct: 417 IALRGVAYDTMLESYVLDSTATR-HDMDSLALKYLGHSTIRFEDIA-------------- 461
Query: 125 AINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLRETNR 184
GK + QL AI EQ D D+ ++ ++
Sbjct: 462 -----------------GKGAKQLTFDQIAI--EQAGPYAAEDADVTLRLHQALWQKLEA 502
Query: 185 IEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLL---------EGCNVQGDYTKIAWEE 235
I + + + I +P+ V+ + +++G++ LL + ++ +A +E
Sbjct: 503 IPS-LARVLTDIEMPLVPVLARIERNGALVDANLLGIQSRELGEKMVALERQAYDLAGQE 561
Query: 236 FNLGSPAQVNKRL-DRLGWKPTVKTKSG--NSYKICPENLATIPDTAPEAVKGLKAWKVL 292
FNLGSP Q+ L D+LG KT +G ++ + LA P+ + ++ L
Sbjct: 562 FNLGSPKQLGAILYDKLGLPVLSKTATGQPSTAEAVLAELAEQDFELPKVIMQYRSMSKL 621
Query: 293 ETRW--KLAQEWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKM 350
++ + +L + Q + TGR+H A T R + PN+ NIP +
Sbjct: 622 KSTYTDRLPE---QINPRTGRIHTSYHQAVAATGRLSSSDPNLQNIP------------I 666
Query: 351 EGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQV-IDGDIHTFN-KEA 408
G RQAF P+G L+ D + I+LR++AH D + D D+H E
Sbjct: 667 RTAEGRRIRQAFVAPQGYKLLAADYSQIELRIMAHLAKDDGLLDAFRHDLDVHRATAAEV 726
Query: 409 LGRYCKD-----RPTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPS-LKK 462
G +D R +AK + + G +A +G ++A + ++ P L
Sbjct: 727 FGVPLEDVSGDQRRSAKAINFGLIYGMSAFGLAKQIGVERKEAQAYIDRYFARYPGVLAY 786
Query: 463 LKSQASQAASMGWMKGLDGR------------VLRIGSDHLALSVYLQGGETVIMRLANV 510
++ +QAA G+++ L GR +R ++ A++ +QG IM+ A V
Sbjct: 787 MERTRAQAAEQGFVETLFGRRLYLPEIHSKNGAMRKAAERTAINAPMQGTAADIMKRAMV 846
Query: 511 FWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGEA 570
++ G++ + VHDE EV ED +++ E + A T L+ P+ EA
Sbjct: 847 AVDNWLQESGLDARVILQVHDELVLEVREDLVEQVCEGIRPLMSGAAT---LDVPLVVEA 903
Query: 571 KIGKNWLETH 580
+G NW E H
Sbjct: 904 GVGSNWDEAH 913
>gb|AAG43104.1|AF038544_1 (AF038544) DNA polymerase I [Brachyspira hyodysenteriae]
Length = 920
Score = 108 bits (269), Expect = 2e-022
Identities = 152/640 (23%), Positives = 265/640 (40%), Gaps = 133/640 (20%)
Query: 3 VVFDIETDALDA---TVIHVLVAKRVGQKGFYVVRDAET-----------FKRLAKQVTL 48
V D ET LD T+I + A + + FY+ T F L K+
Sbjct: 352 VCIDFETTGLDTFKDTIIGISFAIK-SNEAFYLDLSGRTKIGKDKCMNLVFDTLEKEDIK 410
Query: 49 WIGHNVIGFDIPQIKKLWGYGIPLKDVA-DTLVMSRLLDPTRKGGHSLDALSGNEKIDFH 107
IGHN + ++ +K + G + ++ DT+V + L++PTR G +++D
Sbjct: 411 VIGHN-LKYEYKMMKAI---GKSIGNMYFDTMVAAYLINPTR-GRYNMD----------- 454
Query: 108 DFSTYTPEMLAYCKQDVAINEKVYLQLK-EELSNFGKASIQLEHQMQAIV---CEQEKNG 163
D+AI Y +K +L++ K ++ L+ ++ +V CE
Sbjct: 455 ---------------DLAIAYLSYNTIKYSDLTDNAKKTL-LDAPLKDVVEYACE----- 493
Query: 164 FMLDTDIAEEIYTTC--LRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEG 221
D D+ Y L +T+ +E E I +P+ V+ + DG S++ ++
Sbjct: 494 ---DADVTFRFYECFAPLLKTHNLE----ELFFNIEMPLVSVLADMEFDGVYISSERMKA 546
Query: 222 CNVQGDYTKI-----------AWEEFNLGSPAQVNKRL-DRLGWKPTVKTKSGNSYKICP 269
+ +Y+ + A EEFNL SP Q+ L D++G PT KTK+G S
Sbjct: 547 --LSDEYSSLLEKTKEKIYAEAGEEFNLQSPKQLEYILFDKMGIPPTKKTKTGFS----- 599
Query: 270 ENLATIPDTAPEAVKGLKAWKVLETRWKLAQ---EWLQK-----SQETGRVHGRVILTGA 321
T + E + K + + T K A+ +L +++TGR+H T
Sbjct: 600 ----TDEEVLTELSQKYKIAEYMLTYRKYAKLKNTYLDVFPTLINEKTGRIHASFNQTVT 655
Query: 322 VTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLR 381
T R + PN+ NIP + G G E R F +G LL+ D + I+LR
Sbjct: 656 ATGRLSSSEPNLQNIP------------VRGEEGREIRNTFIPEKGNLLIAADYSQIELR 703
Query: 382 VLAHYMNDPIYTEQVIDG-DIHTFNKEALGRYCKDRPT------AKTFIYAFLLGAGTGM 434
+LAH+ NDP E + DIH + K+ T AK ++ + G
Sbjct: 704 LLAHFSNDPALVEAFKNNDDIHRKTAMKIYSVSKEHVTASMRNIAKIINFSIIYGKTAFG 763
Query: 435 IASILGCNNRQANEAMANFYEAIPSLKKLKSQ-ASQAASMGWMKGLDGRV---------- 483
++ LG ++A + + + +K + + + G+++ + GR+
Sbjct: 764 LSKELGIKKKEAEDFIKGYSSTYSRVKPFCDEVVEEVRNKGFVRTMCGRIRDLSKTINSS 823
Query: 484 ---LRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDED 540
R ++ +AL+ +QG ++++A V ++ K K VHDE EV E
Sbjct: 824 NAMARNEAERMALNTLIQGSAADMIKVAMVAIHKEFKNHLKTAKIVMQVHDELVVEVSEA 883
Query: 541 QAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
+A + I + ++ + K N P+ + G +W E H
Sbjct: 884 EADKAMTIMKEIMEHS---VKANVPIIVDIHKGNSWGEVH 920
>gb|AAG43102.1|AF038542_1 (AF038542) DNA polymerase I [Pseudomonas fluorescens]
Length = 939
Score = 107 bits (267), Expect = 4e-022
Identities = 119/472 (25%), Positives = 196/472 (41%), Gaps = 53/472 (11%)
Query: 142 GKASIQLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVK 201
GK + QL A+ EQ D D+ ++ T + + I + + + I +P+
Sbjct: 488 GKGAKQLTFDQIAL--EQAGPYAAEDADVTLRLHQTLFEKLSAIPS-LASVLTDIEIPLV 544
Query: 202 EVIIKRKKDGSIYSNQLL---------EGCNVQGDYTKIAWEEFNLGSPAQVNKRL-DRL 251
V+ + ++ G+ LL + ++ + +IA EEFNLGSP Q+ L ++L
Sbjct: 545 PVLARIERQGAFVDADLLGIQSIELGNKMVALEREAFEIAGEEFNLGSPKQLGVILYEKL 604
Query: 252 GWKPTVKTKSGNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQEWL--QKSQET 309
G KT G E LA + + K L + + + L Q + T
Sbjct: 605 GLPVLKKTAKGQP-STAEEVLAKLAEDDHRLPKVLMEHRSMSKLKSTYTDRLPEQINPRT 663
Query: 310 GRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKL 369
GR+H A T R + PN+ NIP + G RQAF P+G
Sbjct: 664 GRIHTSYHQAVASTGRLSSSDPNLQNIP------------VRTAEGRRIRQAFVAPKGYK 711
Query: 370 LVGTDAAGIQLRVLAHYMNDPIYTEQVIDG-DIHTFN-----KEALGRYCKD-RPTAKTF 422
L+ D + I+LR++AH D + D+HT K L D R AK
Sbjct: 712 LLAADYSQIELRIMAHLSRDEGLMNAFRNNLDVHTATAAEVFKVELNEVTSDQRRGAKAI 771
Query: 423 IYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKSQA-SQAASMGWMKGLDG 481
+ + G G + +G + + A + ++ P +++ + +QAA G+++ + G
Sbjct: 772 NFGLIYGMGAQKLGKDIGVDTKTAKAYIDTYFARYPGVREYMDRTRAQAADQGFVETIFG 831
Query: 482 RVL------------RIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWV 529
R L R ++ A++ +QG I++ A V G++ K V
Sbjct: 832 RRLYLPDINSNKPQERAAAERTAINAPMQGTAADIIKKAMVAVDNWLCASGLDAKVILQV 891
Query: 530 HDEWQTEVDEDQAQRL-GEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
HDE EV ED ++ EI+V + A KL+ P+ E +G NW E H
Sbjct: 892 HDELVLEVREDLVDQVREEIRVHMSEAA----KLDVPLLVEVGVGNNWDEAH 939
>sp|Q9RAA9|DPO1_RICFE DNA polymerase I (POL I)
emb|CAB56067.1| (AJ238763) DNA polymerase I [Rickettsia felis]
Length = 922
Score = 104 bits (260), Expect = 3e-021
Identities = 101/415 (24%), Positives = 181/415 (43%), Gaps = 45/415 (10%)
Query: 196 IAVPVKEVIIKRKKDG----SIYSNQL-----LEGCNVQGDYTKIAWEEFNLGSPAQVNK 246
I +P+ ++ K +K G + Y NQL E ++ + ++ +FN+GSP Q+ +
Sbjct: 523 IDLPICFILDKMEKIGIKVDANYLNQLSAEFGAEILKLEEEIFALSGTKFNIGSPKQLGE 582
Query: 247 RLDRLGWKPTVKTKS-GNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQEWLQK 305
L P K + +SY E L + + L W+ L + L K
Sbjct: 583 ILFEKMQLPFGKASAKASSYSTGAEILEKLSEHGYNIADLLLRWRQLTKLKNTYTDSLPK 642
Query: 306 SQE--TGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFK 363
+ T RVH + T T R + Q PN+ N+P + G + RQAF
Sbjct: 643 QIDNITHRVHTTFLQTSTTTGRLSSQEPNLQNVP------------IRSSEGNKIRQAFI 690
Query: 364 VPEGKLLVGTDAAGIQLRVLAHYMN-DPIYTEQVIDGDIHT------FNKEALGRYCKDR 416
EG L+ D + I+LR+L+H N D + + DIHT FN + + R
Sbjct: 691 AEEGYKLISADYSQIELRILSHIANIDALKQAFINKDDIHTQTACQIFNLQKHELTSEHR 750
Query: 417 PTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKSQASQ-AASMGW 475
AK + + G +A L +N +A+E + ++ +++ Q A+S G+
Sbjct: 751 RKAKAINFGIIYGISAFGLAKQLNVSNGEASEYIKKYFAEYKGVQEYMEQTKAFASSNGY 810
Query: 476 MKGLDGR----------VLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQ 525
+ GR L+ ++ A++ +QG I+++A + ++ +K + +
Sbjct: 811 VINFFGRKCFVPLIHDKKLKQFAERAAINAPIQGTNADIIKIAMINLDQEIEKNNLKTRL 870
Query: 526 CAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
+HDE EV ED+ + + I ++ I + T ++ P+ E ++G NW+E H
Sbjct: 871 VLQIHDELLFEVPEDEVELVTPI-IKKIMENST--NMDVPIITEIRVGNNWMEIH 922
>ref|NP_283286.1| (NC_003116) DNA polymerase I [Neisseria meningitidis Z2491]
pir||F81963 DNA-directed DNA polymerase (EC 2.7.7.7) I NMA0462 [imported] -
Neisseria meningitidis (group A strain Z2491)
emb|CAB83758.1| (AL162753) DNA polymerase I [Neisseria meningitidis Z2491]
Length = 938
Score = 103 bits (258), Expect = 4e-021
Identities = 149/641 (23%), Positives = 257/641 (39%), Gaps = 125/641 (19%)
Query: 6 DIET---DALDATVIHVLVAKRVGQKGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIPQI 62
D ET DA++A+++ + +A + G+ + V + T + +G + P +
Sbjct: 357 DTETTSLDAMNASLVGISIAFQAGEAVYIPVGHSLTAAPEQLDLQDVLGRLKPHLENPAL 416
Query: 63 KKLW-----------GYGIPLKDVA-DTLVMSRLLDPTRKGGHSLDALS----GNEKIDF 106
KK+ YGI L +A D ++ S +++ GH LD LS G E I +
Sbjct: 417 KKIGQNLKYDQHVFANYGIALNGIAGDAMLASYIIESHL--GHGLDELSERWLGLETITY 474
Query: 107 HDFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGFML 166
+ +++ DVAI Q E + +++LE ++A
Sbjct: 475 ESLCGKGAKQISFA--DVAIG-----QATEYAAQDADFALRLEAHLRA------------ 515
Query: 167 DTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLL------- 219
+++A+ E + +PV +V+ + +++G L
Sbjct: 516 -----------------QMDAKQLEMYEKMELPVAQVLFEMERNGVQIDRAELARQSAEL 558
Query: 220 --EGCNVQGDYTKIAWEEFNLGSPAQVNKRL-DRLG--WKPTVKTKSGNSYKICPENLAT 274
E ++ + A + FNL SP Q+ + L D++G K KT G N A
Sbjct: 559 GAELMKLEQEAYAAAGQPFNLNSPKQLQEILFDKMGIPTKGLKKTAKGG----ISTNEAV 614
Query: 275 IPDTAPE------AVKGLKAWKVLETRWKLAQEWLQKSQETGRVHGRVILTGAVTHRAAH 328
+ AP+ ++ K+ T E + S + GRVH A+T R A
Sbjct: 615 LEQLAPDYPLPKIILQNRSLAKLKSTYTDKLPEMI--SPKDGRVHTTYAQAVAITGRLAS 672
Query: 329 QGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMN 388
PN+ NIP + G R+AF P+G ++V D + I+LR++AH
Sbjct: 673 NNPNLQNIP------------IRTAEGRRVRRAFTAPQGSVIVSADYSQIELRIMAHLSG 720
Query: 389 DPIYTEQVIDG-DIH------TFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGC 441
D +G D+H F + R AKT + + G G +A LG
Sbjct: 721 DKTLIAAFQNGEDVHRRTAAEVFGIAPENVSSEQRRYAKTINFGLIYGMGQYGLAKSLGI 780
Query: 442 NNRQANEAMANFYEAIPSLKK-LKSQASQAASMGWMKGLDGRVL------------RIGS 488
+N A + ++ P + + ++ QAA+ G+++ L GR L R G+
Sbjct: 781 DNISAKNFIDRYFARYPGVAEYMQRTKEQAAAQGFVETLFGRRLYLPDIRNKNTNARAGA 840
Query: 489 DHLALSVYLQGGETVIMRLANVFWQRQAKK-------EGINFKQCAWVHDEWQTEVDEDQ 541
+ A++ +QG + +++ A + R + E + K VHDE EV E +
Sbjct: 841 ERAAINAPMQGTASDLIKRAMIDVSRWLSECEASPWDELLQSKLIMQVHDELVLEVVETE 900
Query: 542 AQRLGE--IQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
+ E Q+ A D G L+ P+ E +G+NW E H
Sbjct: 901 LDFVKEKLPQIMAKVDGGL---LDVPLVAEVGVGENWEEAH 938
>gb|AAG43148.1|AF063186_1 (AF063186) DNA polymerase I [Rhodococcus erythropolis]
Length = 920
Score = 103 bits (257), Expect = 6e-021
Identities = 105/423 (24%), Positives = 179/423 (41%), Gaps = 49/423 (11%)
Query: 188 EIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGC---------NVQGDYTKIAWEEFNL 238
E+ + I VPV V+ + +++G L+ N++ +IA E FN+
Sbjct: 517 ELVNILHNIEVPVARVLTQMEENGIELDLAFLDQLGGEFSNTMQNLENQIMEIAGESFNV 576
Query: 239 GSPAQVNKRL-DRLGWKPTVKTKSGNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWK 297
SP QV + L ++LG K KT +G Y L I + + + L++ +
Sbjct: 577 SSPKQVGEVLFEKLGLKGGKKTTTGQ-YSTSESVLEKIEHPIAQLILEYRGLSKLKSTYT 635
Query: 298 LAQEWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAE 357
Q + +T RVH T R + PN+ NIP + G +
Sbjct: 636 DGL-CKQANPDTHRVHTSYHQALTATGRLSSTDPNLQNIP------------IRAEIGRQ 682
Query: 358 CRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVIDG-DIHTFN-KEALGRYCKD 415
R+AF PEG++L+ D + I+LR++AH D + I G D+H E LG ++
Sbjct: 683 IRKAFVAPEGRVLLAADYSQIELRLMAHLSQDEALLDAFIHGQDVHRRTAAEVLGIPLEE 742
Query: 416 -----RPTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKSQASQA 470
R AK + L G + LG +++ + + ++ P + + Q
Sbjct: 743 VTNDQRRQAKAVNFGLLYGMSEFGLIRQLGFTRQESQDYIKQYFHRYPGIYDYMQRTRQV 802
Query: 471 A-SMGWMKGLDGR------------VLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAK 517
A G+++ L GR ++R ++ A++ LQG I+++A + +
Sbjct: 803 ALEQGFVETLLGRRLYTPDIDARNMMVRKAAERAAINAPLQGSAADIIKMAMIEVDKMLP 862
Query: 518 KEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWL 577
K+ K VHDE EVDED A L + ++ + +++ P+ E GKNW
Sbjct: 863 KD--QAKMLLQVHDELVFEVDEDIADELAPKLAEVMQ---SVLQISVPLVVEVGKGKNWD 917
Query: 578 ETH 580
E H
Sbjct: 918 EAH 920
>ref|NP_274975.1| (NC_003112) DNA polymerase I [Neisseria meningitidis MC58]
pir||G81020 DNA polymerase I NMB1982 [imported] - Neisseria meningitidis (group
B strain MD58)
gb|AAF42310.1| (AE002546) DNA polymerase I [Neisseria meningitidis MC58]
Length = 938
Score = 102 bits (254), Expect = 1e-020
Identities = 149/633 (23%), Positives = 259/633 (40%), Gaps = 109/633 (17%)
Query: 6 DIET---DALDATVIHVLVAKRVGQKGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIPQI 62
D ET DA++A+++ + +A + G+ + V + T + +G P +
Sbjct: 357 DTETTSLDAMNASLVGISIAFQAGEAVYIPVGHSLTAAPEQLDLQDVLGRLKPHLGNPAL 416
Query: 63 KKLW-----------GYGIPLKDVA-DTLVMSRLLDPTRKGGHSLDALS----GNEKIDF 106
KK+ YGI L +A D ++ S +++ GH LD LS G E I +
Sbjct: 417 KKIGQNLKYDQHVFANYGIALNGIAGDAMLASYIIESHL--GHGLDELSERWLGLETITY 474
Query: 107 HDFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGF-M 165
+ + + DVAI Q E + +++LE ++A + E++ +
Sbjct: 475 ESLCGKGAKQIGFA--DVAIG-----QATEYAAQDADFALRLEAHLRAQMDEKQLEMYEK 527
Query: 166 LDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCNVQ 225
++ +A+ ++ E N ++ + E A E++ K + Y+
Sbjct: 528 MELPVAQVLFEM---ERNGVQIDRAELARQSAELGAELM---KLEQEAYA---------- 571
Query: 226 GDYTKIAWEEFNLGSPAQVNKRL-DRLG--WKPTVKTKSGNSYKICPENLATIPDTAPE- 281
A + FNL SP Q+ + L D++G K KT G N A + AP+
Sbjct: 572 -----AAGQPFNLNSPKQLQEILFDKMGIPTKGLKKTAKGG----ISTNEAVLEQLAPDY 622
Query: 282 -----AVKGLKAWKVLETRWKLAQEWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANI 336
++ K+ T E + S + GRVH A+T R A PN+ NI
Sbjct: 623 PLPKIILQNRSLAKLKSTYTDKLPEMI--SPKDGRVHTTYAQAVAITGRLASNNPNLQNI 680
Query: 337 PSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQV 396
P + G + R+AF P+G ++V D + I+LR++AH D
Sbjct: 681 P------------IRTEEGRKVRRAFTAPQGSVIVSADYSQIELRIMAHLSGDKTLIAAF 728
Query: 397 IDG-DIH------TFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEA 449
+G D+H F + R AK+ + + G G +A LG +N A
Sbjct: 729 QNGEDVHRRTAAEVFGTAPENVSSEQRRYAKSINFGLIYGMGQYGLAKSLGIDNLSAKNF 788
Query: 450 MANFYEAIPSLKK-LKSQASQAASMGWMKGLDGRVL------------RIGSDHLALSVY 496
+ ++ P + + ++ QAA+ G+++ L GR L R G++ A++
Sbjct: 789 IDRYFARYPGVAEYMQRTKEQAAAQGYVETLFGRRLYLPDIRNKNANARAGAERAAINAP 848
Query: 497 LQGGETVIMRLANVFWQRQAKK-------EGINFKQCAWVHDEWQTEVDEDQAQRLGE-- 547
+QG + +++ A + R + E + K VHDE EV E + + E
Sbjct: 849 MQGTASDLIKRAMIDVSRWLSECEASPWDELLQSKLIMQVHDELVLEVVETELDFVKEKL 908
Query: 548 IQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
Q+ A D G L+ P+ E +G+NW E H
Sbjct: 909 PQIMAKVDGGL---LDVPLVAEVGVGENWEEAH 938
>sp|O52225|DPO1_THEFI DNA polymerase I, thermostable (TFI polymerase 1)
gb|AAC46079.1| (AF030320) thermostable DNA polymerase [Thermus filiformis]
Length = 833
Score = 101 bits (252), Expect = 2e-020
Identities = 107/431 (24%), Positives = 188/431 (42%), Gaps = 56/431 (12%)
Query: 174 IYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCNVQGDYTKIAW 233
+Y R +R+ A ++ V + VP+ E + S + +E ++G+ ++A
Sbjct: 429 LYQEVERPLSRVLAHMEARGVRLDVPLLEAL-------SFELEKEME--RLEGEVFRLAG 479
Query: 234 EEFNLGSPAQVNKRL-DRLGWKPTVKTKSGNSYKICPENLATIPDTAPEAVKGLKAWKVL 292
FNL S Q+ + L D LG P +T+ L + P V+ + ++ L
Sbjct: 480 HPFNLNSRDQLERVLFDELGLTPVGRTEKTGKRSTAQGALEALRGAHP-IVELILQYREL 538
Query: 293 ETRWKLAQEWLQK-----SQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGIL 347
KL +L TGR+H R T T R + PN+ NIP
Sbjct: 539 S---KLKSTYLDPLPRLVHPRTGRLHTRFNQTATATGRLSSSDPNLQNIP---------- 585
Query: 348 WKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVIDG-DIHT--- 403
+ G R+AF EG LL+ D + I+LRVLAH D +G DIHT
Sbjct: 586 --VRTPLGQRIRKAFVAEEGWLLLAADYSQIELRVLAHLSGDENLKRVFREGKDIHTETA 643
Query: 404 ---FNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSL 460
F + K R AKT + L G ++ LG + ++A + ++++ P +
Sbjct: 644 AWMFGLDPALVDPKMRRAAKTVNFGVLYGMSAHRLSQELGIDYKEAEAFIERYFQSFPKV 703
Query: 461 KK-LKSQASQAASMGWMKGLDG------------RVLRIGSDHLALSVYLQGGETVIMRL 507
+ ++ + + G+++ L G R +R ++ +A ++ +QG +M++
Sbjct: 704 RAWIERTLEEGRTRGYVETLFGRRRYVPDLASRVRSVREAAERMAFNMPVQGTAADLMKI 763
Query: 508 ANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTFFKLNCPMD 567
A V + K G + VHDE EV ED+A+ + + +++A + L+ P++
Sbjct: 764 AMVKLFPRLKPLGAHL--LLQVHDELVLEVPEDRAEEAKALVKEVMENA---YPLDVPLE 818
Query: 568 GEAKIGKNWLE 578
E +G++WLE
Sbjct: 819 VEVGVGRDWLE 829
>gb|AAF24859.2|AF050678_1 (AF050678) DNA polymerase I [Thermomicrobium roseum]
Length = 917
Score = 99.4 bits (246), Expect = 1e-019
Identities = 97/373 (26%), Positives = 168/373 (45%), Gaps = 34/373 (9%)
Query: 230 KIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKSGNSYKICPENLATIPDTAP--EAVKGL 286
++A FN+ SP Q++ L D LG +TK+G Y + E L + DT P EA+
Sbjct: 545 ELAGHPFNITSPQQLSTLLFDELGLPRGKRTKTG--YSVSQEVLENLRDTHPIVEAILEY 602
Query: 287 KAWKVLETRWKLAQEWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGI 346
+ L++ + A Q +TGRVH T A T R + PN+ NIP+ G+ G+
Sbjct: 603 RQLLKLKSTYVDALP-RQVHPQTGRVHTIFHQTVAATGRLSSSDPNLQNIPA--RGELGL 659
Query: 347 LWKMEGMYGAECRQAFKVP-EGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVIDG-DIHTF 404
+ + A+ R +++ E LL+ D + I+LR++AH+ DP +G DIH
Sbjct: 660 A--VRRAFIADNRPGYRIADEPILLLSADYSQIELRLMAHFSQDPALLRAFAEGKDIHAA 717
Query: 405 NKEALGRYCKDRPT------AKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIP 458
+ D T AK + + G +A G + + A + +++ P
Sbjct: 718 TASEVFGVPLDAVTPEMRRIAKVVNFGIMYGMQAYGLARDTGMSRQDAQRFIEAYFQRFP 777
Query: 459 SLKK-LKSQASQAASMGWMKGLDGR------------VLRIGSDHLALSVYLQGGETVIM 505
+ + L+ +AA +G+++ L GR R ++ +A+++ LQG IM
Sbjct: 778 GVARYLEETRRRAAELGYVETLFGRRRYLPEITSSNPARRQAAERMAVNMPLQGTAADIM 837
Query: 506 RLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTFFKLNCP 565
+L + R + G+ + VHDE EV E + E+ + + + P
Sbjct: 838 KLVMIAVHRPLGERGLRSRMLLQVHDELVLEVPESELATTTELVTTRM---SRVVEPSVP 894
Query: 566 MDGEAKIGKNWLE 578
++ +AK G NW +
Sbjct: 895 LEVDAKAGPNWAD 907
>pdb|1KFD| Dna Polymerase I (Klenow Fragment) (E.C.2.7.7.7) Complexed With
Dctp
pdb|1DPI| DNA Polymerase I (Klenow Fragment) (E.C.2.7.7.7) - dCMP Complex
Length = 605
Score = 97.8 bits (242), Expect = 3e-019
Identities = 147/630 (23%), Positives = 254/630 (39%), Gaps = 109/630 (17%)
Query: 5 FDIETDALD---ATVIHVLVAKRVGQKGFYVV--------------RDAETFKRLAK-QV 46
FD ETD+LD A ++ + A G + V R E K L + +
Sbjct: 31 FDTETDSLDNISANLVGLSFAIEPGVAAYIPVAHDYLDAPDQISRERALELLKPLLEDEK 90
Query: 47 TLWIGHNVIGFDIPQIKKLWGYGIPLKDVA-DTLVMSRLLDPTRKGGHSLDALSGNEKID 105
L +G N + +D + YGI L+ +A DT++ S +L+ G H +D+L+ E+
Sbjct: 91 ALKVGQN-LKYDRGILAN---YGIELRGIAFDTMLESYILNSVA-GRHDMDSLA--ERWL 143
Query: 106 FHDFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGFM 165
H T+ EE++ GK + I E+
Sbjct: 144 KHKTITF-----------------------EEIAGKGKNQLTFNQ----IALEEAGRYAA 176
Query: 166 LDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEG---- 221
D D+ +++ + + + + F I +P+ V+ + +++G ++L
Sbjct: 177 EDADVTLQLHLKMWPDLQKHKGPLNVFE-NIEMPLVPVLSRIERNGVKIDPKVLHNHSEE 235
Query: 222 -----CNVQGDYTKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKSG---NSYKICPENL 272
++ +IA EEFNL S Q+ L ++ G KP KT G S ++ E
Sbjct: 236 LTLRLAELEKKAHEIAGEEFNLSSTKQLQTILFEKQGIKPLKKTPGGAPSTSEEVLEELA 295
Query: 273 ATIPDTAPEAVKGLKAWKVLETRW--KLAQEWLQKSQETGRVHGRVILTGAVTHRAAHQG 330
P P+ + + L++ + KL L + +TGRVH T R +
Sbjct: 296 LDYP--LPKVILEYRGLAKLKSTYTDKLP---LMINPKTGRVHTSYHQAVTATGRLSSTD 350
Query: 331 PNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDP 390
PN+ NIP + G RQAF PE ++V D + I+LR++AH D
Sbjct: 351 PNLQNIP------------VRNEEGRRIRQAFIAPEDYVIVSADYSQIELRIMAHLSRDK 398
Query: 391 IYTEQVIDG-DIH------TFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNN 443
+G DIH F + R +AK + + G +A L
Sbjct: 399 GLLTAFAEGKDIHRATAAEVFGLPLETVTSEQRRSAKAINFGLIYGMSAFGLARQLNIPR 458
Query: 444 RQANEAMANFYEAIPS-LKKLKSQASQAASMGWMKGLDGRVL------------RIGSDH 490
++A + M ++E P L+ ++ +QA G+++ LDGR L R ++
Sbjct: 459 KEAQKYMDLYFERYPGVLEYMERTRAQAKEQGYVETLDGRRLYLPDIKSSNGARRAAAER 518
Query: 491 LALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQV 550
A++ +QG I++ A + + E + VHDE EV +D + + Q+
Sbjct: 519 AAINAPMQGTAADIIKRAMIAVDAWLQAEQPRVRMIMQVHDELVFEVHKDDVDAVAK-QI 577
Query: 551 QAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
+ + T +L+ P+ E G+NW + H
Sbjct: 578 HQLMENCT--RLDVPLLVEVGSGENWDQAH 605
>ref|NP_418300.1| (NC_000913) DNA polymerase I, 3' --> 5' polymerase, 5' --> 3' and
3' --> 5' exonuclease [Escherichia coli K12]
sp|P00582|DPO1_ECOLI DNA polymerase I (POL I)
pir||DJECI DNA-directed DNA polymerase (EC 2.7.7.7) I - Escherichia coli
emb|CAA23607.1| (V00317) DNA polymerase I [Escherichia coli]
gb|AAA24402.1| (J01663) DNA polymerase I [Escherichia coli]
gb|AAB02998.1| (L19201) DNA polymerase I [Escherichia coli]
gb|AAC76861.1| (AE000461) DNA polymerase I, 3' --> 5' polymerase, 5' --> 3' and 3'
--> 5' exonuclease [Escherichia coli K12]
Length = 928
Score = 97.8 bits (242), Expect = 3e-019
Identities = 147/630 (23%), Positives = 254/630 (39%), Gaps = 109/630 (17%)
Query: 5 FDIETDALD---ATVIHVLVAKRVGQKGFYVV--------------RDAETFKRLAK-QV 46
FD ETD+LD A ++ + A G + V R E K L + +
Sbjct: 354 FDTETDSLDNISANLVGLSFAIEPGVAAYIPVAHDYLDAPDQISRERALELLKPLLEDEK 413
Query: 47 TLWIGHNVIGFDIPQIKKLWGYGIPLKDVA-DTLVMSRLLDPTRKGGHSLDALSGNEKID 105
L +G N + +D + YGI L+ +A DT++ S +L+ G H +D+L+ E+
Sbjct: 414 ALKVGQN-LKYDRGILAN---YGIELRGIAFDTMLESYILNSVA-GRHDMDSLA--ERWL 466
Query: 106 FHDFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGFM 165
H T+ EE++ GK + I E+
Sbjct: 467 KHKTITF-----------------------EEIAGKGKNQLTFNQ----IALEEAGRYAA 499
Query: 166 LDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEG---- 221
D D+ +++ + + + + F I +P+ V+ + +++G ++L
Sbjct: 500 EDADVTLQLHLKMWPDLQKHKGPLNVFE-NIEMPLVPVLSRIERNGVKIDPKVLHNHSEE 558
Query: 222 -----CNVQGDYTKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKSG---NSYKICPENL 272
++ +IA EEFNL S Q+ L ++ G KP KT G S ++ E
Sbjct: 559 LTLRLAELEKKAHEIAGEEFNLSSTKQLQTILFEKQGIKPLKKTPGGAPSTSEEVLEELA 618
Query: 273 ATIPDTAPEAVKGLKAWKVLETRW--KLAQEWLQKSQETGRVHGRVILTGAVTHRAAHQG 330
P P+ + + L++ + KL L + +TGRVH T R +
Sbjct: 619 LDYP--LPKVILEYRGLAKLKSTYTDKLP---LMINPKTGRVHTSYHQAVTATGRLSSTD 673
Query: 331 PNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDP 390
PN+ NIP + G RQAF PE ++V D + I+LR++AH D
Sbjct: 674 PNLQNIP------------VRNEEGRRIRQAFIAPEDYVIVSADYSQIELRIMAHLSRDK 721
Query: 391 IYTEQVIDG-DIH------TFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNN 443
+G DIH F + R +AK + + G +A L
Sbjct: 722 GLLTAFAEGKDIHRATAAEVFGLPLETVTSEQRRSAKAINFGLIYGMSAFGLARQLNIPR 781
Query: 444 RQANEAMANFYEAIPS-LKKLKSQASQAASMGWMKGLDGRVL------------RIGSDH 490
++A + M ++E P L+ ++ +QA G+++ LDGR L R ++
Sbjct: 782 KEAQKYMDLYFERYPGVLEYMERTRAQAKEQGYVETLDGRRLYLPDIKSSNGARRAAAER 841
Query: 491 LALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQV 550
A++ +QG I++ A + + E + VHDE EV +D + + Q+
Sbjct: 842 AAINAPMQGTAADIIKRAMIAVDAWLQAEQPRVRMIMQVHDELVFEVHKDDVDAVAK-QI 900
Query: 551 QAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
+ + T +L+ P+ E G+NW + H
Sbjct: 901 HQLMENCT--RLDVPLLVEVGSGENWDQAH 928
>ref|NP_290488.1| (NC_002655) DNA polymerase I, 3' --> 5' polymerase, 5' --> 3' and
3' --> 5' exonuclease [Escherichia coli O157:H7 EDL933]
ref|NP_312813.1| (NC_002695) DNA polymerase I [Escherichia coli O157:H7]
gb|AAG59052.1|AE005617_1 (AE005617) DNA polymerase I, 3' --> 5' polymerase, 5' --> 3' and 3'
--> 5' exonuclease [Escherichia coli O157:H7 EDL933]
dbj|BAB38209.1| (AP002567) DNA polymerase I [Escherichia coli O157:H7]
Length = 928
Score = 97.4 bits (241), Expect = 4e-019
Identities = 147/630 (23%), Positives = 253/630 (39%), Gaps = 109/630 (17%)
Query: 5 FDIETDALD---ATVIHVLVAKRVGQKGFYVV--------------RDAETFKRLAK-QV 46
FD ETD+LD A ++ + A G + V R E K L + +
Sbjct: 354 FDTETDSLDNISANLVGLSFAIEPGVAAYIPVAHDYLDAPDQISRERALELLKPLLEDEK 413
Query: 47 TLWIGHNVIGFDIPQIKKLWGYGIPLKDVA-DTLVMSRLLDPTRKGGHSLDALSGNEKID 105
L +G N + +D + YGI L+ +A DT++ S +L+ G H +D+L+ E+
Sbjct: 414 ALKVGQN-LKYDRGILAN---YGIELRGIAFDTMLESYILNSVA-GRHDMDSLA--ERWL 466
Query: 106 FHDFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGFM 165
H T+ EE++ GK + I E+
Sbjct: 467 KHKTITF-----------------------EEIAGKGKNQLTFNQ----IALEEAGRYAA 499
Query: 166 LDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEG---- 221
D D+ +++ + + + + F I +P+ V+ + +++G ++L
Sbjct: 500 EDADVTLQLHLKMWPDLQKHKGPLNVFE-NIEMPLVPVLSRIERNGVKIDPKVLHNHSEE 558
Query: 222 -----CNVQGDYTKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKSG---NSYKICPENL 272
++ +IA EEFNL S Q+ L ++ G KP KT G S ++ E
Sbjct: 559 LTLRLAELEKKAHEIAGEEFNLSSTKQLQTILFEKQGIKPLKKTPGGAPSTSEEVLEELA 618
Query: 273 ATIPDTAPEAVKGLKAWKVLETRW--KLAQEWLQKSQETGRVHGRVILTGAVTHRAAHQG 330
P P+ + + L++ + KL L + +TGRVH T R +
Sbjct: 619 LDYP--LPKVILEYRGLAKLKSTYTDKLP---LMINPKTGRVHTSYHQAVTATGRLSSTD 673
Query: 331 PNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDP 390
PN+ NIP + G RQAF PE ++V D + I+LR++AH D
Sbjct: 674 PNLQNIP------------VRNEEGRRIRQAFIAPEDYVIVSADYSQIELRIMAHLSRDK 721
Query: 391 IYTEQVIDG-DIH------TFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNN 443
+G DIH F + R +AK + + G +A L
Sbjct: 722 GLLTAFAEGKDIHRATAAEVFGLPLETVTSEQRRSAKAINFGLIYGMSAFGLARQLNIPR 781
Query: 444 RQANEAMANFYEAIPS-LKKLKSQASQAASMGWMKGLDGRVL------------RIGSDH 490
++A + M ++E P L+ ++ +QA G+++ LDGR L R ++
Sbjct: 782 KEAQKYMDLYFERYPGVLEYMERTRAQAKEQGYVETLDGRRLYLPDIKSSNGARRAAAER 841
Query: 491 LALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQV 550
A++ +QG I++ A + + E + VHDE EV D + + Q+
Sbjct: 842 AAINAPMQGTAADIIKRAMIAVDAWLQAEQPRVRMIMQVHDELVFEVHRDDVDAVAK-QI 900
Query: 551 QAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
+ + T +L+ P+ E G+NW + H
Sbjct: 901 HQLMENCT--RLDVPLLVEVGSGENWDQAH 928
>emb|CAB56093.1| (AJ238761) DNA polymerase I [Rickettsia rhipicephali]
Length = 875
Score = 97.1 bits (240), Expect = 5e-019
Identities = 99/415 (23%), Positives = 176/415 (41%), Gaps = 45/415 (10%)
Query: 196 IAVPVKEVIIKRKKDG----SIYSNQL-----LEGCNVQGDYTKIAWEEFNLGSPAQVNK 246
I +P+ ++ K +K G + Y QL E ++ + ++ +FN+GSP Q+ +
Sbjct: 476 IDLPICFILDKMEKVGITVDANYLKQLSAEFGTEILKLEEEIFALSGTKFNIGSPKQLGE 535
Query: 247 RLDRLGWKPTVKTKS-GNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQEWLQK 305
L P K + +SY E L + + L W+ L + L K
Sbjct: 536 ILFEKMQLPFGKASAKASSYSTGAEILEKLSEHGYNIADLLLRWRQLTKLKNTYTDSLPK 595
Query: 306 SQE--TGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFK 363
+ T RVH + T T R + Q PN+ NIP + G + RQAF
Sbjct: 596 QIDNITHRVHTTFLQTSTTTGRLSSQEPNLQNIP------------IRSSEGNQIRQAFI 643
Query: 364 VPEGKLLVGTDAAGIQLRVLAHYMN-DPIYTEQVIDGDIHT------FNKEALGRYCKDR 416
EG L+ D + I+LR+L+H N D + + DIHT FN + + R
Sbjct: 644 AEEGYKLISADYSQIELRILSHIANIDALKQAFINKDDIHTQTACQIFNLQKHELTSEHR 703
Query: 417 PTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKSQASQ-AASMGW 475
AK + + G +A L N +A+E + ++ +++ Q A+S G+
Sbjct: 704 RKAKAINFGIIYGISAFGLAKQLNATNGEASEYIKKYFAEYKGVQEYMEQTKAFASSNGY 763
Query: 476 MKGLDGR----------VLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQ 525
+ GR L+ ++ A++ +QG I+++A + ++ +K + +
Sbjct: 764 VTNFFGRKCFVPLIHDKKLKQFAERAAINAPIQGTNADIIKIAMINLDQEIEKRKLKTRL 823
Query: 526 CAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
+HDE E D+ + + I ++ I + T ++ P+ E + G NW+E H
Sbjct: 824 VLQIHDELLFEAPIDEVEIITPI-IKKIMENST--NMDVPIITEIRAGNNWMEIH 875
>ref|NP_458049.1| (NC_003198) DNA polymerase I [Salmonella enterica subsp. enterica
serovar Typhi]
emb|CAD03100.1| (AL627280) DNA polymerase I [Salmonella enterica subsp. enterica
serovar Typhi]
Length = 928
Score = 95.1 bits (235), Expect = 2e-018
Identities = 129/561 (22%), Positives = 224/561 (38%), Gaps = 105/561 (18%)
Query: 65 LWGYGIPLKDVA-DTLVMSRLLDPTRKGGHSLDALSG----NEKIDFHDFSTYTPEMLA- 118
L YGI L+ +A DT++ S +L+ G H +D+LS ++ I F D + L
Sbjct: 428 LQNYGIELRGIAFDTMLESYILNSVA-GRHDMDSLSDRWLKHKTITFEDIAGKGKNQLTF 486
Query: 119 ----------YCKQDVAINEKVYLQLKEELSNF-GKASI--QLEHQMQAIVCEQEKNGFM 165
Y +D + +++L++ EL G ++ +E + ++ E+NG
Sbjct: 487 NQIALEEAGRYAAEDADVTLQLHLKMWPELQQHKGPLNVFENIEMPLVPVLSRVERNGVK 546
Query: 166 LDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCNVQ 225
+D + + + I + E + K+ D
Sbjct: 547 IDPAVLH-------KHSEEITLRLAE------------LEKKAHD--------------- 572
Query: 226 GDYTKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKSG---NSYKICPENLATIPDTAPE 281
IA E FNL S Q+ L ++ G KP KT G S ++ E P P+
Sbjct: 573 -----IAGEAFNLSSTKQLQTILFEKQGIKPLKKTPGGAPSTSEEVLEELALDYP--LPK 625
Query: 282 AVKGLKAWKVLETRW--KLAQEWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSV 339
+ + L++ + KL L + +TGRVH T R + PN+ NIP
Sbjct: 626 VILEYRGLAKLKSTYTDKLP---LMINPKTGRVHTSYHQAVTATGRLSSTDPNLQNIP-- 680
Query: 340 PHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVIDG 399
+ G RQAF PE L+V D + I+LR++AH D +G
Sbjct: 681 ----------VRNEEGRRIRQAFIAPEDYLIVSADYSQIELRIMAHLSRDKGLLTAFAEG 730
Query: 400 -DIH------TFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMAN 452
DIH F + R +AK + + G ++ L ++A + M
Sbjct: 731 KDIHRATAAEVFGLPLDSVTGEQRRSAKAINFGLIYGMSAFGLSRQLNIPRKEAQKYMDL 790
Query: 453 FYEAIPS-LKKLKSQASQAASMGWMKGLDGRVL------------RIGSDHLALSVYLQG 499
++E P L+ ++ +QA G+++ L+GR L R G++ A++ +QG
Sbjct: 791 YFERYPGVLEYMERTRAQAKEQGYVETLEGRRLYLPDIKSSNAARRAGAERAAINAPMQG 850
Query: 500 GETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTF 559
I++ A + + E + VHDE EV +D + + Q +++
Sbjct: 851 TAADIIKRAMIAVDAWLQAEQPRVRMIMQVHDELVFEVHKDDLDAVAKRIHQLMENC--- 907
Query: 560 FKLNCPMDGEAKIGKNWLETH 580
+++ P+ E G+NW + H
Sbjct: 908 TRIDVPLLVEVGSGENWDQAH 928
>ref|NP_268299.1| (NC_002662) DNA polymerase I (EC 2.7.7.7) [Lactococcus lactis
subsp. lactis]
sp|Q9CDS1|DPO1_LACLA DNA polymerase I (POL I)
gb|AAK06240.1|AE006442_7 (AE006442) DNA polymerase I (EC 2.7.7.7) [Lactococcus lactis subsp.
lactis]
Length = 877
Score = 93.6 bits (231), Expect = 6e-018
Identities = 98/429 (22%), Positives = 181/429 (41%), Gaps = 56/429 (13%)
Query: 185 IEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCNVQGDYT---------KIAWEE 235
+E E + + + +P+ EV+ K + G S LE + + +A EE
Sbjct: 468 LENEQEHLLSDMELPLAEVLAKMEITGIAVSQNTLEEIGAENEEKLASLTREIYDLAGEE 527
Query: 236 FNLGSPAQVNKRL-DRLGWKPTVKTKSGNSYKICPENLATIPDTAPEAVKGLKAWKVLET 294
FN+ SP Q+ L ++L KTK+G Y + L + +P K L+ ++ +
Sbjct: 528 FNINSPKQLGVILFEKLQLPVGKKTKTG--YSTAVDVLEDLAALSPVVAKILEYRQINKV 585
Query: 295 RWKLAQEWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMY 354
+ + + + + G++H R + T R + PN+ NIP ++E
Sbjct: 586 QSTYVKGLIPQIADDGKIHTRYVQDLTQTGRLSSVDPNLQNIPV----------RLEE-- 633
Query: 355 GAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVID-----GDIHTFN---- 405
G + R+AF + LL+ +D + I+LRVLAH D E +ID DIHT
Sbjct: 634 GRKIRKAFVPSKDSLLLSSDYSQIELRVLAHISGD----EHLIDAFKHGADIHTSTAMRV 689
Query: 406 ---KEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKK 462
++A DR AK + + G +A LG + A + ++E P +K
Sbjct: 690 FGIEKAEDVTANDRRNAKAVNFGVVYGISDFGLARNLGITRKDAKNYIETYFERYPGIKT 749
Query: 463 -LKSQASQAASMGWMKGLDGRVLRIG------------SDHLALSVYLQGGETVIMRLAN 509
+++ +A G+++ + R +I ++ A++ +QG I+++A
Sbjct: 750 YMENIVREARDKGFVETMSHRRRKIPDINARNFNVRGFAERTAINSPIQGSAADILKIAM 809
Query: 510 VFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGE 569
+ + K VHDE +V ++ + + Q +++A +L P+ +
Sbjct: 810 INLDKALSARDFKSKLLLQVHDEIILDVPLEELDEIKVLVKQTMEEA---IELAVPLKVD 866
Query: 570 AKIGKNWLE 578
GK W E
Sbjct: 867 ENTGKTWYE 875
>ref|NP_229767.1| (NC_002505) DNA polymerase I [Vibrio cholerae]
pir||B82364 DNA polymerase I VC0108 [imported] - Vibrio cholerae (group O1
strain N16961)
gb|AAF93286.1| (AE004101) DNA polymerase I [Vibrio cholerae]
Length = 934
Score = 91.3 bits (225), Expect = 3e-017
Identities = 109/434 (25%), Positives = 181/434 (41%), Gaps = 62/434 (14%)
Query: 186 EAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCN---------VQGDYTKIAWEEF 236
+A++++ I +P+ V+ + ++ G + + LL + ++ + ++A + F
Sbjct: 524 DAKLEQVYREIEMPLVPVLSRIERTGVMIDDMLLSAQSQEIALRLDQLEQNAYELAGQPF 583
Query: 237 NLGSPAQVNKRLDRLGWKPTV-KTKSGNSYKICPENLATIPDTAPEAVKGLKAWKVL-ET 294
NL SP Q+ L P + KT SG +T + E KVL E
Sbjct: 584 NLSSPKQLQTILFEQMKLPVLQKTPSGTP--------STNEEVLQELALDYPLPKVLIEY 635
Query: 295 RW--KLAQEWLQK-----SQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGIL 347
R KL + K + TGRVH T R + PN+ NIP
Sbjct: 636 RGLAKLKSTYTDKLPKMINPSTGRVHTSYHQAVTATGRLSSTDPNLQNIP---------- 685
Query: 348 WKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVIDG-DIHTFN- 405
+ G RQAF P G ++ D + I+LR++AH D + DG DIH
Sbjct: 686 --VRNEEGRRIRQAFVAPHGWKIMAVDYSQIELRIMAHLSGDQALLDAFRDGKDIHAATA 743
Query: 406 KEALG-----RYCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPS- 459
E +G + R AK + + G +A LG +A E M ++E P
Sbjct: 744 AEIIGVPIDQVSSEQRRRAKAVNFGLIYGMSAFGLAKQLGIPRGEAQEYMDKYFERYPGV 803
Query: 460 LKKLKSQASQAASMGWMKGLDGRVL------------RIGSDHLALSVYLQGGETVIMRL 507
++ ++ S+AA +G+++ + GR L R ++ A++ +QG I++
Sbjct: 804 MQYMEDTRSRAAQLGYVETIFGRRLHLPEITSRNAMRRKAAERAAINAPMQGTAADIIKK 863
Query: 508 ANVFWQRQAKKEGI-NFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTFFKLNCPM 566
A + ++EG K VHDE EV E + E +VQ + ++ +L P+
Sbjct: 864 AMLLVDEWIEREGDGRVKLLMQVHDELVFEVKESSLSEI-ESKVQQLMESAA--ELAVPL 920
Query: 567 DGEAKIGKNWLETH 580
EA G NW + H
Sbjct: 921 VAEAGHGDNWEQAH 934
>ref|NP_644409.1| (NC_003919) DNA polymerase I [Xanthomonas axonopodis pv. citri str.
306]
gb|AAM38945.1| (AE012059) DNA polymerase I [Xanthomonas axonopodis pv. citri str.
306]
Length = 934
Score = 91.3 bits (225), Expect = 3e-017
Identities = 101/385 (26%), Positives = 154/385 (39%), Gaps = 60/385 (15%)
Query: 229 TKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKSGNSYKICPENLATIPDTAPEAVKGLK 287
T++A FNL SP Q+ L D L VKT G P T EA++ +
Sbjct: 577 TELAGRTFNLDSPKQLQALLFDELKLPAVVKTPKGQ------------PSTNEEALEAIA 624
Query: 288 AWK-----VLETRW--KLAQEWLQKSQE-----TGRVHGRVILTGAVTHRAAHQGPNMAN 335
+LE R KL + K E +GRVH GA T R + PN+ N
Sbjct: 625 DQHELPRVILEYRGLTKLRSTYTDKLPEMIHPQSGRVHTSYHQAGAATGRLSSSDPNLQN 684
Query: 336 IPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQ 395
IP + G R+AF P G+ L+ D + I+LR++AH DP
Sbjct: 685 IP------------IRTEDGRRIRRAFVAPPGRKLIACDYSQIELRIMAHLSGDPGLVGA 732
Query: 396 VIDG-DIHTFN-KEALGR-----YCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANE 448
G D+H E GR +R AK + + G +A LG +A +
Sbjct: 733 FESGADVHRATAAEVFGRTIDTVSADERRAAKAINFGLMYGMSAFGLARQLGIGRGEAQD 792
Query: 449 AMANFYEAIPSLKK-LKSQASQAASMGWMKGLDGRVL------------RIGSDHLALSV 495
+A ++ P ++ +++ QA G+++ + GR L R G++ A++
Sbjct: 793 YIALYFSRYPGVRDFMETTRQQARDKGYVETVFGRRLYLDFINAGSQGQRAGAERAAINA 852
Query: 496 YLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAIKD 555
+QG I++ A V K VHDE E D D L + +
Sbjct: 853 PMQGTAADIIKRAMVKVDGWIADHAQRAKMILQVHDELVFEADIDFVDTL--LSEVTTRM 910
Query: 556 AGTFFKLNCPMDGEAKIGKNWLETH 580
AG +L P+ ++ +G NW + H
Sbjct: 911 AGA-AELRVPLVVDSGVGDNWDQAH 934
>ref|NP_295430.1| (NC_001263) DNA-directed DNA polymerase [Deinococcus radiodurans]
sp|P52027|DPO1_DEIRA DNA polymerase I (POL I)
pir||E75363 DNA-directed DNA polymerase - Deinococcus radiodurans (strain R1)
gb|AAF11264.1|AE002012_4 (AE002012) DNA-directed DNA polymerase [Deinococcus radiodurans]
Length = 956
Score = 91.3 bits (225), Expect = 3e-017
Identities = 94/383 (24%), Positives = 163/383 (42%), Gaps = 47/383 (12%)
Query: 223 NVQGDYTKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKSGNSYKICPENLATIPDTAPE 281
+++ + A EEF++ SP Q+ L D+L + KTK L + D P
Sbjct: 594 DLESQIHEYAGEEFHIRSPKQLETVLYDKLELASSKKTKLTGQRSTAVSALEPLRDAHP- 652
Query: 282 AVKGLKAWKVLETRWKLAQEWLQK-----SQETGRVHGRVILTGAVTHRAAHQGPNMANI 336
+ + ++ L+ KL +L + TGR+H T T R + PN+ NI
Sbjct: 653 IIPLVLEFRELD---KLRGTYLDPIPNLVNPHTGRLHTTFAQTAVATGRLSSLNPNLQNI 709
Query: 337 PSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQV 396
P + G E R+ F +G L+ D + I+LR+LAH +DP+ +
Sbjct: 710 P------------IRSELGREIRKGFIAEDGFTLIAADYSQIELRLLAHIADDPLMQQAF 757
Query: 397 IDG-DIHTFN-KEALGR-----YCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEA 449
++G DIH + LG R AKT + L G +++ LG +A
Sbjct: 758 VEGADIHRRTAAQVLGLDEATVDANQRRAAKTVNFGVLYGMSAHRLSNDLGIPYAEAATF 817
Query: 450 MANFYEAIPSLKK-LKSQASQAASMGWMKGLDG------------RVLRIGSDHLALSVY 496
+ ++ P +++ + + G+++ L G RV R + LA ++
Sbjct: 818 IEIYFATYPGIRRYINHTLDFGRTHGYVETLYGRRRYVPGLSSRNRVQREAEERLAYNMP 877
Query: 497 LQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAIKDA 556
+QG IM+LA V Q + + I + VHDE E D+A+++ + + +++
Sbjct: 878 IQGTAADIMKLAMV--QLDPQLDAIGARMLLQVHDELLIEAPLDKAEQVAALTKKVMENV 935
Query: 557 GTFFKLNCPMDGEAKIGKNWLET 579
+L P+ E G NW +T
Sbjct: 936 ---VQLKVPLAVEVGTGPNWFDT 955
>pdb|5KTQ|A Chain A, Large Fragment Of Taq Dna Polymerase Bound To Dctp
pdb|1KTQ| Dna Polymerase
Length = 543
Score = 90.1 bits (222), Expect = 7e-017
Identities = 128/581 (22%), Positives = 224/581 (38%), Gaps = 111/581 (19%)
Query: 28 KGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIPQIKKLWGYGIPLKDVADTLVMSRLLDP 87
+G V R E +K L L ++ D+ + G G+P D D ++++ LLDP
Sbjct: 39 RGGRVHRAPEPYKALRD---LKEARGLLAKDLSVLALREGLGLPPGD--DPMLLAYLLDP 93
Query: 88 TRKGGHSLDALSGNEKIDFHDFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASI- 146
+ + G E +T E + A++E+++ L L G+ +
Sbjct: 94 SNTTPEGVARRYGGE---------WTEE----AGERAALSERLFANLWGRLE--GEERLL 138
Query: 147 ----QLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKE 202
++E + A++ E G LD + E R+EAE+
Sbjct: 139 WLYREVERPLSAVLAHMEATGVRLDVAYLRALSLEVAEEIARLEAEV------------- 185
Query: 203 VIIKRKKDGSIYSNQLLEGCNVQGDYTKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKS 261
++A FNL S Q+ + L D LG KT+
Sbjct: 186 --------------------------FRLAGHPFNLNSRDQLERVLFDELGLPAIGKTEK 219
Query: 262 GNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQEWLQKSQE-----TGRVHGRV 316
L + + P K L+ ++ KL ++ + TGR+H R
Sbjct: 220 TGKRSTSAAVLEALREAHPIVEKILQYRELT----KLKSTYIDPLPDLIHPRTGRLHTRF 275
Query: 317 ILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAA 376
T T R + PN+ NIP + G R+AF EG LLV D +
Sbjct: 276 NQTATATGRLSSSDPNLQNIP------------VRTPLGQRIRRAFIAEEGWLLVALDYS 323
Query: 377 GIQLRVLAHYMNDPIYTEQVIDG-DIHTFNKE---ALGRYCKD---RPTAKTFIYAFLLG 429
I+LRVLAH D +G DIHT + R D R AKT + L G
Sbjct: 324 QIELRVLAHLSGDENLIRVFQEGRDIHTETASWMFGVPREAVDPLMRRAAKTINFGVLYG 383
Query: 430 AGTGMIASILGCNNRQANEAMANFYEAIPSLKK-LKSQASQAASMGWMKGLDGR------ 482
++ L +A + ++++ P ++ ++ + G+++ L GR
Sbjct: 384 MSAHRLSQELAIPYEEAQAFIERYFQSFPKVRAWIEKTLEEGRRRGYVETLFGRRRYVPD 443
Query: 483 ------VLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTE 536
+R ++ +A ++ +QG +M+LA V + + E + + VHDE E
Sbjct: 444 LEARVKSVREAAERMAFNMPVQGTAADLMKLAMV--KLFPRLEEMGARMLLQVHDELVLE 501
Query: 537 VDEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWL 577
+++A+ + + + ++ + L P++ E IG++WL
Sbjct: 502 APKERAEAVARLAKEVMEGV---YPLAVPLEVEVGIGEDWL 539
>sp|P19821|DPO1_THEAQ DNA polymerase I, thermostable (Taq polymerase 1)
pir||A33530 DNA-directed DNA polymerase (EC 2.7.7.7) I - Thermus aquaticus
pdb|1BGX|T Chain T, Taq Polymerase In Complex With Tp7, An Inhibitory Fab
pdb|1CMW|A Chain A, Crystal Structure Of Taq Dna-Polymerase Shows A New
Orientation For The Structure-Specific Nuclease Domain
gb|AAA27507.1| (J04639) DNA polymerase [Thermus aquaticus]
Length = 832
Score = 90.1 bits (222), Expect = 7e-017
Identities = 128/581 (22%), Positives = 224/581 (38%), Gaps = 111/581 (19%)
Query: 28 KGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIPQIKKLWGYGIPLKDVADTLVMSRLLDP 87
+G V R E +K L L ++ D+ + G G+P D D ++++ LLDP
Sbjct: 328 RGGRVHRAPEPYKALRD---LKEARGLLAKDLSVLALREGLGLPPGD--DPMLLAYLLDP 382
Query: 88 TRKGGHSLDALSGNEKIDFHDFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASI- 146
+ + G E +T E + A++E+++ L L G+ +
Sbjct: 383 SNTTPEGVARRYGGE---------WTEE----AGERAALSERLFANLWGRLE--GEERLL 427
Query: 147 ----QLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKE 202
++E + A++ E G LD + E R+EAE+
Sbjct: 428 WLYREVERPLSAVLAHMEATGVRLDVAYLRALSLEVAEEIARLEAEV------------- 474
Query: 203 VIIKRKKDGSIYSNQLLEGCNVQGDYTKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKS 261
++A FNL S Q+ + L D LG KT+
Sbjct: 475 --------------------------FRLAGHPFNLNSRDQLERVLFDELGLPAIGKTEK 508
Query: 262 GNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQEWLQKSQE-----TGRVHGRV 316
L + + P K L+ ++ KL ++ + TGR+H R
Sbjct: 509 TGKRSTSAAVLEALREAHPIVEKILQYRELT----KLKSTYIDPLPDLIHPRTGRLHTRF 564
Query: 317 ILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAA 376
T T R + PN+ NIP + G R+AF EG LLV D +
Sbjct: 565 NQTATATGRLSSSDPNLQNIP------------VRTPLGQRIRRAFIAEEGWLLVALDYS 612
Query: 377 GIQLRVLAHYMNDPIYTEQVIDG-DIHTFNKE---ALGRYCKD---RPTAKTFIYAFLLG 429
I+LRVLAH D +G DIHT + R D R AKT + L G
Sbjct: 613 QIELRVLAHLSGDENLIRVFQEGRDIHTETASWMFGVPREAVDPLMRRAAKTINFGVLYG 672
Query: 430 AGTGMIASILGCNNRQANEAMANFYEAIPSLKK-LKSQASQAASMGWMKGLDGR------ 482
++ L +A + ++++ P ++ ++ + G+++ L GR
Sbjct: 673 MSAHRLSQELAIPYEEAQAFIERYFQSFPKVRAWIEKTLEEGRRRGYVETLFGRRRYVPD 732
Query: 483 ------VLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTE 536
+R ++ +A ++ +QG +M+LA V + + E + + VHDE E
Sbjct: 733 LEARVKSVREAAERMAFNMPVQGTAADLMKLAMV--KLFPRLEEMGARMLLQVHDELVLE 790
Query: 537 VDEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWL 577
+++A+ + + + ++ + L P++ E IG++WL
Sbjct: 791 APKERAEAVARLAKEVMEGV---YPLAVPLEVEVGIGEDWL 828
>pdb|4KTQ|A Chain A, Binary Complex Of The Large Fragment Of Dna Polymerase I
From T. Aquaticus Bound To A PrimerTEMPLATE DNA
Length = 539
Score = 90.1 bits (222), Expect = 7e-017
Identities = 128/581 (22%), Positives = 224/581 (38%), Gaps = 111/581 (19%)
Query: 28 KGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIPQIKKLWGYGIPLKDVADTLVMSRLLDP 87
+G V R E +K L L ++ D+ + G G+P D D ++++ LLDP
Sbjct: 35 RGGRVHRAPEPYKALRD---LKEARGLLAKDLSVLALREGLGLPPGD--DPMLLAYLLDP 89
Query: 88 TRKGGHSLDALSGNEKIDFHDFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASI- 146
+ + G E +T E + A++E+++ L L G+ +
Sbjct: 90 SNTTPEGVARRYGGE---------WTEE----AGERAALSERLFANLWGRLE--GEERLL 134
Query: 147 ----QLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKE 202
++E + A++ E G LD + E R+EAE+
Sbjct: 135 WLYREVERPLSAVLAHMEATGVRLDVAYLRALSLEVAEEIARLEAEV------------- 181
Query: 203 VIIKRKKDGSIYSNQLLEGCNVQGDYTKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKS 261
++A FNL S Q+ + L D LG KT+
Sbjct: 182 --------------------------FRLAGHPFNLNSRDQLERVLFDELGLPAIGKTEK 215
Query: 262 GNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQEWLQKSQE-----TGRVHGRV 316
L + + P K L+ ++ KL ++ + TGR+H R
Sbjct: 216 TGKRSTSAAVLEALREAHPIVEKILQYRELT----KLKSTYIDPLPDLIHPRTGRLHTRF 271
Query: 317 ILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAA 376
T T R + PN+ NIP + G R+AF EG LLV D +
Sbjct: 272 NQTATATGRLSSSDPNLQNIP------------VRTPLGQRIRRAFIAEEGWLLVALDYS 319
Query: 377 GIQLRVLAHYMNDPIYTEQVIDG-DIHTFNKE---ALGRYCKD---RPTAKTFIYAFLLG 429
I+LRVLAH D +G DIHT + R D R AKT + L G
Sbjct: 320 QIELRVLAHLSGDENLIRVFQEGRDIHTETASWMFGVPREAVDPLMRRAAKTINFGVLYG 379
Query: 430 AGTGMIASILGCNNRQANEAMANFYEAIPSLKK-LKSQASQAASMGWMKGLDGR------ 482
++ L +A + ++++ P ++ ++ + G+++ L GR
Sbjct: 380 MSAHRLSQELAIPYEEAQAFIERYFQSFPKVRAWIEKTLEEGRRRGYVETLFGRRRYVPD 439
Query: 483 ------VLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTE 536
+R ++ +A ++ +QG +M+LA V + + E + + VHDE E
Sbjct: 440 LEARVKSVREAAERMAFNMPVQGTAADLMKLAMV--KLFPRLEEMGARMLLQVHDELVLE 497
Query: 537 VDEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWL 577
+++A+ + + + ++ + L P++ E IG++WL
Sbjct: 498 APKERAEAVARLAKEVMEGV---YPLAVPLEVEVGIGEDWL 535
>ref|NP_372214.1| (NC_002758) DNA polymerase I [Staphylococcus aureus subsp. aureus
Mu50]
ref|NP_374801.1| (NC_002745) DNA polymerase I [Staphylococcus aureus subsp. aureus
N315]
dbj|BAB42780.1| (AP003134) DNA polymerase I [Staphylococcus aureus subsp. aureus
N315]
dbj|BAB57852.1| (AP003363) DNA polymerase I [Staphylococcus aureus subsp. aureus
Mu50]
Length = 876
Score = 86.7 bits (213), Expect = 7e-016
Identities = 127/547 (23%), Positives = 224/547 (40%), Gaps = 94/547 (17%)
Query: 69 GIPLKDVA-DTLVMSRLLDPTR--KGGHSLDALSGNEKIDFHDFSTY-------TPEMLA 118
GI +++++ D ++ S ++DP+R S+ +L G + D S Y PE
Sbjct: 385 GIDIQNISFDIMLASYIIDPSRTISDVQSVVSLYGQSFVK-DDVSIYGKGKKFKVPEDDV 443
Query: 119 YCKQDVAINEKVYL---QLKEELSNFGKASI--QLEHQMQAIVCEQEKNGFMLDTDIAEE 173
+I + +Y + ++L + + + LE + I+ E E+ G D EE
Sbjct: 444 LNPYVASITDAIYFAKPNMDKQLEEYNQVELLADLELPLAKILSEMEEIGIFTDVHDLEE 503
Query: 174 IYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCNVQGDYTKIAW 233
+ E EI+E + +V+I+ D A
Sbjct: 504 M-----------EKEIQEKL--------DVLIRNIHDA--------------------AG 524
Query: 234 EEFNLGSPAQVNKRLDRLGWKPTVKTKSGNSYKICPENLATIPDTAPEAVKGLKAWKVLE 293
E+FN+ SP Q+ L P +K K+ Y + L + P + + ++ L
Sbjct: 525 EDFNINSPKQLGVVLFETLQLPVIK-KTKTGYSTAVDVLEQLQGEHP-IIDYILEYRQLS 582
Query: 294 TRWKLAQEWLQKS-QETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEG 352
E LQK + R+H R T A T R + PN+ NIP ++E
Sbjct: 583 KLQSTYVEGLQKVISDDQRIHTRFNQTLAQTGRLSSVDPNLQNIPV----------RLEE 632
Query: 353 MYGAECRQAFK-VPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVIDGD-IHT------F 404
G + R+AFK + +++ D + I+LRVLAH D E I+GD IHT F
Sbjct: 633 --GRKIRKAFKPTSKDSVILSADYSQIELRVLAHITQDESMKEAFINGDDIHTATAMKVF 690
Query: 405 NKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLK 464
EA R AK + + G ++ LG ++A + ++ + P +K+
Sbjct: 691 GVEADQVDSLMRRQAKAVNFGIVYGISDYGLSQSLGITRKKAKAFIDDYLASFPGVKQYM 750
Query: 465 SQ-ASQAASMGWMKGLDGRV------------LRIGSDHLALSVYLQGGETVIMRLANVF 511
S A ++G+++ L R LR ++ A++ +QG I++LA V
Sbjct: 751 SDIVKDAKALGYVETLLHRRRYIPDITSRNFNLRGFAERTAMNTPIQGSAADIIKLAMVK 810
Query: 512 WQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGEAK 571
+ ++ K+ K VHDE EV + + E + +++A +L+ P+ ++
Sbjct: 811 FAQKMKETTYQAKLLLQVHDELIFEVPKSEVDSFSEFVEEIMENA---LQLDVPLKVDSS 867
Query: 572 IGKNWLE 578
G W +
Sbjct: 868 YGATWYD 874
>ref|NP_188690.1| (NM_112946) DNA polymerase, putative; protein id: At3g20540.1
[Arabidopsis thaliana]
Length = 1160
Score = 45.1 bits (105), Expect = 0.002
Identities = 63/292 (21%), Positives = 117/292 (39%), Gaps = 36/292 (12%)
Query: 306 SQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVP 365
S + GRVH + + T R + + PN+ N P++ + I RQAF
Sbjct: 750 SGKDGRVHCSLNIN-TETGRLSARRPNLQNQPALEKDRYKI------------RQAFIAS 796
Query: 366 EGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVI-DGDIHTFNKEALGRYCK--------DR 416
G L+ D ++LR+LAH + E I GD H ++ A+ Y +R
Sbjct: 797 PGNSLIVADYGQLELRILAHLASCESMKEAFIAGGDFH--SRTAMNMYPHIREAVENGER 854
Query: 417 PTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKS-QASQAASMGW 475
AK ++ G ++ + +A + + +Y ++K + + +A G+
Sbjct: 855 RKAKMLNFSIAYGKTAIGLSRDWKVSREEAQDTVNLWYNDRQEVRKWQELRKKEAIQKGY 914
Query: 476 MKGLDGRV-------LRIGSDHL---ALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQ 525
+ L GR R +H+ A++ +QG + A + + + + +K
Sbjct: 915 VLTLLGRARKFPEYRSRAQKNHIERAAINTPVQGSAADVAMCAMLEISNNQRLKELGWKL 974
Query: 526 CAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTFFK-LNCPMDGEAKIGKNW 576
VHDE E + A+ +I V + + L+ + +AK +NW
Sbjct: 975 LLQVHDEVILEGPSESAENAKDIVVNCMSEPFNGKNILSVDLSVDAKCAQNW 1026
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.319 0.136 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 386,849,377
Number of Sequences: 1038431
Number of extensions: 16688377
Number of successful extensions: 35433
Number of sequences better than 5.0e-02: 134
Number of HSP's better than 0.1 without gapping: 10
Number of HSP's successfully gapped in prelim test: 124
Number of HSP's that attempted gapping in prelim test: 34985
Number of HSP's gapped (non-prelim): 203
length of query: 580
length of database: 328,470,697
effective HSP length: 127
effective length of query: 453
effective length of database: 196,589,960
effective search space: 89055251880
effective search space used: 89055251880
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 94 (40.8 bits)
Query= orf25 [5343-5536]
(192 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064754.1| (NC_002519) gp16 [Roseophage SIO1] >gi|994... 385 e-106
>ref|NP_064754.1| (NC_002519) gp16 [Roseophage SIO1]
gb|AAG02599.1|AF189021_18 (AF189021) Roseophage SIO1 complete genome
Length = 192
Score = 385 bits (988), Expect = e-106
Identities = 192/192 (100%), Positives = 192/192 (100%)
Query: 1 MYNNQTDQRHKEINMADKKIVLKNVEVSWAKLLEAGLKYKSETEYEFSVAVKANDQLRDL 60
MYNNQTDQRHKEINMADKKIVLKNVEVSWAKLLEAGLKYKSETEYEFSVAVKANDQLRDL
Sbjct: 1 MYNNQTDQRHKEINMADKKIVLKNVEVSWAKLLEAGLKYKSETEYEFSVAVKANDQLRDL 60
Query: 61 MKSFKLNKQFKSKDSTFDGEEFIQLTLDTRTKSGWVRHGEVFDEFGDPTKDLIGNGSKMN 120
MKSFKLNKQFKSKDSTFDGEEFIQLTLDTRTKSGWVRHGEVFDEFGDPTKDLIGNGSKMN
Sbjct: 61 MKSFKLNKQFKSKDSTFDGEEFIQLTLDTRTKSGWVRHGEVFDEFGDPTKDLIGNGSKMN 120
Query: 121 LFVSIGQSSYGNIIKLGHLEDMDMESKEMMFHFGQVMELVPFEAPSAVIKKEKEINESVE 180
LFVSIGQSSYGNIIKLGHLEDMDMESKEMMFHFGQVMELVPFEAPSAVIKKEKEINESVE
Sbjct: 121 LFVSIGQSSYGNIIKLGHLEDMDMESKEMMFHFGQVMELVPFEAPSAVIKKEKEINESVE 180
Query: 181 AMAEDEMEIPFG 192
AMAEDEMEIPFG
Sbjct: 181 AMAEDEMEIPFG 192
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.314 0.132 5.82e+180
Gapped
Lambda K H
0.267 0.0410 6.01e-154
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 118,190,795
Number of Sequences: 1038431
Number of extensions: 4733546
Number of successful extensions: 9020
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 9019
Number of HSP's gapped (non-prelim): 1
length of query: 192
length of database: 328,470,697
effective HSP length: 115
effective length of query: 77
effective length of database: 209,051,132
effective search space: 16096937164
effective search space used: 16096937164
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 88 (38.5 bits)
Query= orf26 [5533-5680]
(146 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.316 0.137 6.01e-154
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 103,184,742
Number of Sequences: 1038431
Number of extensions: 4397852
Number of successful extensions: 8859
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 8859
Number of HSP's gapped (non-prelim): 0
length of query: 146
length of database: 328,470,697
effective HSP length: 122
effective length of query: 24
effective length of database: 201,782,115
effective search space: 4842770760
effective search space used: 4842770760
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 83 (36.6 bits)
Query= orf27 [5678-5966]
(287 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064755.1| (NC_002519) gp17 [Roseophage SIO1] >gi|994... 583 e-166
gb|AAL02215.1| (AF405604) DNA polymerase [uncultured organism] 129 5e-029
gb|AAL02210.1| (AF405599) DNA polymerase [uncultured organism] 126 2e-028
gb|AAL02212.1| (AF405601) DNA polymerase [uncultured organism] 126 2e-028
gb|AAL02211.1| (AF405600) DNA polymerase [uncultured organism] 126 3e-028
gb|AAL02214.1| (AF405603) DNA polymerase [uncultured organism] 125 7e-028
>ref|NP_064755.1| (NC_002519) gp17 [Roseophage SIO1]
gb|AAG02600.1|AF189021_19 (AF189021) Roseophage SIO1 complete genome
Length = 287
Score = 583 bits (1504), Expect = e-166
Identities = 287/287 (100%), Positives = 287/287 (100%)
Query: 1 MTEETKSIDTLIDDIYSVFTDGYSKSPDNEKLIDAFGEAMKGLMRSRLTPRESSGGTLRL 60
MTEETKSIDTLIDDIYSVFTDGYSKSPDNEKLIDAFGEAMKGLMRSRLTPRESSGGTLRL
Sbjct: 1 MTEETKSIDTLIDDIYSVFTDGYSKSPDNEKLIDAFGEAMKGLMRSRLTPRESSGGTLRL 60
Query: 61 SAIGKPARQLWYDSRGVEKPDFTGDQLLKFFYGDVIEEVLLTLAKLSGHSVTNEQQKVVV 120
SAIGKPARQLWYDSRGVEKPDFTGDQLLKFFYGDVIEEVLLTLAKLSGHSVTNEQQKVVV
Sbjct: 61 SAIGKPARQLWYDSRGVEKPDFTGDQLLKFFYGDVIEEVLLTLAKLSGHSVTNEQQKVVV 120
Query: 121 AGITGHMDAVIDGHVIDVKSASATAFKKFDQGSLVFDDPFGYMHQIAAYSEAVEGNKGSG 180
AGITGHMDAVIDGHVIDVKSASATAFKKFDQGSLVFDDPFGYMHQIAAYSEAVEGNKGSG
Sbjct: 121 AGITGHMDAVIDGHVIDVKSASATAFKKFDQGSLVFDDPFGYMHQIAAYSEAVEGNKGSG 180
Query: 181 FLAMNKVDGKLTLFQPDPDFLPDTQERVDYLKEALASDTPPERCYEEVTETNGNKKLPMG 240
FLAMNKVDGKLTLFQPDPDFLPDTQERVDYLKEALASDTPPERCYEEVTETNGNKKLPMG
Sbjct: 181 FLAMNKVDGKLTLFQPDPDFLPDTQERVDYLKEALASDTPPERCYEEVTETNGNKKLPMG 240
Query: 241 CAFCSFKKECWKDANDGSGLRGFGYSFGTVYLTHVEKAPRVGEVDVE 287
CAFCSFKKECWKDANDGSGLRGFGYSFGTVYLTHVEKAPRVGEVDVE
Sbjct: 241 CAFCSFKKECWKDANDGSGLRGFGYSFGTVYLTHVEKAPRVGEVDVE 287
>gb|AAL02215.1| (AF405604) DNA polymerase [uncultured organism]
Length = 182
Score = 129 bits (323), Expect = 5e-029
Identities = 73/161 (45%), Positives = 102/161 (63%), Gaps = 4/161 (2%)
Query: 37 GEAMKGLMRSRLTPRESSGGT-LRLSAIGKP-ARQLWYDSRGVEKPD-FTGDQLLKFFYG 93
G + MR +L +++ LR+S IGKP +R LWYD G E+ + FT LKF G
Sbjct: 2 GATVLDAMRKQLWVSQAAPNPKLRMSNIGKPCSRALWYDINGDEQAESFTPQTKLKFIVG 61
Query: 94 DVIEEVLLTLAKLSGHSVTNEQQKVVVAGITGHMDAVIDGHVIDVKSASATAFKKFDQGS 153
D++E +L+ LAK +GHSVT Q ++ + I GH+D +IDG ++DVKSASA + KKF G+
Sbjct: 62 DIVEAMLIYLAKEAGHSVTEMQAEIEIDDIKGHIDCMIDGELVDVKSASAFSMKKFKNGT 121
Query: 154 LVFDDPFGYMHQIAAYSEAVEGNKGSGFLAMNKVDGKLTLF 194
L DD FGY+ QI+ Y+ A G K FLA +K DG+L +
Sbjct: 122 LPDDDAFGYISQISGYANAF-GKKSGTFLAFDKSDGELATY 161
>gb|AAL02210.1| (AF405599) DNA polymerase [uncultured organism]
Length = 182
Score = 126 bits (317), Expect = 2e-028
Identities = 72/161 (44%), Positives = 101/161 (62%), Gaps = 4/161 (2%)
Query: 37 GEAMKGLMRSRLTPRESSGGT-LRLSAIGKP-ARQLWYDSRGVEKPD-FTGDQLLKFFYG 93
G + MR +L +++ LR+S IGKP +R LWYD G E+ + FT LKF G
Sbjct: 2 GATVLDAMRKQLWVSQAAPNPKLRMSNIGKPCSRALWYDINGDEQAESFTPQTKLKFIVG 61
Query: 94 DVIEEVLLTLAKLSGHSVTNEQQKVVVAGITGHMDAVIDGHVIDVKSASATAFKKFDQGS 153
D++E +L+ LAK +GHSVT Q ++ + I GH+D +IDG ++DVKSASA + KKF G+
Sbjct: 62 DIVEAMLIYLAKEAGHSVTEMQSEIEIDDIKGHIDCMIDGELVDVKSASAFSMKKFKNGT 121
Query: 154 LVFDDPFGYMHQIAAYSEAVEGNKGSGFLAMNKVDGKLTLF 194
L DD FGY+ QI+ Y+ A G K FLA +K G+L +
Sbjct: 122 LPDDDAFGYISQISGYANAF-GKKSGTFLAFDKSGGELATY 161
>gb|AAL02212.1| (AF405601) DNA polymerase [uncultured organism]
Length = 182
Score = 126 bits (317), Expect = 2e-028
Identities = 72/161 (44%), Positives = 102/161 (62%), Gaps = 4/161 (2%)
Query: 37 GEAMKGLMRSRLTPRESSGGT-LRLSAIGKP-ARQLWYDSRGVEKPD-FTGDQLLKFFYG 93
G + MR +L +++ LR+S IGKP +R LWYD G E+ + FT LKF G
Sbjct: 2 GATVLDAMRKQLWVSQAAPNPKLRMSNIGKPCSRALWYDINGDEQAESFTPQTKLKFIVG 61
Query: 94 DVIEEVLLTLAKLSGHSVTNEQQKVVVAGITGHMDAVIDGHVIDVKSASATAFKKFDQGS 153
D++E +L+ LAK +GHSVT Q ++ + I GH+D +IDG ++DVKSASA + KKF G+
Sbjct: 62 DIVEAMLIYLAKEAGHSVTEMQAEIEIDDIKGHIDCMIDGGLVDVKSASAFSMKKFKNGT 121
Query: 154 LVFDDPFGYMHQIAAYSEAVEGNKGSGFLAMNKVDGKLTLF 194
L+ DD FGY+ QI+ Y+ A G K FLA +K G+L +
Sbjct: 122 LLDDDAFGYISQISGYANAF-GKKSGTFLAFDKSGGELATY 161
>gb|AAL02211.1| (AF405600) DNA polymerase [uncultured organism]
Length = 182
Score = 126 bits (316), Expect = 3e-028
Identities = 72/161 (44%), Positives = 101/161 (62%), Gaps = 4/161 (2%)
Query: 37 GEAMKGLMRSRLTPRESSGGT-LRLSAIGKP-ARQLWYDSRGVEKPD-FTGDQLLKFFYG 93
G + MR +L +++ LR+S IGKP +R LWYD G E+ + FT LKF G
Sbjct: 2 GATVLDAMRKQLWVSQAAPNPKLRMSNIGKPCSRALWYDINGDEQAESFTPQTKLKFIVG 61
Query: 94 DVIEEVLLTLAKLSGHSVTNEQQKVVVAGITGHMDAVIDGHVIDVKSASATAFKKFDQGS 153
D++E +L+ LAK +GHSVT Q ++ + I GH+D +IDG ++DVKSASA + KKF G+
Sbjct: 62 DIVEAMLIYLAKEAGHSVTEMQAEIEIDDIKGHIDCMIDGELVDVKSASAFSMKKFKNGT 121
Query: 154 LVFDDPFGYMHQIAAYSEAVEGNKGSGFLAMNKVDGKLTLF 194
L DD FGY+ QI+ Y+ A G K FLA +K G+L +
Sbjct: 122 LPDDDAFGYISQISGYANAF-GKKSGTFLAFDKSGGELATY 161
>gb|AAL02214.1| (AF405603) DNA polymerase [uncultured organism]
Length = 182
Score = 125 bits (313), Expect = 7e-028
Identities = 71/161 (44%), Positives = 100/161 (62%), Gaps = 4/161 (2%)
Query: 37 GEAMKGLMRSRLTPRESSGGT-LRLSAIGKP-ARQLWYDSRGVEKPD-FTGDQLLKFFYG 93
G + MR +L +++ LR+S IGKP +R LWYD G E+ + FT LKF G
Sbjct: 2 GATVLDAMRKQLWVSQAAPNPKLRMSNIGKPCSRALWYDINGDEQAESFTPQTKLKFIVG 61
Query: 94 DVIEEVLLTLAKLSGHSVTNEQQKVVVAGITGHMDAVIDGHVIDVKSASATAFKKFDQGS 153
D++E +L+ LAK +GHSVT Q ++ + I GH+D +IDG ++DVKS SA + KKF G+
Sbjct: 62 DIVEAMLIYLAKEAGHSVTEMQSEIEIDDIKGHIDCMIDGELVDVKSTSAFSMKKFKNGT 121
Query: 154 LVFDDPFGYMHQIAAYSEAVEGNKGSGFLAMNKVDGKLTLF 194
L DD FGY+ QI+ Y+ A G K FLA +K G+L +
Sbjct: 122 LPDDDAFGYISQISGYANAF-GKKSGTFLAFDKSGGELATY 161
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.316 0.136 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 200,666,134
Number of Sequences: 1038431
Number of extensions: 8962822
Number of successful extensions: 19565
Number of sequences better than 5.0e-02: 7
Number of HSP's better than 0.1 without gapping: 7
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 19551
Number of HSP's gapped (non-prelim): 7
length of query: 287
length of database: 328,470,697
effective HSP length: 120
effective length of query: 167
effective length of database: 203,858,977
effective search space: 34044449159
effective search space used: 34044449159
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 91 (39.7 bits)
Query= orf28 [5962-6097]
(134 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064756.1| (NC_002519) RP Endonuclease I [Roseophage ... 275 5e-074
ref|NP_052083.1| (NC_001271) endonuclease [Bacteriophage ph... 128 1e-029
pdb|1M0D|A Chain A, Crystal Structure Of Bacteriophage T7 E... 125 6e-029
ref|NP_041972.1| (NC_001604) gene 3, endonuclease [Enteroba... 125 6e-029
pdb|1FZR|B Chain B, Crystal Structure Of Bacteriophage T7 E... 124 2e-028
>ref|NP_064756.1| (NC_002519) RP Endonuclease I [Roseophage SIO1]
gb|AAG02601.1|AF189021_20 (AF189021) Roseophage SIO1 complete genome
Length = 134
Score = 275 bits (704), Expect = 5e-074
Identities = 134/134 (100%), Positives = 134/134 (100%)
Query: 1 MLNSKSSTRKRALKAGYRSGLEEQTAKDLKKRKVLFTYEETKIKWLDSKVRTYTPDFVLP 60
MLNSKSSTRKRALKAGYRSGLEEQTAKDLKKRKVLFTYEETKIKWLDSKVRTYTPDFVLP
Sbjct: 1 MLNSKSSTRKRALKAGYRSGLEEQTAKDLKKRKVLFTYEETKIKWLDSKVRTYTPDFVLP 60
Query: 61 NGVIIETKGRFVAADRRKHLEIQKQFGTLYDIRFVFTNSKAKLYKGAKSSYADWCNKHGF 120
NGVIIETKGRFVAADRRKHLEIQKQFGTLYDIRFVFTNSKAKLYKGAKSSYADWCNKHGF
Sbjct: 61 NGVIIETKGRFVAADRRKHLEIQKQFGTLYDIRFVFTNSKAKLYKGAKSSYADWCNKHGF 120
Query: 121 LYADKTIPEDWLNE 134
LYADKTIPEDWLNE
Sbjct: 121 LYADKTIPEDWLNE 134
>ref|NP_052083.1| (NC_001271) endonuclease [Bacteriophage phiYeO3-12]
emb|CAB63604.1| (AJ251805) endonuclease [Bacteriophage phiYeO3-12]
Length = 153
Score = 128 bits (321), Expect = 1e-029
Identities = 61/119 (51%), Positives = 88/119 (73%), Gaps = 2/119 (1%)
Query: 17 YRSGLEEQTAKDLKKRKVLFTYEETKIKW-LDSKVRTYTPDFVLPNGVIIETKGRFVAAD 75
+RSGLE++ +K L+ + + F YE +I + + + YTPDF+LPNG+ IETKG + + D
Sbjct: 16 FRSGLEDKVSKQLESKGIKFDYELWRIPYVIPASDHLYTPDFLLPNGIFIETKGLWDSDD 75
Query: 76 RRKHLEIQKQFGTLYDIRFVFTNSKAKLYKGAKSSYADWCNKHGFLYADKTIPEDWLNE 134
R+KHL I++Q L DIR VF++S++KLYKG+ +SYA+WC KHG L+ADK IP +WL E
Sbjct: 76 RKKHLLIREQHPEL-DIRLVFSSSRSKLYKGSPTSYAEWCEKHGILFADKLIPVEWLKE 133
>pdb|1M0D|A Chain A, Crystal Structure Of Bacteriophage T7 Endonuclease I With
A Wild-Type Active Site And Bound Manganese Ions
pdb|1M0D|B Chain B, Crystal Structure Of Bacteriophage T7 Endonuclease I With
A Wild-Type Active Site And Bound Manganese Ions
pdb|1M0D|C Chain C, Crystal Structure Of Bacteriophage T7 Endonuclease I With
A Wild-Type Active Site And Bound Manganese Ions
pdb|1M0D|D Chain D, Crystal Structure Of Bacteriophage T7 Endonuclease I With
A Wild-Type Active Site And Bound Manganese Ions
Length = 138
Score = 125 bits (315), Expect = 6e-029
Identities = 58/119 (48%), Positives = 87/119 (72%), Gaps = 2/119 (1%)
Query: 17 YRSGLEEQTAKDLKKRKVLFTYEETKIKW-LDSKVRTYTPDFVLPNGVIIETKGRFVAAD 75
+RSGLE++ +K L+ + + F YEE K+ + + + TYTPDF+LPNG+ +ETKG + + D
Sbjct: 4 FRSGLEDKVSKQLESKGIKFEYEEWKVPYVIPASNHTYTPDFLLPNGIFVETKGLWESDD 63
Query: 76 RRKHLEIQKQFGTLYDIRFVFTNSKAKLYKGAKSSYADWCNKHGFLYADKTIPEDWLNE 134
R+KHL I++Q L DIR VF++S+ KLYKG+ +SY ++C KHG +ADK IP +W+ E
Sbjct: 64 RKKHLLIREQHPEL-DIRIVFSSSRTKLYKGSPTSYGEFCEKHGIKFADKLIPAEWIKE 121
>ref|NP_041972.1| (NC_001604) gene 3, endonuclease [Enterobacteria phage T7]
sp|P00641|ENRN_BPT7 ENDODEOXYRIBONUCLEASE I (ENDONUCLEASE)
pir||NEBP37 endodeoxyribonuclease I (EC 3.1.21.-) - phage T7
emb|CAA24345.1| (V01127) 3 protein [Enterobacteria phage T7]
emb|CAA24402.1| (V01146) gene 3, endonuclease [Enterobacteria phage T7]
Length = 149
Score = 125 bits (315), Expect = 6e-029
Identities = 58/119 (48%), Positives = 87/119 (72%), Gaps = 2/119 (1%)
Query: 17 YRSGLEEQTAKDLKKRKVLFTYEETKIKW-LDSKVRTYTPDFVLPNGVIIETKGRFVAAD 75
+RSGLE++ +K L+ + + F YEE K+ + + + TYTPDF+LPNG+ +ETKG + + D
Sbjct: 15 FRSGLEDKVSKQLESKGIKFEYEEWKVPYVIPASNHTYTPDFLLPNGIFVETKGLWESDD 74
Query: 76 RRKHLEIQKQFGTLYDIRFVFTNSKAKLYKGAKSSYADWCNKHGFLYADKTIPEDWLNE 134
R+KHL I++Q L DIR VF++S+ KLYKG+ +SY ++C KHG +ADK IP +W+ E
Sbjct: 75 RKKHLLIREQHPEL-DIRIVFSSSRTKLYKGSPTSYGEFCEKHGIKFADKLIPAEWIKE 132
>pdb|1FZR|B Chain B, Crystal Structure Of Bacteriophage T7 Endonuclease I
pdb|1FZR|D Chain D, Crystal Structure Of Bacteriophage T7 Endonuclease I
pdb|1FZR|A Chain A, Crystal Structure Of Bacteriophage T7 Endonuclease I
pdb|1FZR|C Chain C, Crystal Structure Of Bacteriophage T7 Endonuclease I
Length = 138
Score = 124 bits (311), Expect = 2e-028
Identities = 57/119 (47%), Positives = 87/119 (72%), Gaps = 2/119 (1%)
Query: 17 YRSGLEEQTAKDLKKRKVLFTYEETKIKW-LDSKVRTYTPDFVLPNGVIIETKGRFVAAD 75
+RSGLE++ +K L+ + + F YEE K+ + + + TYTPDF+LPNG+ ++TKG + + D
Sbjct: 4 FRSGLEDKVSKQLESKGIKFEYEEWKVPYVIPASNHTYTPDFLLPNGIFVKTKGLWESDD 63
Query: 76 RRKHLEIQKQFGTLYDIRFVFTNSKAKLYKGAKSSYADWCNKHGFLYADKTIPEDWLNE 134
R+KHL I++Q L DIR VF++S+ KLYKG+ +SY ++C KHG +ADK IP +W+ E
Sbjct: 64 RKKHLLIREQHPEL-DIRIVFSSSRTKLYKGSPTSYGEFCEKHGIKFADKLIPAEWIKE 121
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.318 0.134 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 85,180,841
Number of Sequences: 1038431
Number of extensions: 3146254
Number of successful extensions: 7983
Number of sequences better than 5.0e-02: 7
Number of HSP's better than 0.1 without gapping: 7
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 7965
Number of HSP's gapped (non-prelim): 7
length of query: 134
length of database: 328,470,697
effective HSP length: 110
effective length of query: 24
effective length of database: 214,243,287
effective search space: 5141838888
effective search space used: 5141838888
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 84 (37.0 bits)
Query= orf29 [6094-6156]
(61 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.320 0.143 4.71e+034
Gapped
Lambda K H
0.267 0.0410 6.01e-154
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 35,940,828
Number of Sequences: 1038431
Number of extensions: 1064577
Number of successful extensions: 2691
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2691
Number of HSP's gapped (non-prelim): 0
length of query: 61
length of database: 328,470,697
effective HSP length: 37
effective length of query: 24
effective length of database: 290,048,750
effective search space: 6961170000
effective search space used: 6961170000
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 85 (37.4 bits)
Query= orf30 [6154-6256]
(101 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064757.1| (NC_002519) gp19 [Roseophage SIO1] >gi|994... 152 6e-037
ref|NP_039701.1| (NC_001335) Hypothetical protein [Mycobact... 103 4e-022
ref|NP_212894.1| (NC_001318) B. burgdorferi predicted codin... 83 7e-016
ref|NP_267982.1| (NC_002662) HYPOTHETICAL PROTEIN [Lactococ... 79 8e-015
ref|NP_203440.1| (NC_003085) hypothetical protein [Bacterio... 59 1e-008
ref|NP_465673.1| (NC_003210) similar to unknown proteins [L... 55 2e-007
ref|NP_559109.1| (NC_003364) hypothetical protein [Pyrobacu... 37 0.044
>ref|NP_064757.1| (NC_002519) gp19 [Roseophage SIO1]
gb|AAG02602.1|AF189021_21 (AF189021) Roseophage SIO1 complete genome
Length = 77
Score = 152 bits (385), Expect = 6e-037
Identities = 77/77 (100%), Positives = 77/77 (100%)
Query: 25 MGLANEAGEVLGKVKKIIRDGTFNRDDIADELGDVLWYAAALARDLNTDLSAIAQRNLDK 84
MGLANEAGEVLGKVKKIIRDGTFNRDDIADELGDVLWYAAALARDLNTDLSAIAQRNLDK
Sbjct: 1 MGLANEAGEVLGKVKKIIRDGTFNRDDIADELGDVLWYAAALARDLNTDLSAIAQRNLDK 60
Query: 85 LASRKERGTLKGSGDKR 101
LASRKERGTLKGSGDKR
Sbjct: 61 LASRKERGTLKGSGDKR 77
>ref|NP_039701.1| (NC_001335) Hypothetical protein [Mycobacterium phage L5]
sp|Q05247|VG37_BPML5 GENE 37 PROTEIN (GP37)
pir||S30982 gene 37 protein - Mycobacterium phage L5
emb|CAA79413.1| (Z18946) predicted 12.5Kd protein [Mycobacterium phage L5]
Length = 114
Score = 103 bits (257), Expect = 4e-022
Identities = 57/109 (52%), Positives = 70/109 (63%), Gaps = 11/109 (10%)
Query: 4 YQQQAVETAIYPSTAQ------VTYPAMGLANEAGEVLGKVKKIIRD--GTF---NRDDI 52
YQ A +TAIYP ++Y MGL EAGE+ KVKKI+RD G NRDD+
Sbjct: 6 YQIAASDTAIYPGAGDCDSIEGLSYVTMGLVGEAGEIANKVKKILRDKDGVITQENRDDL 65
Query: 53 ADELGDVLWYAAALARDLNTDLSAIAQRNLDKLASRKERGTLKGSGDKR 101
ELGDV+WY A LA L+ LS + QRNL+KL SRK+RG ++GSGD R
Sbjct: 66 QAELGDVMWYVAQLANQLDVWLSTVTQRNLNKLQSRKDRGVIQGSGDNR 114
>ref|NP_212894.1| (NC_001318) B. burgdorferi predicted coding region BB0760 [Borrelia
burgdorferi]
pir||G70194 hypothetical protein BB0760 - Lyme disease spirochete
gb|AAC67101.1| (AE001175) B. burgdorferi predicted coding region BB0760 [Borrelia
burgdorferi]
Length = 124
Score = 82.8 bits (203), Expect = 7e-016
Identities = 45/107 (42%), Positives = 65/107 (60%), Gaps = 6/107 (5%)
Query: 1 MNDYQQQAVETAIYPSTAQ-VTYPAMGLANEAGEVLGKVKKIIRDGTFNRDD-----IAD 54
+N+YQ++A +TA Y + + + +GLA E GEV+ K+KK+ RD + DD I
Sbjct: 18 LNEYQEKAKKTAKYKNKKEELILTTLGLAGETGEVVEKIKKLGRDKNYIIDDEYLISIKK 77
Query: 55 ELGDVLWYAAALARDLNTDLSAIAQRNLDKLASRKERGTLKGSGDKR 101
ELGDVLWY ++L+ +L L +A NL K+ R E GT+ G GD R
Sbjct: 78 ELGDVLWYLSSLSNNLGITLEDVALTNLKKIQKRHENGTINGEGDDR 124
>ref|NP_267982.1| (NC_002662) HYPOTHETICAL PROTEIN [Lactococcus lactis subsp. lactis]
gb|AAK05923.1|AE006412_10 (AE006412) HYPOTHETICAL PROTEIN [Lactococcus lactis subsp. lactis]
Length = 115
Score = 79.3 bits (194), Expect = 8e-015
Identities = 43/82 (52%), Positives = 53/82 (64%), Gaps = 5/82 (6%)
Query: 25 MGLANEAGEVLGKVKKIIRD--GTFNRDD---IADELGDVLWYAAALARDLNTDLSAIAQ 79
+GL+ EAGEV KVKK+IRD G + +D I ELGD+LWY A AR L L IA
Sbjct: 34 LGLSGEAGEVADKVKKVIRDQKGKLSANDKKAIGQELGDILWYVATSARYLGISLEQIAS 93
Query: 80 RNLDKLASRKERGTLKGSGDKR 101
N++KL SR RG + GSGD+R
Sbjct: 94 ENIEKLESRLSRGKISGSGDER 115
>ref|NP_203440.1| (NC_003085) hypothetical protein [Bacteriophage Mx8]
gb|AAK94361.1|AF396866_26 (AF396866) p26 [Bacteriophage Mx8]
Length = 124
Score = 58.5 bits (140), Expect = 1e-008
Identities = 36/96 (37%), Positives = 50/96 (51%), Gaps = 7/96 (7%)
Query: 4 YQQQAVETAIYPSTAQ------VTYPAMGLANEAGEVLGKVKKIIRDG-TFNRDDIADEL 56
YQQ A+ TA ++ + +GL EAGEV VKK + G + + + EL
Sbjct: 11 YQQAALRTANIGGPSEPFNHLKLESDCLGLTGEAGEVADHVKKFVGHGHDLDVEKVKKEL 70
Query: 57 GDVLWYAAALARDLNTDLSAIAQRNLDKLASRKERG 92
GDVLWY A +A L+ DL +A N++KL R G
Sbjct: 71 GDVLWYVAVIAARLDLDLGDVAAANVEKLRKRYPDG 106
>ref|NP_465673.1| (NC_003210) similar to unknown proteins [Listeria monocytogenes
EGD-e]
emb|CAD00227.1| (AL591982) similar to unknown proteins [Listeria monocytogenes]
Length = 105
Score = 55.1 bits (131), Expect = 2e-007
Identities = 31/91 (34%), Positives = 49/91 (53%), Gaps = 2/91 (2%)
Query: 3 DYQQQAVETAIYPSTAQVTYPAMGLANEAGEVLGKVKKIIRDG-TFNRDDIADELGDVLW 61
+YQ A TA A Y +G+ EAGEV +KK G +++ + ELGDVLW
Sbjct: 5 EYQILANRTAATHEQALTNY-GLGITGEAGEVADLIKKYAFHGHDLDKESLTKELGDVLW 63
Query: 62 YAAALARDLNTDLSAIAQRNLDKLASRKERG 92
Y + +A+ + + +A+ N++KL R +G
Sbjct: 64 YVSQIAKWADISMETVAELNIEKLKRRYPQG 94
>ref|NP_559109.1| (NC_003364) hypothetical protein [Pyrobaculum aerophilum]
gb|AAL63291.1| (AE009807) hypothetical protein [Pyrobaculum aerophilum]
Length = 112
Score = 37.0 bits (84), Expect = 0.044
Identities = 25/79 (31%), Positives = 35/79 (43%), Gaps = 9/79 (11%)
Query: 19 QVTYPAMGLANEAGEVLGKVKKIIRDGTFNRDDI---------ADELGDVLWYAAALARD 69
++ Y LA E GE VKK++R + R D+ A+EL DV Y +A
Sbjct: 33 RLEYFTNALAGEVGEAANIVKKVVRSVVYGRGDVKLGDVKEALAEELTDVFIYVITMAGL 92
Query: 70 LNTDLSAIAQRNLDKLASR 88
L DL + L+K R
Sbjct: 93 LELDLEREFFKKLEKNKQR 111
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.313 0.131 6.01e-154
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 58,687,086
Number of Sequences: 1038431
Number of extensions: 2022910
Number of successful extensions: 3618
Number of sequences better than 5.0e-02: 9
Number of HSP's better than 0.1 without gapping: 8
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 3602
Number of HSP's gapped (non-prelim): 9
length of query: 101
length of database: 328,470,697
effective HSP length: 77
effective length of query: 24
effective length of database: 248,511,510
effective search space: 5964276240
effective search space used: 5964276240
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf31 [6254-6315]
(60 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_532351.1| (NC_003304) conserved hypothetical protein... 42 0.002
ref|NP_539989.1| (NC_003317) Hypothetical Protein [Brucella... 40 0.008
ref|NP_104800.1| (NC_002678) unknown protein [Mesorhizobium... 39 0.018
ref|NP_102735.1| (NC_002678) unknown protein [Mesorhizobium... 38 0.031
ref|NP_385651.1| (NC_003047) CONSERVED HYPOTHETICAL PROTEIN... 38 0.031
>ref|NP_532351.1| (NC_003304) conserved hypothetical protein [Agrobacterium
tumefaciens str. C58 (U. Washington)]
gb|AAL42667.1| (AE009123) conserved hypothetical protein [Agrobacterium
tumefaciens str. C58 (U. Washington)]
Length = 50
Score = 41.6 bits (96), Expect = 0.002
Identities = 17/35 (48%), Positives = 25/35 (70%)
Query: 11 LATWREHRKVIKQLNKFSDRELRDIGITRGDIDRL 45
L WR++R+ + +L + SDREL D+GI R DI R+
Sbjct: 9 LTNWRKYRQTVTELGRMSDRELNDLGIGRQDIRRV 43
>ref|NP_539989.1| (NC_003317) Hypothetical Protein [Brucella melitensis]
gb|AAL52253.1| (AE009547) Hypothetical Protein [Brucella melitensis]
Length = 48
Score = 39.7 bits (91), Expect = 0.008
Identities = 15/29 (51%), Positives = 22/29 (75%)
Query: 14 WREHRKVIKQLNKFSDRELRDIGITRGDI 42
WR +R+ + +LN+ S REL D+GI+R DI
Sbjct: 10 WRRYRETVNELNQLSTRELNDLGISRADI 38
>ref|NP_104800.1| (NC_002678) unknown protein [Mesorhizobium loti]
dbj|BAB50586.1| (AP003002) unknown protein [Mesorhizobium loti]
Length = 59
Score = 38.5 bits (88), Expect = 0.018
Identities = 14/38 (36%), Positives = 25/38 (64%)
Query: 8 INYLATWREHRKVIKQLNKFSDRELRDIGITRGDIDRL 45
I WR + + +++LN+ + R+L D+GI R DI+R+
Sbjct: 17 IRAFRNWRMYNETVRELNRLNARQLSDLGINRADIERI 54
>ref|NP_102735.1| (NC_002678) unknown protein [Mesorhizobium loti]
dbj|BAB48521.1| (AP002996) unknown protein [Mesorhizobium loti]
Length = 69
Score = 37.7 bits (86), Expect = 0.031
Identities = 14/29 (48%), Positives = 22/29 (75%)
Query: 14 WREHRKVIKQLNKFSDRELRDIGITRGDI 42
WR +R + +L++ S+REL D+GI+R DI
Sbjct: 33 WRRYRDTVSELSRLSNRELTDLGISRSDI 61
>ref|NP_385651.1| (NC_003047) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium
meliloti]
emb|CAC46124.1| (AL591787) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium
meliloti]
Length = 47
Score = 37.7 bits (86), Expect = 0.031
Identities = 15/29 (51%), Positives = 21/29 (71%)
Query: 14 WREHRKVIKQLNKFSDRELRDIGITRGDI 42
WR++R+ +L + SDREL D+GI R DI
Sbjct: 10 WRKYRQTCNELGRMSDRELTDLGIGRADI 38
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.323 0.139 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.71e+034
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 42,314,758
Number of Sequences: 1038431
Number of extensions: 1259270
Number of successful extensions: 3090
Number of sequences better than 5.0e-02: 5
Number of HSP's better than 0.1 without gapping: 5
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3085
Number of HSP's gapped (non-prelim): 5
length of query: 60
length of database: 328,470,697
effective HSP length: 36
effective length of query: 24
effective length of database: 291,087,181
effective search space: 6986092344
effective search space used: 6986092344
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 85 (37.4 bits)
Query= orf32 [6314-6986]
(671 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064758.1| (NC_002519) RP ribonucleotide reductase [R... 1333 0.0
ref|NP_682117.1| (NC_004113) ORF_ID:tll1327~similar to RP r... 151 3e-035
pdb|1L1L|A Chain A, Crystal Structure Of B-12 Dependent (Cl... 150 4e-035
gb|AAA03078.1| (L20047) ribonucleoside triphosphate reducta... 150 4e-035
ref|NP_046865.1| (NC_001900) putative ribonucleotide reduct... 134 5e-030
ref|NP_039714.1| (NC_001335) Hypothetical protein [Mycobact... 133 8e-030
ref|NP_570337.1| (NC_003390) ribonucleoside triphosphate re... 71 5e-011
ref|NP_335005.1| (NC_002755) ribonucleoside-diphosphate red... 52 3e-005
ref|NP_070492.1| (NC_000917) ribonucleotide reductase (nrd)... 51 5e-005
>ref|NP_064758.1| (NC_002519) RP ribonucleotide reductase [Roseophage SIO1]
gb|AAG02603.1|AF189021_22 (AF189021) Roseophage SIO1 complete genome
Length = 671
Score = 1333 bits (3450), Expect = 0.0
Identities = 657/671 (97%), Positives = 657/671 (97%)
Query: 1 MNNYQEVSTRAEVVTRRTYNRPLNDEGTVFETWEQTVDRVIDHQRWLWERQQRAELNEKQ 60
MNNYQEVSTRAEVVTRRTYNRPLNDEGTVFETWEQTVDRVIDHQRWLWERQQRAELNEKQ
Sbjct: 1 MNNYQEVSTRAEVVTRRTYNRPLNDEGTVFETWEQTVDRVIDHQRWLWERQQRAELNEKQ 60
Query: 61 SAELTELRQLMLDRKATTSGRTLWLGGTDVAKKREASQFNCSFGNVETVHDVVDAQHLLL 120
SAELTELRQLMLDRKATTSGRTLWLGGTDVAKKREASQFNCSFGNVETVHDVVDAQHLLL
Sbjct: 61 SAELTELRQLMLDRKATTSGRTLWLGGTDVAKKREASQFNCSFGNVETVHDVVDAQHLLL 120
Query: 121 QGCGVGFYPSVGILSGFTAPVEVQVVRSEKQKSDPKGAEDNVEAFYQEDGKSVWEVVVGD 180
QGCGVGFYPSVGILSGFTAPVEVQVVRSEKQKSDPKGAEDNVEAFYQEDGKSVWEVVVGD
Sbjct: 121 QGCGVGFYPSVGILSGFTAPVEVQVVRSEKQKSDPKGAEDNVEAFYQEDGKSVWEVVVGD 180
Query: 181 SAEAWAKSFGKLVAMKKRVDIIRLNYREIRAAGMRLKGYGWISSGDSDLSVAQTAICGIL 240
SAEAWAKSFGKLVAMKKRVDIIRLNYREIRAAGMRLKGYGWISSGDSDLSVAQTAICGIL
Sbjct: 181 SAEAWAKSFGKLVAMKKRVDIIRLNYREIRAAGMRLKGYGWISSGDSDLSVAQTAICGIL 240
Query: 241 NDRAGQLLRHMDILDLLNHMGTVLSSRRSAEIAVLPVTNPEIDAFISAKKDFWLHGNEHR 300
NDRAGQLLRHMDILDLLNHMGTVLSSRRSAEIAVLPVTNPEIDAFISAKKDFWLHGNEHR
Sbjct: 241 NDRAGQLLRHMDILDLLNHMGTVLSSRRSAEIAVLPVTNPEIDAFISAKKDFWLHGNEHR 300
Query: 301 QQSNNSIMFDTKPTKWELSYIFDKIVEAGGSEPGFINAEAARKRAPWFKGVNPCAEILLG 360
QQSNNSIMFDTKPTKWELSYIFDKIVEAGGSEPGFINAEAARKRAPWFKGVNPCAEILLG
Sbjct: 301 QQSNNSIMFDTKPTKWELSYIFDKIVEAGGSEPGFINAEAARKRAPWFKGVNPCAEILLG 360
Query: 361 NKSFCNLVEVDWGKFLGDHGGLERAVWLAARANYRQTCVNLDDGVLQRSWHELNEFLRLC 420
NKSFCNLVEVDWGKFLGDHGGLERAVWLAARANYRQTCVNLDDGVLQRSWHELNEFLRLC
Sbjct: 361 NKSFCNLVEVDWGKFLGDHGGLERAVWLAARANYRQTCVNLDDGVLQRSWHELNEFLRLC 420
Query: 421 GVGGTGIVKFLDHYTGRNNVASMLQSLRAQAHKGANSMAEEFGTPMPKAVTTVKPSGTLS 480
GVGGTGIVKFLDHYTGRNNVASMLQSLRAQAHKGANSMAEEFGTPMPKAVTTVKPSGTLS
Sbjct: 421 GVGGTGIVKFLDHYTGRNNVASMLQSLRAQAHKGANSMAEEFGTPMPKAVTTVKPSGTLS 480
Query: 481 KIMDTTEGVHKPLGKYIINNITFSKDDPIIASLEAAGYKVFDKPFEPSSKLVAMPVCYDD 540
KIMDTTEGVHKPLGKYIINNITFSKDDPIIASLEAAGYKVFDKPFEPSSKLVAMPVCYDD
Sbjct: 481 KIMDTTEGVHKPLGKYIINNITFSKDDPIIASLEAAGYKVFDKPFEPSSKLVAMPVCYDD 540
Query: 541 VSFDEVETDRGTVHVNLESAVEQLDRYKLLMDNYVDHNCSVTISYDPSEVPAIIDWILTN 600
VSFDEVETDRGTVHVNLESAVEQLDRYKLLMDNYVDHNCSVTISYDPSEVPAIIDWILTN
Sbjct: 541 VSFDEVETDRGTVHVNLESAVEQLDRYKLLMDNYVDHNCSVTISYDPSEVPAIIDWILTN 600
Query: 601 WDTYVGVSFIYRNDPTKTAEDLGYAYLPQEVITEEAYVAYTSQLKPXXXXXXXXXXXXXX 660
WDTYVGVSFIYRNDPTKTAEDLGYAYLPQEVITEEAYVAYTSQLKP
Sbjct: 601 WDTYVGVSFIYRNDPTKTAEDLGYAYLPQEVITEEAYVAYTSQLKPLDLSGLTSDDDLVD 660
Query: 661 EACATGACPIR 671
EACATGACPIR
Sbjct: 661 EACATGACPIR 671
>ref|NP_682117.1| (NC_004113) ORF_ID:tll1327~similar to RP ribonucleotide reductase
[Thermosynechococcus elongatus BP-1]
dbj|BAC08879.1| (AP005373) ORF_ID:tll1327~similar to RP ribonucleotide reductase
[Thermosynechococcus elongatus BP-1]
Length = 757
Score = 151 bits (381), Expect = 3e-035
Identities = 177/735 (24%), Positives = 287/735 (38%), Gaps = 129/735 (17%)
Query: 2 NNYQEVSTRAEVVTRRTYNRPLNDEGTVFETWEQTVDRVIDHQRWLWERQQRAELNEKQS 61
+++ + A V RTY+R +E E+W+Q DR + + E+
Sbjct: 10 SSFPTTAPTAYPVFYRTYSR--QNEAGERESWQQVCDRTLKGLQ---------EVGHLTE 58
Query: 62 AELTELRQLMLDRKATTSGRTLWLGGTDVAKKRE--ASQFNCSFGNVETVHDVVDAQHLL 119
AE+ LR++ KA SGR LW+GGTD ++ E + +NC+ NV L
Sbjct: 59 AEVALLREMQEALKALPSGRWLWVGGTDWVRQPENFSGAYNCTSTNVVDWDSFGLMMDLA 118
Query: 120 LQGCGVGFYPSVGILSGFTA---PVEVQVVRSEKQKSDPKGAEDNVEAFYQEDGKSVWEV 176
+QGCG G +S A +EV VV Q E + ++
Sbjct: 119 MQGCGTGAILEPKYISQLPAICNRLEVTVVGEIGQTPREARQEHTTVTLTGQ----TCQI 174
Query: 177 VVGDSAEAWAKSFGKLVAMKKRVDI-----IRLNYREIRAAGMRLKGYGWISSGDSDLSV 231
VGDS + W KS+ L+ + + + ++ R++R G RLKG+G +++ L
Sbjct: 175 KVGDSRQGWVKSYQTLLELASNPHLGGLVQVTIDIRDVRPTGERLKGFGGVAN-PVKLPH 233
Query: 232 AQTAICGILNDRAGQLLRHMDILDLLNHMGTVLSSRRSAEIAVLPVTNPEIDAFISAKKD 291
+LN G+ L ++ L++ V+ + A + E +AF AK++
Sbjct: 234 LYHRCAALLNAAVGRQLTSVECCLLIDEAAVVVVAGNVRRSAGMRQGASEDEAFAQAKQN 293
Query: 292 FW---LHGN-------EHRQQSNNSIMFDTKPTKWE---------------LSYIFDKIV 326
W GN + + +N++ +F KP+ E + + + I
Sbjct: 294 LWQQDAQGNWRIDPKRDALRMANHTRVFHRKPSLEECINAVRSQFYSGEGAIQWAGEAIA 353
Query: 327 EAGGS-------EPGFINAEAARKRAPWFK------------------GVNPCAEILLGN 361
A + F+ A ++ + + G+NPC EI+ G
Sbjct: 354 RANADLLTTPHLKQAFLRAYENQQGEAFLRQLLPAMDQRELAHRLGRYGLNPCGEII-GA 412
Query: 362 KSFCNLVEVDWGKF-LGDHGGLERAVWLAARANYRQTCVNLDDGVLQRSWHELNEFLRLC 420
CNL E+ + D E+A +A L+ + + +
Sbjct: 413 DFHCNLAEIHLNQIDPADTATQEKAF----KAGALSVAALLNHRFVVERYQYSRAIDPIV 468
Query: 421 GVGGTGIVKFLDHYTG--------------------RNNVASMLQSLRAQAHKGANSMAE 460
GV TG+ F H G +N A+ LQ R H+ +
Sbjct: 469 GVSFTGLFDFFVHAFGVPWLEWWAAGRPDTPQGRQFKNQEAAFLQRWRQIVHETVWQYCD 528
Query: 461 EFGTPMPKAVTTVKPSGTLSKIMDTTEGVHKPLGKYIINNITFSKDDPIIASLEAAGY-- 518
G P TTV+PSGT S + + G H P + I ITF K DP+ + GY
Sbjct: 529 RHGLKRPNRCTTVQPSGTKSLLTGASPGWHPPKAQRFIRRITFRKGDPVAMACYDYGYNV 588
Query: 519 -----------KVFDKPFEP--SSKLVAMPVCYDDVSFDEVETDRGTVHVNLESAVEQLD 565
K+ D PF+P + LV +PV VS+ ++ T+ V+ SA+ Q D
Sbjct: 589 IPSQSDKDETGKLLDDPFDPRCTEWLVEIPVA---VSWADL-AGADTIDVSQFSALAQFD 644
Query: 566 RYKLLMDNYVDHNCSVTISYDPSEVPAIIDWILT---NWDTYVGVSFIYRNDPTKTAEDL 622
Y + +Y HN S TI +EV + I T Y+ + + R D +T
Sbjct: 645 FYMQVQQHYTTHNTSATIELREAEVEPLATAIYTAIQQDQGYISAALLARFDAHET---- 700
Query: 623 GYAYLPQEVITEEAY 637
+ LP E I++E Y
Sbjct: 701 -FPRLPFEPISKERY 714
>pdb|1L1L|A Chain A, Crystal Structure Of B-12 Dependent (Class Ii)
Ribonucleotide Reductase
pdb|1L1L|B Chain B, Crystal Structure Of B-12 Dependent (Class Ii)
Ribonucleotide Reductase
pdb|1L1L|C Chain C, Crystal Structure Of B-12 Dependent (Class Ii)
Ribonucleotide Reductase
pdb|1L1L|D Chain D, Crystal Structure Of B-12 Dependent (Class Ii)
Ribonucleotide Reductase
Length = 739
Score = 150 bits (380), Expect = 4e-035
Identities = 198/748 (26%), Positives = 310/748 (40%), Gaps = 126/748 (16%)
Query: 13 VVTRRTYNRPLNDEGTVFETWEQTVDRVIDHQRWLWERQQRA---ELNEKQSAELTELRQ 69
V +RTY R L ++G E W++TV RV++ L R Q + EL + + E L +
Sbjct: 29 VTYKRTYARWLPEKGRS-ENWDETVKRVVEGNINLDPRLQDSPSLELKQSLTEEAERLYK 87
Query: 70 LMLDRKATTSGRTLWLGGTDVAKKREASQFNCSF--------GNVETVHDVVDAQHL--- 118
L+ AT SGR LW+ GTD ++ S NC F G+ + V + Q
Sbjct: 88 LIYGLGATPSGRNLWISGTDYQRRTGDSLNNCWFVAIRPQKYGDSKIVPSYLGKQEKAVS 147
Query: 119 ---------LLQGCGVGFYPSVGILSGFTA---PVEVQVVRSEKQKSDPKGAEDNVEAFY 166
L++G GVGF + +S +++Q+V E +S +
Sbjct: 148 MPFSFLFDELMKGGGVGFSVARSNISQIPRVDFAIDLQLVVDETSESYDASVKVGAVGKN 207
Query: 167 Q--EDGKSVWEVVVGDSAEAWAKSFGKLVAMK------KRVDIIRLNYREIRAAGMRLKG 218
+ +D S++ + D+ E W + L+ + R + L+ +IR G + G
Sbjct: 208 ELVQDADSIY-YRLPDTREGWVLANALLIDLHFAQTNPDRKQKLILDLSDIRPYGAEIHG 266
Query: 219 YGWISSGDSDLSVAQTAICGILNDRAGQLLRHMDILDLLNHMGTVLSS---RRSAEIAVL 275
+G +SG L + +LN++AG L +D D+ N +G + + RRSAE+A+
Sbjct: 267 FGGTASGPMPLISMLLDVNEVLNNKAGGRLTAVDAADICNLIGKAVVAGNVRRSAELAL- 325
Query: 276 PVTNPEIDAFISAKKDF--WLHGNEHRQQSNNSIMFDTKPTKWELSYIFDKIVEAGGSEP 333
+N + D FIS K+D +H HR SNNS+ D+ + ++ I I E G EP
Sbjct: 326 -GSNDDQD-FISMKQDQEKLMH---HRWASNNSVAVDSAFSGYQP--IAAGIRENG--EP 376
Query: 334 GFINAEAARKRAPWF-----------KGVNPCAEILLGNKSFCNLVEVDWGKFLGDHGG- 381
G +N + ++ +G NPC EI L N CNL EV + + G
Sbjct: 377 GIVNLDLSKNYGRIVDGYQAGIDGDVEGTNPCGEISLANGEPCNLFEVF--PLIAEEQGW 434
Query: 382 -LERAVWLAARANYRQTCVNLDDGVLQRSWHELNEFLRLCGVGGTGIVKFLDHYTGRNNV 440
L+ LAAR R T D + + E+ + R G+ +GI +L G V
Sbjct: 435 DLQEVFALAARYAKRVTFSPYDWEISR----EIIQKNRRIGISMSGIQDWLLTRLGNRVV 490
Query: 441 ASMLQSLRAQAH---------KGANSMAEEFGTPMPKAV---------------TTVKPS 476
+ H K A M ++ + KA TTVKPS
Sbjct: 491 TGFKDDFDPETHEAIKVPVYDKRAIKMVDQLYKAVVKADQDYSKTLGCNESIKHTTVKPS 550
Query: 477 GTLSKIMDTTEGVHKPLGKYIINNITFSKDDPIIASLEAAGYKVFDKPFEPSSKLVAMP- 535
GT++K+ +EG+H G Y+I I F DP++ +L+A GY+ + ++ V P
Sbjct: 551 GTVAKLAGASEGMHFHYGAYLIQRIRFQDSDPLLPALKACGYRTEADIYTENTTCVEFPI 610
Query: 536 --VCYDDVSFDEVETDRGTVHVNLESAVEQLDRYKLLMDNYVDHNCSVTISYDPSEVPAI 593
V D+ +F GTV S EQ L + D+ S TI++ SE +
Sbjct: 611 KAVGADNPNF----ASAGTV-----SIAEQFATQAFLQTYWSDNAVSCTITFQDSEGDQV 661
Query: 594 IDWI--------LTNWDTYVGVSFIYRNDPTKTAEDLGYAYLPQEVI--TEEAYVAYTSQ 643
+ T+ Y G S + P + + Y QE+ EE + S
Sbjct: 662 ESLLRQYRFITKSTSLLPYFGGSL--QQAPKEPIDKETYEKRSQEITGNVEEVFSQLNSD 719
Query: 644 LKPXXXXXXXXXXXXXXEACATGACPIR 671
+K C GACPI+
Sbjct: 720 VKDLELVDQTD--------CEGGACPIK 739
>gb|AAA03078.1| (L20047) ribonucleoside triphosphate reductase [Lactobacillus
leichmannii]
Length = 739
Score = 150 bits (380), Expect = 4e-035
Identities = 198/748 (26%), Positives = 310/748 (40%), Gaps = 126/748 (16%)
Query: 13 VVTRRTYNRPLNDEGTVFETWEQTVDRVIDHQRWLWERQQRA---ELNEKQSAELTELRQ 69
V +RTY R L ++G E W++TV RV++ L R Q + EL + + E L +
Sbjct: 29 VTYKRTYARWLPEKGRS-ENWDETVKRVVEGNINLDPRLQDSPSLELKQSLTEEAERLYK 87
Query: 70 LMLDRKATTSGRTLWLGGTDVAKKREASQFNCSF--------GNVETVHDVVDAQHL--- 118
L+ AT SGR LW+ GTD ++ S NC F G+ + V + Q
Sbjct: 88 LIYGLGATPSGRNLWISGTDYQRRTGDSLNNCWFVAIRPQKYGDSKIVPSYLGKQEKAVS 147
Query: 119 ---------LLQGCGVGFYPSVGILSGFTA---PVEVQVVRSEKQKSDPKGAEDNVEAFY 166
L++G GVGF + +S +++Q+V E +S +
Sbjct: 148 MPFSCLFDELMKGGGVGFSVARSNISQIPRVDFAIDLQLVVDETSESYDASVKVGAVGKN 207
Query: 167 Q--EDGKSVWEVVVGDSAEAWAKSFGKLVAMK------KRVDIIRLNYREIRAAGMRLKG 218
+ +D S++ + D+ E W + L+ + R + L+ +IR G + G
Sbjct: 208 ELVQDADSIY-YRLPDTREGWVLANALLIDLHFAQTNPDRKQKLILDLSDIRPYGAEIHG 266
Query: 219 YGWISSGDSDLSVAQTAICGILNDRAGQLLRHMDILDLLNHMGTVLSS---RRSAEIAVL 275
+G +SG L + +LN++AG L +D D+ N +G + + RRSAE+A+
Sbjct: 267 FGGTASGPMPLISMLLDVNEVLNNKAGGRLTAVDAADICNLIGKAVVAGNVRRSAELAL- 325
Query: 276 PVTNPEIDAFISAKKDF--WLHGNEHRQQSNNSIMFDTKPTKWELSYIFDKIVEAGGSEP 333
+N + D FIS K+D +H HR SNNS+ D+ + ++ I I E G EP
Sbjct: 326 -GSNDDQD-FISMKQDQEKLMH---HRWASNNSVAVDSAFSGYQP--IAAGIRENG--EP 376
Query: 334 GFINAEAARKRAPWF-----------KGVNPCAEILLGNKSFCNLVEVDWGKFLGDHGG- 381
G +N + ++ +G NPC EI L N CNL EV + + G
Sbjct: 377 GIVNLDLSKNYGRIVDGYQAGIDGDVEGTNPCGEISLANGEPCNLFEVF--PLIAEEQGW 434
Query: 382 -LERAVWLAARANYRQTCVNLDDGVLQRSWHELNEFLRLCGVGGTGIVKFLDHYTGRNNV 440
L+ LAAR R T D + + E+ + R G+ +GI +L G V
Sbjct: 435 DLQEVFALAARYAKRVTFSPYDWEISR----EIIQKNRRIGISMSGIQDWLLTRLGNRVV 490
Query: 441 ASMLQSLRAQAH---------KGANSMAEEFGTPMPKAV---------------TTVKPS 476
+ H K A M ++ + KA TTVKPS
Sbjct: 491 TGFKDDFDPETHEAIKVPVYDKRAIKMVDQLYKAVVKADQDYSKTLGCNESIKHTTVKPS 550
Query: 477 GTLSKIMDTTEGVHKPLGKYIINNITFSKDDPIIASLEAAGYKVFDKPFEPSSKLVAMP- 535
GT++K+ +EG+H G Y+I I F DP++ +L+A GY+ + ++ V P
Sbjct: 551 GTVAKLAGASEGMHFHYGAYLIQRIRFQDSDPLLPALKACGYRTEADIYTENTTCVEFPI 610
Query: 536 --VCYDDVSFDEVETDRGTVHVNLESAVEQLDRYKLLMDNYVDHNCSVTISYDPSEVPAI 593
V D+ +F GTV S EQ L + D+ S TI++ SE +
Sbjct: 611 KAVGADNPNF----ASAGTV-----SIAEQFATQAFLQTYWSDNAVSCTITFQDSEGDQV 661
Query: 594 IDWI--------LTNWDTYVGVSFIYRNDPTKTAEDLGYAYLPQEVI--TEEAYVAYTSQ 643
+ T+ Y G S + P + + Y QE+ EE + S
Sbjct: 662 ESLLRQYRFITKSTSLLPYFGGSL--QQAPKEPIDKETYEKRSQEITGNVEEVFSQLNSD 719
Query: 644 LKPXXXXXXXXXXXXXXEACATGACPIR 671
+K C GACPI+
Sbjct: 720 VKDLELVDQTD--------CEGGACPIK 739
>ref|NP_046865.1| (NC_001900) putative ribonucleotide reductase; gp50
[Mycobacteriophage D29]
sp|O64240|VG50_BPMD2 PUTATIVE ADENOSYLCOBALAMIN-DEPENDENT RIBONUCLEOTIDE REDUCTASE
(GP50)
pir||G72805 probable ribonucleotide reductase protein - Mycobacterium phage D29
gb|AAC18490.1| (AF022214) putative ribonucleotide reductase; gp50
[Mycobacteriophage D29]
Length = 693
Score = 134 bits (336), Expect = 5e-030
Identities = 178/724 (24%), Positives = 280/724 (38%), Gaps = 107/724 (14%)
Query: 12 EVVTRRTYNRPLNDEGTVFETWEQTVDRVIDHQRWLWERQQRAELNEKQSAELTELRQLM 71
E+V RTY+R + +GT ETW +TV+RV+ L + + + E +L +LM
Sbjct: 13 ELVYNRTYSR-VKPDGTR-ETWPETVERVVSGNLALVDSRYQLP------GEREDLLRLM 64
Query: 72 LDRKATTSGRTLWLGGTDVAKKREASQFNCSF-GNVETVHDVVDAQHL-LLQGCGVGFYP 129
+ K +GR LW G K FNC G E D + + L++G GVG
Sbjct: 65 REFKILPAGRHLWASGV----KNAQHLFNCWVSGWTEKPSDHFEFTFMRLMEGGGVGANY 120
Query: 130 SVGILSGFT---APVEVQVVRSEKQKSDPKGAE-DNVEAFYQEDGKSVWEVVVGDSAEAW 185
S L+ + +EV +V E AE + + Y D + V+ DS E W
Sbjct: 121 SNRFLADYPHVKQELEVHIVCDEDHDDYADLAEAGQLSSRYDSDWVDAF--VIEDSREGW 178
Query: 186 AKSFGKLVAMKKRVDIIR----LNYREIRAAGMRLKGYGWISSGDSDLSVAQTAICGILN 241
A + L+ R D+ + +RAAG +LK +G +SG L+ T + IL+
Sbjct: 179 AAALVDLIDTHYRDDVAHKERVYDVSRVRAAGRKLKTFGGTASGPVPLAKMLTEVSEILS 238
Query: 242 DRA----------GQLLRHMDILDLLNHMGTVLSS---RRSAEIAVLPVTNPEIDAFISA 288
A G L +D +++ + + + + RRSA +A++ + +++ F +
Sbjct: 239 RCAREGEQYRFEDGGALTGLDAMEIDHAIAQCVVAGGVRRSARMAMMHWADWQVETFTNI 298
Query: 289 KKDFWLHGNEHRQQSNNSIMFDT--------KPTKWELSYIFDKIVEAG--GSEPGFINA 338
K+D H + + + P + + + E EPG ++
Sbjct: 299 KQDSGSHWTTNISVEVDDAFWSLAKAPVDPLNPRSTKAHRVLKALSEGAVRNGEPGMWDS 358
Query: 339 EAARKRAP-WFKGVNPCAEILLGNKSFCNLVEVDWGKFLGDHG-----GLERAVWLAARA 392
+ P NPC EI L CNL ++ F+ D G L RA L R
Sbjct: 359 SLSNVGEPNEVVCTNPCGEITLEPWEPCNLGHINLAAFVTDAGKTDYIDLIRAHRLMTRF 418
Query: 393 NYRQTCVNLDDGVLQRSWHELNEFLRLCGVGGTGIVKFL-------DHYTGRNNVASMLQ 445
R T V E+ + R GVG G+ +L G + L+
Sbjct: 419 LIRATF----SAVADPKSREVLDRNRRIGVGHLGVASYLALTGRRYSQAPGDKRFTAFLR 474
Query: 446 SLRAQAHKGANSMAEEFGTPMPKAVTTVKPSGTLSKIMDTTEGVHKPLGKYIINNITFS- 504
+ A+ + A + E P+P TV P+GT++K+ +EG+H +Y I I FS
Sbjct: 475 EMAAEVDRAAEEFSHELRIPVPVKKRTVAPTGTIAKMPGVSEGIHPIFSRYFIRRIRFSV 534
Query: 505 --KDDPIIASLEAA-GYKVFDKPFEPSSKLVAMPVCYDDVSFDEVETDRGTVHVNLESAV 561
D + AS AA GY V ++ S + + D + V G ++ +
Sbjct: 535 LDNDQFLTASQYAADGYHVEKDQYDKSGNTWVVEIPTKDTLVEAVAARFGRDAEDIVESA 594
Query: 562 EQLDRYKLL------MDNYVDHNCSVTISYDPSEVPAIIDWILTNWDTYVGVSFIYRNDP 615
+L ++LL + D+ S T + DP D Y GV
Sbjct: 595 NELTLHQLLAFQALYQTCWADNAVSFTANVDP--------------DAYEGVD------- 633
Query: 616 TKTAEDL--------GYAYLPQEVITEEAYVAYTSQLKPXXXXXXXXXXXXXXEACATGA 667
A DL G P+E + Y T Q E CA GA
Sbjct: 634 --VAADLQRFSGLIKGSTIFPEESFPQAPYERITKQ--QYEAAAIKAVADGVDEECANGA 689
Query: 668 CPIR 671
CPI+
Sbjct: 690 CPIK 693
>ref|NP_039714.1| (NC_001335) Hypothetical protein [Mycobacterium phage L5]
sp|Q05262|VG50_BPML5 PUTATIVE ADENOSYLCOBALAMIN-DEPENDENT RIBONUCLEOTIDE REDUCTASE
(GP50)
pir||S30995 gene 50 protein - Mycobacterium phage L5
emb|CAA79426.1| (Z18946) predicted 76.3Kd protein [Mycobacterium phage L5]
Length = 682
Score = 133 bits (334), Expect = 8e-030
Identities = 168/709 (23%), Positives = 287/709 (39%), Gaps = 87/709 (12%)
Query: 12 EVVTRRTYNRPLNDEGTVFETWEQTVDRVIDHQRWLWERQQRAELNEKQSAELTELRQLM 71
E+V RTY R D ETW +TV+RV+D L +R+ E E +L +LM
Sbjct: 12 ELVYNRTYARTKPDGSK--ETWPETVERVVDGNLALVDRRHMLE------GEREDLVRLM 63
Query: 72 LDRKATTSGRTLWLGGTDVAKKREASQFNCSF-GNVETVHDVVDAQHL-LLQGCGVGFYP 129
K +GR LW G K FNC G E + D + L++G GVG
Sbjct: 64 TKFKVLPAGRHLWASGV----KNAQHLFNCWVAGWPEKISDHFQFTFMRLMEGGGVGANY 119
Query: 130 SVGILSGFTA---PVEVQVVRSEKQKSDPKGAEDN--VEAFYQEDGKSVWEVVVGDSAEA 184
S L G+ P+ V++V + + D + +D + Y D + + DS E
Sbjct: 120 SNHYLEGYPEVVNPLRVEIV-CDPEHVDYQAMKDVGILSEHYSHDWAGAY--AIEDSREG 176
Query: 185 WAKSFGKLVAMKKRVDIIRL----NYREIRAAGMRLKGYGWISSGDSDLSVAQTAICGIL 240
WA + L+ R + + IR G +LK +G ++SG +V + I
Sbjct: 177 WAAALVDLIDTHYRPGTVHYQRVYDVSRIRPQGAKLKTFGGVASGPLPFAVMLQKVANIF 236
Query: 241 NDRAGQLLRHMDILDLLNHMGTVLSS---RRSAEIAVLPVTNPEIDAFISAKK------- 290
++ AG L + +++ + + + + RRSA ++++ + +ID FI+ K
Sbjct: 237 SEYAGITLDGIAAMEVDHAIAQCVVAGGVRRSARMSMMHWDDYQIDKFINIKASTGEHWT 296
Query: 291 ---------DFWLHGNEHRQQSNNSIMFDTKPTKWELSYIFDKIVEAG--GSEPGFINAE 339
+FW E + ++ + P + + + E EPG ++
Sbjct: 297 TNISVEVDDEFWRLAKEG--YGSFAVAREDMPKQKRAHRVLKALSEGAVRNGEPGMWDSS 354
Query: 340 AARKRAP-WFKGVNPCAEILLGNKSFCNLVEVDWGKFLG-----DHGGLERAVWLAARAN 393
+ P NPC EI L CNL ++ F+ D+ L RA L R
Sbjct: 355 LSNVGEPNRVVCTNPCGEITLEPWEPCNLGHINLAAFVTPAGKTDYLDLIRAHRLMTRFL 414
Query: 394 YRQTCVNLDDGVLQRSWHELNEFLRLCGVGGTGIVKFL-------DHYTGRNNVASMLQS 446
R T + D + E+ + R GVG G+ +L G L+
Sbjct: 415 IRATFSEVADPKSR----EVLDRNRRIGVGHLGVASYLALTGSRYSEAPGDKRFTQFLRE 470
Query: 447 LRAQAHKGANSMAEEFGTPMPKAVTTVKPSGTLSKIMDTTEGVHKPLGKYIINNITF--S 504
+ A+ A A + P+P T+ P+GT++K+ +EG+H +Y I + F +
Sbjct: 471 MAAEVDSEAERFAHDLRIPVPVKKRTIAPTGTVAKMPGVSEGIHPIFSRYFIRRVRFNLN 530
Query: 505 KDDPIIASLEAAGYKVFDKPFEPSSKLVAMPVCYDDV-SFDEVETDRGTVHVNLESAVEQ 563
D + L A GY++ F P++++V++P V + +E+ D V E + Q
Sbjct: 531 SDMAELRRLAAEGYEIEKDLFAPNTEVVSIPTKDTLVQAVEELGWDEEIVESADELTLHQ 590
Query: 564 LDRYKLLMDN-YVDHNCSVTISYDPSEVPAIIDWILTNWDTYVGVSFIYRNDPTKTAEDL 622
L ++ L + D+ S T + P A + + + + G+ + +
Sbjct: 591 LLAFQALYQMCWADNAVSFTANVHPDAYEA--EDVAADLKRFAGLI-----KGSTIFPES 643
Query: 623 GYAYLPQEVITEEAYVAYTSQLKPXXXXXXXXXXXXXXEACATGACPIR 671
+ P E IT++ Y + ++ E CA GACPI+
Sbjct: 644 SFPQAPYERITKQQYESAAAK----------AVEDGVDEECANGACPIK 682
>ref|NP_570337.1| (NC_003390) ribonucleoside triphosphate reductase B [Cyanophage
P60]
gb|AAL73275.1| (AF338467) ribonucleoside triphosphate reductase B [Cyanophage P60]
Length = 446
Score = 70.9 bits (172), Expect = 5e-011
Identities = 80/322 (24%), Positives = 128/322 (38%), Gaps = 45/322 (13%)
Query: 350 GVNPCAEILLGNKSFCNLVEVDWGKFLGDHGGLE----RAVWLAARA-------NYRQTC 398
G+NPC EIL G CNL EV D + +A LAA A R
Sbjct: 90 GLNPCGEIL-GKDFHCNLSEVHLNTLDPDDTDAQVKAFQAGALAAGALLHHKFTTERYQY 148
Query: 399 VNLDDGVLQRSWHELNEF---------LRLCGVGGTGIVKFLDHYTGRNNVASMLQSLRA 449
D ++ S+ L +F LR +G + +F + G A +L+ +
Sbjct: 149 SREQDPIVGVSFTGLFDFCVNKWGAEWLRWWSLGRPFVPEFHEFVLGE---ARLLEFWKG 205
Query: 450 QAHKGANSMAEEFGTPMPKAVTTVKPSGTLSKIMDTTEGVHKPLGKYIINNITFSKDDPI 509
+ + E+ G MP TTV+P+GT S + + G H P I ITF+++DP+
Sbjct: 206 VVKETLEAYCEKHGLKMPNRYTTVQPAGTKSLLTGASPGWHPPKSARFIRRITFAREDPV 265
Query: 510 IASLEAAGYKV-------------FDKPFEPSSKLVAMPVCYDDVSFDEVETDRGTVHVN 556
+ GY V + P++ K + + + V ++ D+ N
Sbjct: 266 ALACIDYGYTVVPSQSSRDEEGNLLNDPWDERVKEWLVEIPTETVWANKPGCDQ--YEPN 323
Query: 557 LESAVEQLDRYKLLMDNYVDHNCSVTISYDPSEVPAIIDWILTN-WDTYVGVSFIYRNDP 615
S Q D Y + + HN S T+ + +E+ + + I D Y+ + + R D
Sbjct: 324 TFSVKAQFDFYMQVQKFWTTHNTSATLEFRENEIEELSELIYRGCGDGYISAALLARFDA 383
Query: 616 TKTAEDLGYAYLPQEVITEEAY 637
+T + LP E I E Y
Sbjct: 384 NET-----FPRLPFEPIDRETY 400
>ref|NP_335005.1| (NC_002755) ribonucleoside-diphosphate reductase, alpha subunit,
putative [Mycobacterium tuberculosis CDC1551]
gb|AAK44819.1| (AE006957) ribonucleoside-diphosphate reductase, alpha subunit,
putative [Mycobacterium tuberculosis CDC1551]
Length = 735
Score = 51.6 bits (122), Expect = 3e-005
Identities = 85/367 (23%), Positives = 137/367 (37%), Gaps = 85/367 (23%)
Query: 178 VGDSAEAWAKSFGKLVAMKKRVDIIRLNYREIRAAGMRLKGYGWISSGDSDLSVAQTAIC 237
+ DS ++ + G+ +++ + +R AG R+ G +SG +
Sbjct: 241 IEDSLQSIFATLGQAAELQRAGGGTGYAFSHLRPAGDRVASTGGTASGPVSFLRLYDSAA 300
Query: 238 GILNDRAGQLLRHMDILDLLNHMGTVLSSRRSAEIAVLPVTNPEIDAFISAKK------- 290
G+++ MG RR A +AVL V++P+I F++AK
Sbjct: 301 GVVS------------------MG---GRRRGACMAVLDVSHPDICDFVTAKAESPSELP 339
Query: 291 ---------DFWLHGNE----HRQQSNNSIMFDTKPTKWELSYIFDKIVEAGGSEPGFIN 337
D +L E HR + + + EL K AGG +PG +
Sbjct: 340 HFNLSVGVTDAFLRAVERNGLHRLVNPRTGKIVARMPAAELFDAICKAAHAGG-DPGLVF 398
Query: 338 AEAARKRAPW-----FKGVNPCAEILLGNKSFCNLVEVDWGKFLGDH----GGLERAVWL 388
+ + P + NPC E+ L CNL ++ + L D LE +
Sbjct: 399 LDTINRANPVPGRGRIEATNPCGEVPLLPYESCNLGSINLARMLADGRVDWDRLEEVAGV 458
Query: 389 AARANYRQTCVNLDD--GVLQRSWHELNEFLRLC---GVGGTGIVKFL-------DHYTG 436
A R LDD V + + EL E R G+G G+ + L D
Sbjct: 459 AVRF--------LDDVIDVSRYPFPELGEAARATRKIGLGVMGLAELLAALGIPYDSEEA 510
Query: 437 RNNVASMLQSLRAQAHKGANSMAEEFGT-------------PMPKA-VTTVKPSGTLSKI 482
+++ ++ AH + +AEE G P A VT+V P+GT+S I
Sbjct: 511 VRLATRLMRRIQQAAHTASRRLAEERGAFPAFTDSRFARSGPRRNAQVTSVAPTGTISLI 570
Query: 483 MDTTEGV 489
TT G+
Sbjct: 571 AGTTAGI 577
>ref|NP_070492.1| (NC_000917) ribonucleotide reductase (nrd) [Archaeoglobus fulgidus]
pir||G69457 ribonucleoside-diphosphate reductase (EC 1.17.4.1) large chain nrd
- Archaeoglobus fulgidus
gb|AAB89584.1| (AE000988) ribonucleotide reductase (nrd) [Archaeoglobus fulgidus]
Length = 752
Score = 50.8 bits (120), Expect = 5e-005
Identities = 67/277 (24%), Positives = 107/277 (38%), Gaps = 62/277 (22%)
Query: 267 RRSAEIAVLPVTNPEIDAFISAKKDFWLHGNEHRQQSNNSIMFDT--KPTKWEL------ 318
RR A + VL V +P+I+ FI AK + + N + + + K ++L
Sbjct: 159 RRGANMGVLNVHHPDIEEFIKAKWEEGVLRNFNISVAATDAFMEALKKDDDYDLINPRTG 218
Query: 319 --------SYIFDKIVEAG--GSEPGFINAEAARKRAPW-----FKGVNPCAEILLGNKS 363
IF+ IVE EPG I + + P + NPC E L
Sbjct: 219 EVVRRVPARKIFNLIVEGAWRNGEPGMIFIDTINRFNPTPHVGQIEATNPCGEQPLLPYE 278
Query: 364 FCNLVEVDWGKFLG----DHGGLERAVWLAARANYRQTCVNLDDGVLQRSW-----HELN 414
CNL ++ KF+ D GL VW+A R LDD + S+ ++
Sbjct: 279 SCNLGSINLAKFVKDGEIDWDGLREVVWIAVRF--------LDDVIDVNSYPIPEIEQMT 330
Query: 415 EFLRLCGVGGTGIVKFL-------DHYTGRNNVASMLQSLRAQAHKGANSMAEEFGT-PM 466
+ R G+G G L D +++ ++ ++H+ + +AEE G P
Sbjct: 331 KANRKIGLGVMGWADMLFLLNIPYDSDEAIQLAEKVMKFIQEESHRASQQLAEERGVFPN 390
Query: 467 PK--------------AVTTVKPSGTLSKIMDTTEGV 489
K TT+ P+GT+S I + G+
Sbjct: 391 WKGSVWEKKGIKMRNATTTTIAPTGTISIIAGCSSGI 427
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.317 0.134 4.71e+034
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 434,655,220
Number of Sequences: 1038431
Number of extensions: 18273350
Number of successful extensions: 41988
Number of sequences better than 5.0e-02: 14
Number of HSP's better than 0.1 without gapping: 8
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 41942
Number of HSP's gapped (non-prelim): 17
length of query: 671
length of database: 328,470,697
effective HSP length: 128
effective length of query: 543
effective length of database: 195,551,529
effective search space: 106184480247
effective search space used: 106184480247
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 95 (41.2 bits)
Query= orf33 [7022-7101]
(78 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_049698.1| (NC_000866) thioredoxin [Enterobacteria ph... 39 0.013
pdb|1DE1|A Chain A, Nmr Structures Of Oxidized Bacteriophag... 39 0.013
gb|EAA05108.1| (AAAB01008816) agCP6896 [Anopheles gambiae s... 39 0.017
ref|NP_569495.1| (NC_003385) putative glutaredoxin [Salmone... 38 0.022
ref|NP_233395.1| (NC_002506) glutaredoxin-related protein [... 37 0.049
ref|NP_246286.1| (NC_002663) unknown [Pasteurella multocida... 37 0.049
>ref|NP_049698.1| (NC_000866) thioredoxin [Enterobacteria phage T4]
sp|P00276|THIO_BPT4 Thioredoxin
pir||TXBPT4 thioredoxin [validated] - phage T4
pdb|1AAZ|A Chain A, Glutaredoxin
pdb|1AAZ|B Chain B, Glutaredoxin
gb|AAA32529.1| (M13894) thioredoxin [Enterobacteria phage T4]
gb|AAD42626.1|AF158101_213 (AF158101) thioredoxin [Enterobacteria phage T4]
Length = 87
Score = 38.9 bits (89), Expect = 0.013
Identities = 23/72 (31%), Positives = 37/72 (50%), Gaps = 11/72 (15%)
Query: 12 CKYCTAALGLLESLGHKVEVYDVQ-------DENMSLVVNML----RVTNTVPQLYAPDG 60
C YC A LL E ++ DE ++ ++ L ++ T+PQ++APDG
Sbjct: 14 CVYCDNAKRLLTVKKQPFEFINIMPEKGVFDDEKIAELLTKLGRDTQIGLTMPQVFAPDG 73
Query: 61 QFIGGYTELKEY 72
IGG+ +L+EY
Sbjct: 74 SHIGGFDQLREY 85
>pdb|1DE1|A Chain A, Nmr Structures Of Oxidized Bacteriophage T4 Glutaredoxin
pdb|1DE2|A Chain A, Nmr Structures Of Reduced Bacteriophage T4 Glutaredoxin
Length = 87
Score = 38.9 bits (89), Expect = 0.013
Identities = 23/72 (31%), Positives = 37/72 (50%), Gaps = 11/72 (15%)
Query: 12 CKYCTAALGLLESLGHKVEVYDVQ-------DENMSLVVNML----RVTNTVPQLYAPDG 60
C YC A LL E ++ DE ++ ++ L ++ T+PQ++APDG
Sbjct: 14 CVYCDNAKRLLTVKKQPFEFINIMPEKGVFDDEKIAELLTKLGRDTQIGLTMPQVFAPDG 73
Query: 61 QFIGGYTELKEY 72
IGG+ +L+EY
Sbjct: 74 SHIGGFDQLREY 85
>gb|EAA05108.1| (AAAB01008816) agCP6896 [Anopheles gambiae str. PEST]
Length = 122
Score = 38.5 bits (88), Expect = 0.017
Identities = 21/77 (27%), Positives = 39/77 (50%), Gaps = 5/77 (6%)
Query: 3 KYIMLTQDNCKYCTAALGLLESLGHKVEVYDVQDEN----MSLVVNMLRVTNTVPQLYAP 58
K ++ ++ C YCT A + L + Y++ N + V+ L TVP+++
Sbjct: 39 KVVIFSKTYCPYCTMAKEPFKKLNQEYACYELDKRNDGDEIQSVLGELTGARTVPRVFI- 97
Query: 59 DGQFIGGYTELKEYLND 75
G F+GG T++K+ +D
Sbjct: 98 GGNFVGGGTDIKKMYDD 114
>ref|NP_569495.1| (NC_003385) putative glutaredoxin [Salmonella enterica subsp.
enterica serovar Typhi]
emb|CAD09890.1| (AL513384) putative glutaredoxin [Salmonella enterica subsp.
enterica serovar Typhi]
Length = 81
Score = 38.1 bits (87), Expect = 0.022
Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 12/75 (16%)
Query: 3 KYIMLTQDNCKYCTAALGLLESL-GHKVEVYD--------VQDENMSLVVNMLRVTNTVP 53
K ++ +DNC YC A+ L + L GH Y+ + E +S +V+ + T+P
Sbjct: 2 KVVIYGRDNCSYCKRAVELAKQLRGHGFGDYEYIDIVSAGIDKEKLSYLVS--KPVETIP 59
Query: 54 QLYAPDGQFIGGYTE 68
Q++ +G+ IGGY E
Sbjct: 60 QVFV-NGESIGGYEE 73
>ref|NP_233395.1| (NC_002506) glutaredoxin-related protein [Vibrio cholerae]
pir||H82389 glutaredoxin-related protein VCA1011 [imported] - Vibrio cholerae
(group O1 strain N16961)
gb|AAF96907.1| (AE004427) glutaredoxin-related protein [Vibrio cholerae]
Length = 212
Score = 37.0 bits (84), Expect = 0.049
Identities = 25/84 (29%), Positives = 43/84 (50%), Gaps = 9/84 (10%)
Query: 1 MQKYIMLTQDNCKYCTAALGLLESLGHKVEVYDVQDENMSLVVNMLRVTN--------TV 52
+Q I+ + C YC A LL++ G +DV E+ +L + V TV
Sbjct: 112 LQGNIVFGKATCPYCVKAKQLLDTAGIDYRYHDVVKESAALYRMIPEVKAIIGEKTPVTV 171
Query: 53 PQLYAPDGQFIGGYTELKEYLNDS 76
PQ++ +GQ+IGG L+++L ++
Sbjct: 172 PQIWL-NGQYIGGCDALEKWLQNN 194
>ref|NP_246286.1| (NC_002663) unknown [Pasteurella multocida]
gb|AAK03431.1| (AE006173) unknown [Pasteurella multocida]
Length = 241
Score = 37.0 bits (84), Expect = 0.049
Identities = 19/67 (28%), Positives = 37/67 (54%), Gaps = 1/67 (1%)
Query: 6 MLTQDNCKYCTAALGLLESLGHKVEVYDVQDENMSLVVNMLRVTNTVPQLYAPDGQFIGG 65
++T+ C YC A LL++ + E + + ++ V + +VPQ++ G++IGG
Sbjct: 174 VITKPGCPYCAKAKALLKAKDYTFEEIVLGRDASTISVRAITGKTSVPQVFI-GGKYIGG 232
Query: 66 YTELKEY 72
EL++Y
Sbjct: 233 SEELEKY 239
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.317 0.134 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 48,976,949
Number of Sequences: 1038431
Number of extensions: 1493397
Number of successful extensions: 3339
Number of sequences better than 5.0e-02: 6
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 3337
Number of HSP's gapped (non-prelim): 6
length of query: 78
length of database: 328,470,697
effective HSP length: 54
effective length of query: 24
effective length of database: 272,395,423
effective search space: 6537490152
effective search space used: 6537490152
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 84 (37.0 bits)
Query= orf34 [7095-7245]
(149 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_597931.1| (NC_002519) PhoH-like protein [Roseophage ... 281 1e-075
>ref|NP_597931.1| (NC_002519) PhoH-like protein [Roseophage SIO1]
Length = 384
Score = 281 bits (720), Expect = 1e-075
Identities = 137/149 (91%), Positives = 137/149 (91%)
Query: 1 MTVVRKQFDRALYEAYDTPARDALVGYLELKGHIIXXXXXXXXXXXXSQKGGYTYFNEVE 60
MTVVRKQFDRALYEAYDTPARDALVGYLELKGHII SQKGGYTYFNEVE
Sbjct: 1 MTVVRKQFDRALYEAYDTPARDALVGYLELKGHIIVNNEENFNVDVVSQKGGYTYFNEVE 60
Query: 61 VKTAWKGDWPTHWSEIRIPQRKQRLLDKHVDDGKSVLNFYIFRPDFKQAWRIKDTLLTQE 120
VKTAWKGDWPTHWSEIRIPQRKQRLLDKHVDDGKSVLNFYIFRPDFKQAWRIKDTLLTQE
Sbjct: 61 VKTAWKGDWPTHWSEIRIPQRKQRLLDKHVDDGKSVLNFYIFRPDFKQAWRIKDTLLTQE 120
Query: 121 SLKEAKGRYIQKGEKFFHIPFTQAELIKL 149
SLKEAKGRYIQKGEKFFHIPFTQAELIKL
Sbjct: 121 SLKEAKGRYIQKGEKFFHIPFTQAELIKL 149
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.322 0.139 4.71e+034
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 92,902,529
Number of Sequences: 1038431
Number of extensions: 3396265
Number of successful extensions: 5454
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 5453
Number of HSP's gapped (non-prelim): 1
length of query: 149
length of database: 328,470,697
effective HSP length: 110
effective length of query: 39
effective length of database: 214,243,287
effective search space: 8355488193
effective search space used: 8355488193
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 85 (37.4 bits)
Query= orf35 [7347-7480]
(132 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_597931.1| (NC_002519) PhoH-like protein [Roseophage ... 263 3e-070
ref|NP_569498.1| (NC_003385) putative PhoH-like protein [Sa... 95 1e-019
ref|NP_309293.1| (NC_002695) ATP-binding pho regulon compon... 95 2e-019
ref|NP_455620.1| (NC_003198) PhoH protein (phosphate starva... 95 2e-019
ref|NP_287027.1| (NC_002655) PhoB-dependent, ATP-binding ph... 95 2e-019
ref|NP_267240.1| (NC_002662) phosphate starvation inducible... 91 3e-018
ref|NP_405516.1| (NC_003143) putative phosphate starvation-... 91 3e-018
ref|NP_353389.1| (NC_003062) AGR_C_625p [Agrobacterium tume... 83 6e-016
ref|NP_268762.1| (NC_002737) phosphate starvation-induced p... 81 2e-015
ref|NP_518651.1| (NC_003295) PROBABLE PHOH-LIKE PROTEIN [Ra... 79 9e-015
ref|NP_298194.1| (NC_002488) ATP-binding protein [Xylella f... 79 9e-015
ref|NP_213930.1| (NC_000918) phosphate starvation-inducible... 79 9e-015
ref|NP_642777.1| (NC_003919) ATP-binding protein [Xanthomon... 76 8e-014
ref|NP_626771.1| (NC_003888) PhoH-like protein [Streptomyce... 75 2e-013
>ref|NP_597931.1| (NC_002519) PhoH-like protein [Roseophage SIO1]
Length = 384
Score = 263 bits (671), Expect = 3e-070
Identities = 132/132 (100%), Positives = 132/132 (100%)
Query: 1 MPWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHEL 60
MPWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHEL
Sbjct: 253 MPWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHEL 312
Query: 61 KMLLTRVGEGTTIVLNGDAQQSDLKEADGLSKVIHLAKKHQLPVPIIEFGVDDIVRSDIT 120
KMLLTRVGEGTTIVLNGDAQQSDLKEADGLSKVIHLAKKHQLPVPIIEFGVDDIVRSDIT
Sbjct: 313 KMLLTRVGEGTTIVLNGDAQQSDLKEADGLSKVIHLAKKHQLPVPIIEFGVDDIVRSDIT 372
Query: 121 AMWVRTFLKEGL 132
AMWVRTFLKEGL
Sbjct: 373 AMWVRTFLKEGL 384
>ref|NP_569498.1| (NC_003385) putative PhoH-like protein [Salmonella enterica subsp.
enterica serovar Typhi]
emb|CAD09893.1| (AL513384) putative PhoH-like protein [Salmonella enterica subsp.
enterica serovar Typhi]
Length = 277
Score = 95.1 bits (235), Expect = 1e-019
Identities = 52/122 (42%), Positives = 75/122 (60%), Gaps = 5/122 (4%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKN--GNIEMAPLALMRGRSFDNAFIIVDETQNISTHE 59
P+ PV DVL+K LG +E +K +E+AP A MRGR+F+NA +I+DE QN++ +
Sbjct: 148 PFFRPVYDVLQKRLGGSFLEYCLKPEVAKVEIAPFAYMRGRTFENAVVILDEAQNVTASQ 207
Query: 60 LKMLLTRVGEGTTIVLNGDAQQSDL--KEADGLSKVIHLAKKHQLPVPIIEFGVDDIVRS 117
+KM LTR+GE T+++NGD Q DL GL + K V +IEF +D VRS
Sbjct: 208 MKMFLTRMGENVTVIVNGDVTQCDLPGNVKSGLEDALERFKPSS-HVGLIEFSTEDCVRS 266
Query: 118 DI 119
++
Sbjct: 267 EL 268
>ref|NP_309293.1| (NC_002695) ATP-binding pho regulon component PhoH [Escherichia
coli O157:H7]
ref|NP_415539.1| (NC_000913) PhoB-dependent, ATP-binding pho regulon component; may
be helicase; induced by P starvation [Escherichia coli
K12]
sp|P31544|PHOH_ECOLI PhoH protein (Phosphate starvation-inducible protein psiH)
pir||B47065 phosphate starvation-inducible protein psiH - Escherichia coli
dbj|BAA01230.1| (D10391) PhoH protein [Escherichia coli]
dbj|BAA35801.1| (D90739) PhoH protein (phosphate starvation-inducible protein
PsiH). [Escherichia coli]
gb|AAC74105.1| (AE000204) PhoB-dependent, ATP-binding pho regulon component; may
be helicase; induced by P starvation [Escherichia coli
K12]
dbj|BAB34689.1| (AP002554) ATP-binding pho regulon component PhoH [Escherichia coli
O157:H7]
Length = 354
Score = 94.7 bits (234), Expect = 2e-019
Identities = 50/120 (41%), Positives = 77/120 (63%), Gaps = 5/120 (4%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKN--GNIEMAPLALMRGRSFDNAFIIVDETQNISTHE 59
P+ PV DVL + LG ++ ++ G +E+AP A MRGR+F+NA +I+DE QN++ +
Sbjct: 225 PYFRPVYDVLVRRLGASFMQYCLRPEIGKVEIAPFAYMRGRTFENAVVILDEAQNVTAAQ 284
Query: 60 LKMLLTRVGEGTTIVLNGDAQQSDLKE--ADGLSKVIHLAKKHQLPVPIIEFGVDDIVRS 117
+KM LTR+GE T+++NGD Q DL GLS + ++ ++ V I+ FG +D VRS
Sbjct: 285 MKMFLTRLGENVTVIVNGDITQCDLPRGVCSGLSDALERFEEDEM-VGIVRFGKEDCVRS 343
>ref|NP_455620.1| (NC_003198) PhoH protein (phosphate starvation-inducible protein
PsiH) [Salmonella enterica subsp. enterica serovar
Typhi]
ref|NP_460098.1| (NC_003197) PhoB-dependent, ATP-binding pho regulon component
[Salmonella typhimurium LT2]
gb|AAL20057.1| (AE008749) PhoB-dependent ATP-binding pho regulon component
[Salmonella typhimurium LT2]
emb|CAD08250.1| (AL627269) PhoH protein (phosphate starvation-inducible protein
PsiH) [Salmonella enterica subsp. enterica serovar
Typhi]
Length = 284
Score = 94.7 bits (234), Expect = 2e-019
Identities = 51/120 (42%), Positives = 76/120 (62%), Gaps = 5/120 (4%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKN--GNIEMAPLALMRGRSFDNAFIIVDETQNISTHE 59
P+ PV DVL K LG ++ ++ G +E+AP A MRGR+F+NA +I+DE QN++ +
Sbjct: 155 PYFRPVYDVLLKRLGASFMQYCLRPEIGKVEIAPFAYMRGRTFENAVVILDEAQNVTAAQ 214
Query: 60 LKMLLTRVGEGTTIVLNGDAQQSDLKEA--DGLSKVIHLAKKHQLPVPIIEFGVDDIVRS 117
+KM LTR+GE T+++NGD Q DL GLS + ++ ++ V I+ F DD VRS
Sbjct: 215 MKMFLTRLGENVTVIVNGDITQCDLPRGVRSGLSDALERFEEDEM-VGIVHFNKDDCVRS 273
>ref|NP_287027.1| (NC_002655) PhoB-dependent, ATP-binding pho regulon component; may
be helicase; induced by P starvation [Escherichia coli
O157:H7 EDL933]
gb|AAG55638.1|AE005302_1 (AE005302) PhoB-dependent, ATP-binding pho regulon component; may
be helicase; induced by P starvation [Escherichia coli
O157:H7 EDL933]
Length = 354
Score = 94.7 bits (234), Expect = 2e-019
Identities = 50/120 (41%), Positives = 77/120 (63%), Gaps = 5/120 (4%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKN--GNIEMAPLALMRGRSFDNAFIIVDETQNISTHE 59
P+ PV DVL + LG ++ ++ G +E+AP A MRGR+F+NA +I+DE QN++ +
Sbjct: 225 PYFRPVYDVLVRRLGASFMQYCLRPEIGKVEIAPFAYMRGRTFENAVVILDEAQNVTAAQ 284
Query: 60 LKMLLTRVGEGTTIVLNGDAQQSDLKE--ADGLSKVIHLAKKHQLPVPIIEFGVDDIVRS 117
+KM LTR+GE T+++NGD Q DL GLS + ++ ++ V I+ FG +D VRS
Sbjct: 285 MKMFLTRLGENVTVIVNGDITQCDLPRGVCSGLSDALERFEEDEM-VGIVRFGKEDCVRS 343
>ref|NP_267240.1| (NC_002662) phosphate starvation inducible protein [Lactococcus
lactis subsp. lactis]
gb|AAK05182.1|AE006341_1 (AE006341) phosphate starvation inducible protein [Lactococcus
lactis subsp. lactis]
Length = 322
Score = 90.5 bits (223), Expect = 3e-018
Identities = 50/132 (37%), Positives = 76/132 (56%), Gaps = 11/132 (8%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHELK 61
P+ PV D L + LGK A E ++ G IE+APLA MRGR+ D+AF+I+DE QN +T ++K
Sbjct: 193 PYLRPVYDALFQILGKTATERMMERGIIEIAPLAYMRGRTLDDAFVILDEAQNTTTMQMK 252
Query: 62 MLLTRVGEGTTIVLNGDAQQSDLKE------ADGLSKVIHLAKKHQLPVPIIEFGVDDIV 115
M LTR+G + +++NGD Q DL D K+ H+ + + F D+V
Sbjct: 253 MFLTRLGFNSKMIVNGDISQIDLPNKVKSGLIDAKDKLSHVKS-----IDFVYFMAKDVV 307
Query: 116 RSDITAMWVRTF 127
R + A ++ +
Sbjct: 308 RHPVVAEIIKAY 319
>ref|NP_405516.1| (NC_003143) putative phosphate starvation-inducible protein
[Yersinia pestis]
ref|NP_669660.1| (NC_004088) PhoB-dependent, ATP-binding pho regulon component
[Yersinia pestis KIM]
pir||T47002 hypothetical protein [imported] - Yersinia pestis
emb|CAA21345.1| (AL031866) ORF22, len: 262 aa, phoH, highly similar to phosphate
starvation-inducible protein phoH, from E. coli
ECOPHOH_2, (354 aa), 91.2% identity in 262 aa overlap.
Fasta scores: opt: 1567, E(): 0. [Yersinia pestis]
emb|CAC90772.1| (AJ414150) putative phosphate starvation-inducible protein
[Yersinia pestis]
gb|AAM85911.1|AE013838_2 (AE013838) PhoB-dependent, ATP-binding pho regulon component
[Yersinia pestis KIM]
Length = 262
Score = 90.5 bits (223), Expect = 3e-018
Identities = 48/120 (40%), Positives = 74/120 (61%), Gaps = 5/120 (4%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKN--GNIEMAPLALMRGRSFDNAFIIVDETQNISTHE 59
P+ PV D+L + LG ++ ++ G +E+AP A MRGR+F+NA +I+DE QN++ +
Sbjct: 133 PYFRPVYDILLRRLGSSFMQYCLRPEIGKVEIAPFAYMRGRTFENAVVILDEAQNVTASQ 192
Query: 60 LKMLLTRVGEGTTIVLNGDAQQSDLKEA--DGLSKVIHLAKKHQLPVPIIEFGVDDIVRS 117
+KM LTR+GE T+++NGD Q DL GLS + + ++ + II F D VRS
Sbjct: 193 MKMFLTRLGENVTVIVNGDITQCDLPRGVKSGLSDALERFTEDEM-IGIIRFDKQDCVRS 251
>ref|NP_353389.1| (NC_003062) AGR_C_625p [Agrobacterium tumefaciens] [Agrobacterium
tumefaciens str. C58 (Cereon)]
gb|AAK86174.1| (AE007973) AGR_C_625p [Agrobacterium tumefaciens str. C58 (Cereon)]
Length = 358
Score = 82.8 bits (203), Expect = 6e-016
Identities = 48/128 (37%), Positives = 73/128 (56%), Gaps = 3/128 (2%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHELK 61
P+ P+ D L + VE I+ G IE+APLA MRGR+ NA +I+DE QN +T ++K
Sbjct: 217 PYLRPLYDALYDMMPGDKVERAIQAGVIEIAPLAFMRGRTLANAAVILDEAQNTTTMQMK 276
Query: 62 MLLTRVGEGTTIVLNGDAQQSDLKEA--DGLSKVIHLAKKHQLPVPIIEFGVDDIVRSDI 119
M LTR+GE +++ GD Q DL GL + + + + + V ++ F D+VR +
Sbjct: 277 MFLTRLGENGRMIVTGDPSQVDLPRGVKSGLVEALQILPEVE-GVSVVRFKDVDVVRHPM 335
Query: 120 TAMWVRTF 127
A VR +
Sbjct: 336 VARIVRAY 343
>ref|NP_268762.1| (NC_002737) phosphate starvation-induced protein [Streptococcus
pyogenes] [Streptococcus pyogenes M1 GAS]
gb|AAK33483.1| (AE006506) phosphate starvation-induced protein [Streptococcus
pyogenes M1 GAS]
Length = 350
Score = 81.3 bits (199), Expect = 2e-015
Identities = 47/128 (36%), Positives = 73/128 (56%), Gaps = 3/128 (2%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHELK 61
P+ PV D L LGK ++ IE+APLA MRGR+ D+AF+I+DE QN + ++K
Sbjct: 188 PYLRPVYDALYHILGKEQTTRLMERDVIEIAPLAYMRGRTLDDAFVILDEAQNTTIMQMK 247
Query: 62 MLLTRVGEGTTIVLNGDAQQSDLKEADGLSKVIHLAKKHQ--LPVPIIEFGVDDIVRSDI 119
M LTR+G + +++NGD Q DL + S +I +K Q + + F D+VR +
Sbjct: 248 MFLTRLGFNSKMIVNGDTSQIDLPR-NVKSGLIDATQKLQGIKQIDFVYFSAKDVVRHPV 306
Query: 120 TAMWVRTF 127
A ++ +
Sbjct: 307 VADIIKAY 314
>ref|NP_518651.1| (NC_003295) PROBABLE PHOH-LIKE PROTEIN [Ralstonia solanacearum]
emb|CAD14058.1| (AL646059) PROBABLE PHOH-LIKE PROTEIN [Ralstonia solanacearum]
Length = 325
Score = 79.0 bits (193), Expect = 9e-015
Identities = 47/128 (36%), Positives = 71/128 (54%), Gaps = 3/128 (2%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHELK 61
P+ P+ D L LG + + IE+APLA MRGR+ ++AFII+DE QN + ++K
Sbjct: 189 PYLRPLYDALYDLLGFDKTQKMFERQMIEIAPLAYMRGRTLNHAFIILDEAQNTTPEQMK 248
Query: 62 MLLTRVGEGTTIVLNGDAQQSDLK--EADGLSKVIHLAKKHQLPVPIIEFGVDDIVRSDI 119
M LTR+G G+ V+ GD Q DL + GL + H+ + + V + F D+VR +
Sbjct: 249 MFLTRIGFGSKAVITGDTTQIDLPRGQKSGLIEAQHVLRDVR-GVSMTRFSSVDVVRHPL 307
Query: 120 TAMWVRTF 127
A V +
Sbjct: 308 VARIVEAY 315
>ref|NP_298194.1| (NC_002488) ATP-binding protein [Xylella fastidiosa 9a5c]
pir||D82747 ATP-binding protein XF0904 [imported] - Xylella fastidiosa (strain
9a5c)
gb|AAF83714.1|AE003930_4 (AE003930) ATP-binding protein [Xylella fastidiosa 9a5c]
Length = 327
Score = 79.0 bits (193), Expect = 9e-015
Identities = 49/131 (37%), Positives = 73/131 (55%), Gaps = 5/131 (3%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHELK 61
P+ P+ D L + LG V ++ IE+APLA MRGR+ +NAF+I+DE QN + ++K
Sbjct: 188 PYLRPLYDALYEMLGVEKVLKLLEKNIIEIAPLAYMRGRTLNNAFVILDEAQNTTIEQMK 247
Query: 62 MLLTRVGEGTTIVLNGDAQQSDLKE--ADGLSKVIHLAKKHQLP-VPIIEFGVDDIVRSD 118
M LTR+G G+T V+ GD Q D+ + GL I + H + + F D+VR
Sbjct: 248 MFLTRLGFGSTAVVTGDLTQIDIPKHLKSGLDDAIEVL--HDIEGISFTFFKSHDVVRHP 305
Query: 119 ITAMWVRTFLK 129
+ A V + K
Sbjct: 306 LVARIVSAYDK 316
>ref|NP_213930.1| (NC_000918) phosphate starvation-inducible protein [Aquifex
aeolicus]
pir||C70417 phosphate starvation-inducible protein - Aquifex aeolicus
gb|AAC07324.1| (AE000736) phosphate starvation-inducible protein [Aquifex
aeolicus]
Length = 348
Score = 79.0 bits (193), Expect = 9e-015
Identities = 46/128 (35%), Positives = 71/128 (54%), Gaps = 3/128 (2%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHELK 61
P+ P+ D L + ++ IE+APLA MRGR+ ++AFII+DE QN + ++K
Sbjct: 198 PYLTPLYDALYDMVDYDKANYMLERNIIEIAPLAFMRGRTLNDAFIILDEAQNATRDQMK 257
Query: 62 MLLTRVGEGTTIVLNGDAQQSDL--KEADGLSKVIHLAKKHQLPVPIIEFGVDDIVRSDI 119
M LTR+G G+ V+ GD Q DL K+ GL + + + K + + I F D+VR I
Sbjct: 258 MFLTRIGFGSKAVITGDVTQIDLPKKKESGLIEAMEVLKNIE-GIEFIFFKQTDVVRHPI 316
Query: 120 TAMWVRTF 127
A + +
Sbjct: 317 VAKIINAY 324
>ref|NP_642777.1| (NC_003919) ATP-binding protein [Xanthomonas axonopodis pv. citri
str. 306]
gb|AAM37313.1| (AE011883) ATP-binding protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 328
Score = 75.9 bits (185), Expect = 8e-014
Identities = 47/133 (35%), Positives = 72/133 (53%), Gaps = 3/133 (2%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHELK 61
P+ P+ D L + LG V ++ IE+APLA MRGR+ ++A++I+DE QN + ++K
Sbjct: 188 PYLRPLYDALYEMLGVEKVVKLLERNVIEIAPLAYMRGRTLNDAYVILDEAQNTTIEQMK 247
Query: 62 MLLTRVGEGTTIVLNGDAQQSDLKE--ADGLSKVIHLAKKHQLPVPIIEFGVDDIVRSDI 119
M LTR+G G+T V+ GD Q DL + GL I + + + F D+VR +
Sbjct: 248 MFLTRLGFGSTAVVTGDLTQIDLPKHVKSGLRDAIEVLRDVE-GTSFTFFEARDVVRHPL 306
Query: 120 TAMWVRTFLKEGL 132
V + K L
Sbjct: 307 VQRIVTAYDKRDL 319
>ref|NP_626771.1| (NC_003888) PhoH-like protein [Streptomyces coelicolor A3(2)]
emb|CAB66426.1| (AL136534) PhoH-like protein [Streptomyces coelicolor A3(2)]
Length = 359
Score = 74.7 bits (182), Expect = 2e-013
Identities = 38/83 (45%), Positives = 53/83 (63%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHELK 61
P+ P+ D L L ++ + G IE+APLA MRGR+ ++AFII+DE QN S ++K
Sbjct: 204 PYLRPLYDALHDMLDPDSIPRLMAAGTIEVAPLAYMRGRTLNDAFIILDEAQNTSPEQMK 263
Query: 62 MLLTRVGEGTTIVLNGDAQQSDL 84
M LTR+G + IV+ GD Q DL
Sbjct: 264 MFLTRLGFESKIVITGDVTQVDL 286
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.319 0.138 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 82,851,333
Number of Sequences: 1038431
Number of extensions: 3023404
Number of successful extensions: 7733
Number of sequences better than 5.0e-02: 76
Number of HSP's better than 0.1 without gapping: 76
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 7656
Number of HSP's gapped (non-prelim): 76
length of query: 132
length of database: 328,470,697
effective HSP length: 108
effective length of query: 24
effective length of database: 216,320,149
effective search space: 5191683576
effective search space used: 5191683576
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 84 (37.0 bits)
Query= orf36 [7480-7543]
(62 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.325 0.141 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 41,981,966
Number of Sequences: 1038431
Number of extensions: 1250408
Number of successful extensions: 3232
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3232
Number of HSP's gapped (non-prelim): 0
length of query: 62
length of database: 328,470,697
effective HSP length: 38
effective length of query: 24
effective length of database: 289,010,319
effective search space: 6936247656
effective search space used: 6936247656
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 85 (37.4 bits)
Query= orf37 [7562-7609]
(46 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.316 0.130 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 18,549,286
Number of Sequences: 1038431
Number of extensions: 283691
Number of successful extensions: 477
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 477
Number of HSP's gapped (non-prelim): 0
length of query: 46
length of database: 328,470,697
effective HSP length: 22
effective length of query: 24
effective length of database: 305,625,215
effective search space: 7335005160
effective search space used: 7335005160
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 85 (37.4 bits)
Query= orf38 [7610-7695]
(84 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.314 0.131 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 62,925,772
Number of Sequences: 1038431
Number of extensions: 2423534
Number of successful extensions: 3596
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3596
Number of HSP's gapped (non-prelim): 0
length of query: 84
length of database: 328,470,697
effective HSP length: 60
effective length of query: 24
effective length of database: 266,164,837
effective search space: 6387956088
effective search space used: 6387956088
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf39 [7691-7778]
(86 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.320 0.129 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 39,381,868
Number of Sequences: 1038431
Number of extensions: 1002711
Number of successful extensions: 3736
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3736
Number of HSP's gapped (non-prelim): 0
length of query: 86
length of database: 328,470,697
effective HSP length: 62
effective length of query: 24
effective length of database: 264,087,975
effective search space: 6338111400
effective search space used: 6338111400
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf40 [7776-7889]
(112 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064759.1| (NC_002519) gp22 [Roseophage SIO1] >gi|994... 187 3e-047
ref|XP_122285.1| (XM_122285) similar to hypothetical protei... 37 0.032
ref|NP_067468.1| (NM_021493) hypothetical protein, MNCb-130... 37 0.032
>ref|NP_064759.1| (NC_002519) gp22 [Roseophage SIO1]
gb|AAG02604.1|AF189021_24 (AF189021) Roseophage SIO1 complete genome
Length = 112
Score = 187 bits (474), Expect = 3e-047
Identities = 96/112 (85%), Positives = 96/112 (85%)
Query: 1 MVRFIVCLSLVLFLGGCLNPMSLLGGGGPNVAANVQAGAENNQTGVQVGDITKAETVYSG 60
MVRFIVCLSLVLFLGGCLNPMSLLGGGGPNVAANVQAGAENNQTGVQVGDITKAETVYSG
Sbjct: 1 MVRFIVCLSLVLFLGGCLNPMSLLGGGGPNVAANVQAGAENNQTGVQVGDITKAETVYSG 60
Query: 61 VAPSGSVGSLNISNQDXXXXXXXXXXXXXXXXSPQEIWRGFLKTITLGKYRG 112
VAPSGSVGSLNISNQD SPQEIWRGFLKTITLGKYRG
Sbjct: 61 VAPSGSVGSLNISNQDIPIWVILLLILGWVLPSPQEIWRGFLKTITLGKYRG 112
>ref|XP_122285.1| (XM_122285) similar to hypothetical protein, MNCb-1301 [Mus
musculus]
Length = 182
Score = 37.4 bits (85), Expect = 0.032
Identities = 18/52 (34%), Positives = 28/52 (53%)
Query: 19 NPMSLLGGGGPNVAANVQAGAENNQTGVQVGDITKAETVYSGVAPSGSVGSL 70
N LGGGGP + V+ ++N +G+ + TKA + PSG +G+L
Sbjct: 114 NEWKELGGGGPQESVGVRPHSDNKDSGLSSLESTKARASSAASLPSGDLGAL 165
>ref|NP_067468.1| (NM_021493) hypothetical protein, MNCb-1301 [Mus musculus]
ref|XP_109225.1| (XM_109225) similar to hypothetical protein, MNCb-1301 [Mus
musculus]
dbj|BAA95056.1| (AB041572) unnamed protein product [Mus musculus]
Length = 325
Score = 37.4 bits (85), Expect = 0.032
Identities = 18/52 (34%), Positives = 28/52 (53%)
Query: 19 NPMSLLGGGGPNVAANVQAGAENNQTGVQVGDITKAETVYSGVAPSGSVGSL 70
N LGGGGP + V+ ++N +G+ + TKA + PSG +G+L
Sbjct: 257 NEWKELGGGGPQESVGVRPHSDNKDSGLSSLESTKARASSAASLPSGDLGAL 308
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.317 0.137 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 63,309,012
Number of Sequences: 1038431
Number of extensions: 2108985
Number of successful extensions: 5071
Number of sequences better than 5.0e-02: 3
Number of HSP's better than 0.1 without gapping: 3
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 5067
Number of HSP's gapped (non-prelim): 3
length of query: 112
length of database: 328,470,697
effective HSP length: 88
effective length of query: 24
effective length of database: 237,088,769
effective search space: 5690130456
effective search space used: 5690130456
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 84 (37.0 bits)
Query= orf41 [7888-7952]
(63 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.311 0.131 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 35,769,284
Number of Sequences: 1038431
Number of extensions: 1036503
Number of successful extensions: 1809
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 1809
Number of HSP's gapped (non-prelim): 0
length of query: 63
length of database: 328,470,697
effective HSP length: 39
effective length of query: 24
effective length of database: 287,971,888
effective search space: 6911325312
effective search space used: 6911325312
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 85 (37.4 bits)
Query= orf42 [7946-8147]
(200 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064760.1| (NC_002519) gp23 [Roseophage SIO1] >gi|994... 391 e-108
sp|P71410|HTR2_HALSA Sensory rhodopsin II transducer (HTR-I... 39 0.046
ref|NP_280509.1| (NC_002607) Htr2 transducer; Htr2 [Halobac... 39 0.046
>ref|NP_064760.1| (NC_002519) gp23 [Roseophage SIO1]
gb|AAG02605.1|AF189021_25 (AF189021) Roseophage SIO1 complete genome
Length = 200
Score = 391 bits (1005), Expect = e-108
Identities = 200/200 (100%), Positives = 200/200 (100%)
Query: 1 MASSWNLAAAQTSSAVEDVASSGTSLWQSVVDGAAEALASGKEAIIDVAEDVYDYLPEQE 60
MASSWNLAAAQTSSAVEDVASSGTSLWQSVVDGAAEALASGKEAIIDVAEDVYDYLPEQE
Sbjct: 1 MASSWNLAAAQTSSAVEDVASSGTSLWQSVVDGAAEALASGKEAIIDVAEDVYDYLPEQE 60
Query: 61 DVMDAAATGVEAAGSVLPAIMDTAQAAGEGLMGSGLSEFRFLGSNFFDAGGKFTEKDLNV 120
DVMDAAATGVEAAGSVLPAIMDTAQAAGEGLMGSGLSEFRFLGSNFFDAGGKFTEKDLNV
Sbjct: 61 DVMDAAATGVEAAGSVLPAIMDTAQAAGEGLMGSGLSEFRFLGSNFFDAGGKFTEKDLNV 120
Query: 121 SDISALGRAVKKAKAEGRSNVDYNDFGTAEGEVLKGGILAGVFDPDLRMARTVGGFKFEE 180
SDISALGRAVKKAKAEGRSNVDYNDFGTAEGEVLKGGILAGVFDPDLRMARTVGGFKFEE
Sbjct: 121 SDISALGRAVKKAKAEGRSNVDYNDFGTAEGEVLKGGILAGVFDPDLRMARTVGGFKFEE 180
Query: 181 DAEGNTILRNTYNFNEGPKT 200
DAEGNTILRNTYNFNEGPKT
Sbjct: 181 DAEGNTILRNTYNFNEGPKT 200
>sp|P71410|HTR2_HALSA Sensory rhodopsin II transducer (HTR-II) (Methyl-accepting
phototaxis protein II) (MPP-II)
pir||T44946 transducer protein htrII [validated] - Halobacterium salinarum
gb|AAC44369.1| (U62676) phototaxis transducer II [Halobacterium salinarum]
Length = 765
Score = 38.5 bits (88), Expect = 0.046
Identities = 45/165 (27%), Positives = 69/165 (41%), Gaps = 26/165 (15%)
Query: 13 SSAVEDVASSGTSLWQSVVDGAAEALASGKEAIIDVAEDVYDYLPEQE---DVMDAAATG 69
S++ E+VA + SL AA A+ G++A D E + D + E D MDA +
Sbjct: 510 SASAEEVAETVASL-ADTAGQAASAVDDGRQATEDAVETMDDVADDAEAAADAMDALDSE 568
Query: 70 VEAAGSVLPAIMDTA------------QAAGEGLMGSGLS----EFRFLGSNFFDAGGKF 113
+ G ++ I D A +AA G G G + E + L DA
Sbjct: 569 MADIGEIVDVIADIADQTNMLALNASIEAARTGADGDGFAVVADEVKTLAEESRDAAEDI 628
Query: 114 TEKDL----NVSDISALGRAVKKAKAEGRSNVDYNDFGTAEGEVL 154
+ L VSD++ RA ++GR+ V D TA +V+
Sbjct: 629 ESRLLALQGQVSDVADEMRATSDTVSDGRATV--GDAATALDDVV 671
>ref|NP_280509.1| (NC_002607) Htr2 transducer; Htr2 [Halobacterium sp. NRC-1]
sp|Q9HP81|HTR2_HALN1 Sensory rhodopsin II transducer (HTR-II) (Methyl-accepting
phototaxis protein II) (MPP-II)
gb|AAG19989.1| (AE005080) Htr2 transducer; Htr2 [Halobacterium sp. NRC-1]
Length = 764
Score = 38.5 bits (88), Expect = 0.046
Identities = 45/165 (27%), Positives = 69/165 (41%), Gaps = 26/165 (15%)
Query: 13 SSAVEDVASSGTSLWQSVVDGAAEALASGKEAIIDVAEDVYDYLPEQE---DVMDAAATG 69
S++ E+VA + SL AA A+ G++A D E + D + E D MDA +
Sbjct: 509 SASAEEVAETVASL-ADTAGQAASAVDDGRQATEDAVETMDDVADDAEAAADAMDALDSE 567
Query: 70 VEAAGSVLPAIMDTA------------QAAGEGLMGSGLS----EFRFLGSNFFDAGGKF 113
+ G ++ I D A +AA G G G + E + L DA
Sbjct: 568 MADIGEIVDVIADIADQTNMLALNASIEAARTGADGDGFAVVADEVKTLAEESRDAAEDI 627
Query: 114 TEKDL----NVSDISALGRAVKKAKAEGRSNVDYNDFGTAEGEVL 154
+ L VSD++ RA ++GR+ V D TA +V+
Sbjct: 628 ESRLLALQGQVSDVADEMRATSDTVSDGRATV--GDAATALDDVV 670
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.311 0.130 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 126,619,707
Number of Sequences: 1038431
Number of extensions: 5117447
Number of successful extensions: 10855
Number of sequences better than 5.0e-02: 3
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 10852
Number of HSP's gapped (non-prelim): 5
length of query: 200
length of database: 328,470,697
effective HSP length: 115
effective length of query: 85
effective length of database: 209,051,132
effective search space: 17769346220
effective search space used: 17769346220
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 88 (38.5 bits)
Query= orf43 [8146-8407]
(260 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064761.1| (NC_002519) gp24 [Roseophage SIO1] >gi|994... 384 e-106
>ref|NP_064761.1| (NC_002519) gp24 [Roseophage SIO1]
gb|AAG02606.1|AF189021_26 (AF189021) Roseophage SIO1 complete genome
Length = 247
Score = 384 bits (987), Expect = e-106
Identities = 197/247 (79%), Positives = 197/247 (79%)
Query: 14 MKKXXXXXXXXXXXXXXXXPQMEKLTESLNMTSAWNKGQTLSKDMPDYKPELLNLSSYQA 73
MKK PQMEKLTESLNMTSAWNKGQTLSKDMPDYKPELLNLSSYQA
Sbjct: 1 MKKVSISITVYLVVLILLIPQMEKLTESLNMTSAWNKGQTLSKDMPDYKPELLNLSSYQA 60
Query: 74 KASPSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRKVEAQSYDTLYGNFERQDTP 133
KASPS RKVEAQSYDTLYGNFERQDTP
Sbjct: 61 KASPSTGGATYQAGAGTTPTSATSGITALQPTSTTQTTLRKVEAQSYDTLYGNFERQDTP 120
Query: 134 FKDVSVSSMTIGELADFSRASGEYGRYVKPRLPKNTYAYKKGLTSTPMGKYQIVGSTLRD 193
FKDVSVSSMTIGELADFSRASGEYGRYVKPRLPKNTYAYKKGLTSTPMGKYQIVGSTLRD
Sbjct: 121 FKDVSVSSMTIGELADFSRASGEYGRYVKPRLPKNTYAYKKGLTSTPMGKYQIVGSTLRD 180
Query: 194 LTRRMNLPADTVFNKETQDKMFLFLAKEAVGGGKTSSEKRSRLRGIWEGFKHVDNNTLNK 253
LTRRMNLPADTVFNKETQDKMFLFLAKEAVGGGKTSSEKRSRLRGIWEGFKHVDNNTLNK
Sbjct: 181 LTRRMNLPADTVFNKETQDKMFLFLAKEAVGGGKTSSEKRSRLRGIWEGFKHVDNNTLNK 240
Query: 254 VIAEIEQ 260
VIAEIEQ
Sbjct: 241 VIAEIEQ 247
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.312 0.129 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 128,691,723
Number of Sequences: 1038431
Number of extensions: 4357237
Number of successful extensions: 6512
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 6510
Number of HSP's gapped (non-prelim): 1
length of query: 260
length of database: 328,470,697
effective HSP length: 119
effective length of query: 141
effective length of database: 204,897,408
effective search space: 28890534528
effective search space used: 28890534528
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 90 (39.3 bits)
Query= orf44 [8457-9043]
(585 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064762.1| (NC_002519) gp25 [Roseophage SIO1] >gi|994... 1030 0.0
>ref|NP_064762.1| (NC_002519) gp25 [Roseophage SIO1]
gb|AAG02607.1|AF189021_27 (AF189021) Roseophage SIO1 complete genome
Length = 564
Score = 1030 bits (2664), Expect = 0.0
Identities = 531/542 (97%), Positives = 531/542 (97%)
Query: 44 KLRAGSGPTQTDRNNAVFVENLRNVNQDLASGMSPDQVASKYGATFANLSLNSEQKAVLT 103
KLRAGSGPTQTDRNNAVFVENLRNVNQDLASGMSPDQVASKYGATFANLSLNSEQKAVLT
Sbjct: 23 KLRAGSGPTQTDRNNAVFVENLRNVNQDLASGMSPDQVASKYGATFANLSLNSEQKAVLT 82
Query: 104 KTIGEDIFYVPKQVETPVDTATEMFNNTDETVRLGLIDIETKKAEAEGETISQDVAAERA 163
KTIGEDIFYVPKQVETPVDTATEMFNNTDETVRLGLIDIETKKAEAEGETISQDVAAERA
Sbjct: 83 KTIGEDIFYVPKQVETPVDTATEMFNNTDETVRLGLIDIETKKAEAEGETISQDVAAERA 142
Query: 164 RNNYAAFQVAANAGILQGNIDFSTGFDQNMKTLDSFIKAVSAGLQVEQRGENLSLETLRQ 223
RNNYAAFQVAANAGILQGNIDFSTGFDQNMKTLDSFIKAVSAGLQVEQRGENLSLETLRQ
Sbjct: 143 RNNYAAFQVAANAGILQGNIDFSTGFDQNMKTLDSFIKAVSAGLQVEQRGENLSLETLRQ 202
Query: 224 LQDGFLLLKSNPSFQKPTGKAALEQWALMETRLQSIEDVFTRLEDYDAKGATAKAKQLMG 283
LQDGFLLLKSNPSFQKPTGKAALEQWALMETRLQSIEDVFTRLEDYDAKGATAKAKQLMG
Sbjct: 203 LQDGFLLLKSNPSFQKPTGKAALEQWALMETRLQSIEDVFTRLEDYDAKGATAKAKQLMG 262
Query: 284 VISLSGKSPLSALAAKDSDFMIKMXXXXXXXXXXXLANMGEIPEVDHSALNFDPVILELM 343
VISLSGKSPLSALAAKDSDFMIKM LANMGEIPEVDHSALNFDPVILELM
Sbjct: 263 VISLSGKSPLSALAAKDSDFMIKMAATIADDITADLANMGEIPEVDHSALNFDPVILELM 322
Query: 344 GVAPSGQQTGDVPEGGFDIPPSPFPTELGEKYEGMKIVEKAKFRAYHRSAITGLEAGSLD 403
GVAPSGQQTGDVPEGGFDIPPSPFPTELGEKYEGMKIVEKAKFRAYHRSAITGLEAGSLD
Sbjct: 323 GVAPSGQQTGDVPEGGFDIPPSPFPTELGEKYEGMKIVEKAKFRAYHRSAITGLEAGSLD 382
Query: 404 SPEAINAYASSVTALSYDLTQNDKPSSQNFDVLFSNRNIARVNALQAQGGEAGRVAGNLR 463
SPEAINAYASSVTALSYDLTQNDKPSSQNFDVLFSNRNIARVNALQAQGGEAGRVAGNLR
Sbjct: 383 SPEAINAYASSVTALSYDLTQNDKPSSQNFDVLFSNRNIARVNALQAQGGEAGRVAGNLR 442
Query: 464 AQMGAALQHNQARYGVLAAGLVSRIPQVKIDETNGKFTLDVKDDPVLQEIAAVVSVYYGG 523
AQMGAALQHNQARYGVLAAGLVSRIPQVKIDETNGKFTLDVKDDPVLQEIAAVVSVYYGG
Sbjct: 443 AQMGAALQHNQARYGVLAAGLVSRIPQVKIDETNGKFTLDVKDDPVLQEIAAVVSVYYGG 502
Query: 524 DFEALWAEGGTAKISLKNRLAREGKITPDSPEFERFDAATKVLESSLWKGLASRYNTVRG 583
DFEALWAEGGTAKISLKNRLAREGKITPDSPEFERFDAATKVLESSLWKGLASRYNTVRG
Sbjct: 503 DFEALWAEGGTAKISLKNRLAREGKITPDSPEFERFDAATKVLESSLWKGLASRYNTVRG 562
Query: 584 IP 585
IP
Sbjct: 563 IP 564
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.313 0.130 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 332,591,558
Number of Sequences: 1038431
Number of extensions: 13165893
Number of successful extensions: 27407
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 27406
Number of HSP's gapped (non-prelim): 1
length of query: 585
length of database: 328,470,697
effective HSP length: 127
effective length of query: 458
effective length of database: 196,589,960
effective search space: 90038201680
effective search space used: 90038201680
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 94 (40.8 bits)
Query= orf45 [9043-10528]
(1484 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064763.1| (NC_002519) gp26 [Roseophage SIO1] >gi|994... 2839 0.0
ref|NP_640308.1| (NC_003907) hypothetical protein [Vibrioph... 49 4e-004
ref|NP_415890.1| (NC_000913) putative membrane protein [Esc... 43 0.027
sp|P76072|STFR_ECOLI Side tail fiber protein homolog from l... 43 0.027
>ref|NP_064763.1| (NC_002519) gp26 [Roseophage SIO1]
gb|AAG02609.1|AF189021_29 (AF189021) Roseophage SIO1 complete genome
Length = 1484
Score = 2839 bits (7360), Expect = 0.0
Identities = 1464/1484 (98%), Positives = 1464/1484 (98%)
Query: 1 MDSIASFXXXXXXXXXXXXQSKSKPIDPNSPLEIETVEAQAYLLDEDPQVISETRANQDF 60
MDSIASF QSKSKPIDPNSPLEIETVEAQAYLLDEDPQVISETRANQDF
Sbjct: 1 MDSIASFEELEGGLEEETTQSKSKPIDPNSPLEIETVEAQAYLLDEDPQVISETRANQDF 60
Query: 61 THEGLAKQYPDLTRYLDTLYEAGVSVEEAGRLAREHVERKAIAVAPQEFIFSSMLMVDDE 120
THEGLAKQYPDLTRYLDTLYEAGVSVEEAGRLAREHVERKAIAVAPQEFIFSSMLMVDDE
Sbjct: 61 THEGLAKQYPDLTRYLDTLYEAGVSVEEAGRLAREHVERKAIAVAPQEFIFSSMLMVDDE 120
Query: 121 TVNPESLRVLTNYEKISSRISKRLEENDPSTFKWLTAGALNTARDFTVGVLEMAIRRDSS 180
TVNPESLRVLTNYEKISSRISKRLEENDPSTFKWLTAGALNTARDFTVGVLEMAIRRDSS
Sbjct: 121 TVNPESLRVLTNYEKISSRISKRLEENDPSTFKWLTAGALNTARDFTVGVLEMAIRRDSS 180
Query: 181 LSQKYADSLFMEEDEFNDFWDKEIADAEAKGLFNIREYESLKELQALVDNFGTDTDAGFN 240
LSQKYADSLFMEEDEFNDFWDKEIADAEAKGLFNIREYESLKELQALVDNFGTDTDAGFN
Sbjct: 181 LSQKYADSLFMEEDEFNDFWDKEIADAEAKGLFNIREYESLKELQALVDNFGTDTDAGFN 240
Query: 241 QLLALADIATLGGTKTVGRLASAGVKKAATGTRSVVSDLLASKTVSEAVTATKGIAEGGK 300
QLLALADIATLGGTKTVGRLASAGVKKAATGTRSVVSDLLASKTVSEAVTATKGIAEGGK
Sbjct: 241 QLLALADIATLGGTKTVGRLASAGVKKAATGTRSVVSDLLASKTVSEAVTATKGIAEGGK 300
Query: 301 ATVKQLNSARPSPAVAYKAGPSTMDPKPSVNTPSAAVVTEGTKRSMLFEKMAEMLSSPFA 360
ATVKQLNSARPSPAVAYKAGPSTMDPKPSVNTPSAAVVTEGTKRSMLFEKMAEMLSSPFA
Sbjct: 301 ATVKQLNSARPSPAVAYKAGPSTMDPKPSVNTPSAAVVTEGTKRSMLFEKMAEMLSSPFA 360
Query: 361 GKTFTTKSLAEATTEVADRLVAQSTNAFVKVSRRRAEGSDNYIYTALLGKADTGKPFTTK 420
GKTFTTKSLAEATTEVADRLVAQSTNAFVKVSRRRAEGSDNYIYTALLGKADTGKPFTTK
Sbjct: 361 GKTFTTKSLAEATTEVADRLVAQSTNAFVKVSRRRAEGSDNYIYTALLGKADTGKPFTTK 420
Query: 421 AAAMKAVKDDPRYKAVRLNSDRTKAFGVDEDKRGWYLEYSERVDTSRLATEIEDVNVEEG 480
AAAMKAVKDDPRYKAVRLNSDRTKAFGVDEDKRGWYLEYSERVDTSRLATEIEDVNVEEG
Sbjct: 421 AAAMKAVKDDPRYKAVRLNSDRTKAFGVDEDKRGWYLEYSERVDTSRLATEIEDVNVEEG 480
Query: 481 FLKRSAASLFSAGQTALGPRLGFMLNAAEGLVARVSKEADVAFKDISKLSKGESEELNKI 540
FLKRSAASLFSAGQTALGPRLGFMLNAAEGLVARVSKEADVAFKDISKLSKGESEELNKI
Sbjct: 481 FLKRSAASLFSAGQTALGPRLGFMLNAAEGLVARVSKEADVAFKDISKLSKGESEELNKI 540
Query: 541 ITSYRDSPLGDTELDLAAQRGAPSSAKFEQDFMAVNGRVPSEQQMKAYRALVDFNNSSWN 600
ITSYRDSPLGDTELDLAAQRGAPSSAKFEQDFMAVNGRVPSEQQMKAYRALVDFNNSSWN
Sbjct: 541 ITSYRDSPLGDTELDLAAQRGAPSSAKFEQDFMAVNGRVPSEQQMKAYRALVDFNNSSWN 600
Query: 601 VKATEILKKVTERGGWTVDVSEGYTTIGVPVTVADDAVVFSRLQGSVRGSAVGDRIVYKL 660
VKATEILKKVTERGGWTVDVSEGYTTIGVPVTVADDAVVFSRLQGSVRGSAVGDRIVYKL
Sbjct: 601 VKATEILKKVTERGGWTVDVSEGYTTIGVPVTVADDAVVFSRLQGSVRGSAVGDRIVYKL 660
Query: 661 DEPFEDANGNFFEYVTDVADARVPQKSDVLGYNFGGSRNNETTNFFVGTLFDVEFAXXXX 720
DEPFEDANGNFFEYVTDVADARVPQKSDVLGYNFGGSRNNETTNFFVGTLFDVEFA
Sbjct: 661 DEPFEDANGNFFEYVTDVADARVPQKSDVLGYNFGGSRNNETTNFFVGTLFDVEFAGGKK 720
Query: 721 XXXXFRSLLGSYSSKEATSAARELNNIQETIGGFLKATGLKSISKLELSGEDLDRVNDII 780
FRSLLGSYSSKEATSAARELNNIQETIGGFLKATGLKSISKLELSGEDLDRVNDII
Sbjct: 721 GKGGFRSLLGSYSSKEATSAARELNNIQETIGGFLKATGLKSISKLELSGEDLDRVNDII 780
Query: 781 SRNNKWNPSVENFDDLKDLSQRHKESFANKFEVKRRDQQVDTEITDGYGLSVGDYQSRRV 840
SRNNKWNPSVENFDDLKDLSQRHKESFANKFEVKRRDQQVDTEITDGYGLSVGDYQSRRV
Sbjct: 781 SRNNKWNPSVENFDDLKDLSQRHKESFANKFEVKRRDQQVDTEITDGYGLSVGDYQSRRV 840
Query: 841 ARKRGDAPLMEYGGGRVSNQDPITNILEQFQSTAYRYTHSKATQAAVNGWVDKARRMGNV 900
ARKRGDAPLMEYGGGRVSNQDPITNILEQFQSTAYRYTHSKATQAAVNGWVDKARRMGNV
Sbjct: 841 ARKRGDAPLMEYGGGRVSNQDPITNILEQFQSTAYRYTHSKATQAAVNGWVDKARRMGNV 900
Query: 901 TFDGPVPSDPNDFLRRAKIKGSEGVDADMAEQQAVIKRRMGLEERTDKENNFFIRMGQHI 960
TFDGPVPSDPNDFLRRAKIKGSEGVDADMAEQQAVIKRRMGLEERTDKENNFFIRMGQHI
Sbjct: 901 TFDGPVPSDPNDFLRRAKIKGSEGVDADMAEQQAVIKRRMGLEERTDKENNFFIRMGQHI 960
Query: 961 YDEGVFMGKGKGIKTNPEDWLDGAAGRVRAMVFHMKMGIFNPDQLVLNASHVAQIMAISP 1020
YDEGVFMGKGKGIKTNPEDWLDGAAGRVRAMVFHMKMGIFNPDQLVLNASHVAQIMAISP
Sbjct: 961 YDEGVFMGKGKGIKTNPEDWLDGAAGRVRAMVFHMKMGIFNPDQLVLNASHVAQIMAISP 1020
Query: 1021 KAGLKATAAVPIIAQLMRKTSKAAAKDIDALYANGFAGMTKEELLATVKYMRESGRDIVG 1080
KAGLKATAAVPIIAQLMRKTSKAAAKDIDALYANGFAGMTKEELLATVKYMRESGRDIVG
Sbjct: 1021 KAGLKATAAVPIIAQLMRKTSKAAAKDIDALYANGFAGMTKEELLATVKYMRESGRDIVG 1080
Query: 1081 TSVLERSGTAFDPKKNAASEFLELGLTPYKMGELFGRIASVATAVVEHNAKKISDDVFSE 1140
TSVLERSGTAFDPKKNAASEFLELGLTPYKMGELFGRIASVATAVVEHNAKKISDDVFSE
Sbjct: 1081 TSVLERSGTAFDPKKNAASEFLELGLTPYKMGELFGRIASVATAVVEHNAKKISDDVFSE 1140
Query: 1141 AGLQYVANREQVLSFRMTSGQKGAYQEGAIMGLATQWMSYTNRFVDNILIGRDLTKAERA 1200
AGLQYVANREQVLSFRMTSGQKGAYQEGAIMGLATQWMSYTNRFVDNILIGRDLTKAERA
Sbjct: 1141 AGLQYVANREQVLSFRMTSGQKGAYQEGAIMGLATQWMSYTNRFVDNILIGRDLTKAERA 1200
Query: 1201 RMVGVNTVLFGTRGMGFSPKMTAALVAFGVDPEDENSTATLNAVKFGLFDFMLSQLVGED 1260
RMVGVNTVLFGTRGMGFSPKMTAALVAFGVDPEDENSTATLNAVKFGLFDFMLSQLVGED
Sbjct: 1201 RMVGVNTVLFGTRGMGFSPKMTAALVAFGVDPEDENSTATLNAVKFGLFDFMLSQLVGED 1260
Query: 1261 VSLGSRIAPMGGIVQQYSELFREDPLYATLGGPSAQIGFDTYKAIKATLETITGGHKQIA 1320
VSLGSRIAPMGGIVQQYSELFREDPLYATLGGPSAQIGFDTYKAIKATLETITGGHKQIA
Sbjct: 1261 VSLGSRIAPMGGIVQQYSELFREDPLYATLGGPSAQIGFDTYKAIKATLETITGGHKQIA 1320
Query: 1321 VEEFKVLMRNVKSVDIYAKVVELIETGEYRSKRRGLAGEFDEVGTGLAASVLGGATPMKV 1380
VEEFKVLMRNVKSVDIYAKVVELIETGEYRSKRRGLAGEFDEVGTGLAASVLGGATPMKV
Sbjct: 1321 VEEFKVLMRNVKSVDIYAKVVELIETGEYRSKRRGLAGEFDEVGTGLAASVLGGATPMKV 1380
Query: 1381 LNHYDAKDISYKEDAKFKDARRRIDTWATKALDLISTGEPDKMKQGRELYNDAMNLIEDG 1440
LNHYDAKDISYKEDAKFKDARRRIDTWATKALDLISTGEPDKMKQGRELYNDAMNLIEDG
Sbjct: 1381 LNHYDAKDISYKEDAKFKDARRRIDTWATKALDLISTGEPDKMKQGRELYNDAMNLIEDG 1440
Query: 1441 GFSEENQTKLYRAVVNLETMTDLVKRATGQSKASQLTAKASQGE 1484
GFSEENQTKLYRAVVNLETMTDLVKRATGQSKASQLTAKASQGE
Sbjct: 1441 GFSEENQTKLYRAVVNLETMTDLVKRATGQSKASQLTAKASQGE 1484
>ref|NP_640308.1| (NC_003907) hypothetical protein [Vibriophage VpV262]
gb|AAM28412.1| (AY095314) hypothetical protein [Vibriophage VpV262]
Length = 1536
Score = 49.3 bits (116), Expect = 4e-004
Identities = 88/373 (23%), Positives = 154/373 (40%), Gaps = 43/373 (11%)
Query: 988 VRAMVFHMKMGIFNPDQLVLNASHVAQIMAISP----KAGLKATAAVPIIAQLMRKTSKA 1043
++ +V H +G+FN QL AS ++ + P + TA + ++ ++ K
Sbjct: 987 MKTIVMHNLLGVFNIAQLYTQASAMSVALGKFPAKFAPRVINDTAWMHVLDNIV--DDKM 1044
Query: 1044 AAKDIDALYANGFAGMTKEELLATVKYMRESG--RDIVGTSVLERSGT-AFDPKKNAASE 1100
K + L+ NG A E AT R +G + S + R G+ + S
Sbjct: 1045 LGKVYNILF-NGRA--IDETTKATYDAFRRTGLKEAVFNNSDMARIGSHGLGVTRRIISN 1101
Query: 1101 FLELGLTPYKMGELFGRIASVATAV---VEHNAKKISDDVFSEAGLQYVANREQV----L 1153
L L Y+ GEL R A+ A + KK+ D+ L+ V N++ +
Sbjct: 1102 ADNLSLMLYRSGELAARRATFAAEFQNWMRTTGKKVPSDLELSEILEEV-NKDLLELGPA 1160
Query: 1154 SFRMTSGQKGAYQEGAIMGLATQWMSYTNRFVDNILIGRDL--------TKAERARMVGV 1205
+ G +G +MG+ATQ+M + ++ L G+ L T ++AR+
Sbjct: 1161 NRAYWQGGRGTGDVRQLMGVATQFMQVGAKTME--LAGKGLGGVNRGGFTNRQKARIFLS 1218
Query: 1206 NTVLFGTRGMGFSPKMTAALVA-FGVD--PEDENSTATLNAVKFGLFDFMLSQLVGEDVS 1262
+FG G+ +T A+ + G++ PE+ A V FG ++L+ DV
Sbjct: 1219 QLAMFGAAGVPLGGLITQAITSGLGIEQMPEEAAELANQGFVGFG------ARLMLGDVE 1272
Query: 1263 LGSRIAPMGGIVQQYSELF-REDPLYATLGGPSAQIGFD-TYKAIK--ATLETITGGHKQ 1318
+ R A Q ++ +DPL+ GP+ F + AI+ L GH+
Sbjct: 1273 VSDRFALGAQTTQMLEDIITSDDPLWIKALGPAGSGVFGRAWDAIQQLRPLMNADWGHEG 1332
Query: 1319 IAVEEFKVLMRNV 1331
+ E+ +L +V
Sbjct: 1333 VVTEDGLMLAASV 1345
>ref|NP_415890.1| (NC_000913) putative membrane protein [Escherichia coli K12]
pir||G64887 probable tail fiber protein GP37 - Escherichia coli
gb|AAC74454.1| (AE000234) putative membrane protein [Escherichia coli K12]
Length = 1122
Score = 43.1 bits (100), Expect = 0.027
Identities = 61/306 (19%), Positives = 107/306 (34%), Gaps = 26/306 (8%)
Query: 246 ADIATLGGTKTVGRLASAGVKKAATGTRSVV----------------------SDLLASK 283
A + G KT ASA ++ AAT + S+ AS
Sbjct: 198 AAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASS 257
Query: 284 TVSEAVTATKGIAEGGKATVKQLNSARPSPAVAYKAGPSTMDPKPSVNTPSAAVVTEGTK 343
+ S A ++ KA +AR S A ++ + K + + ++A T +
Sbjct: 258 SASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQ 317
Query: 344 RSMLFEKMAEMLSSPFAGKTFTTKSLAEATTEVADRLVAQSTNAFVKVSRRRAEGSDNYI 403
S + S + + T EAT + + A+S +A K S A+ S+
Sbjct: 318 ASASATAAGKSAESAASSASTATTKAGEATEQAS--AAARSASA-AKTSETNAKASETSA 374
Query: 404 YTALLGKADTGKPFTTKAAAMKAVKDDPRYKAVRLNSDRTKAFGVDEDKRGWYLEYSERV 463
++ A + + A++ A KD+ +A S T A + G ++
Sbjct: 375 ESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSK 434
Query: 464 DTSRLATEIEDVNVEEGFLKRSAASLFSAGQTALG-PRLGFMLNAAEGLVARVSKEADVA 522
T+ A + + SA +L A T G +L N+ +A K A
Sbjct: 435 STAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSA 494
Query: 523 FKDISK 528
+ + K
Sbjct: 495 YDNAEK 500
>sp|P76072|STFR_ECOLI Side tail fiber protein homolog from lambdoid prophage Rac
dbj|BAA14966.1| (D90774) Tail fiber protein GP37. [Escherichia coli]
dbj|BAA14975.1| (D90775) Tail fiber protein GP37. [Escherichia coli]
Length = 1120
Score = 43.1 bits (100), Expect = 0.027
Identities = 61/306 (19%), Positives = 107/306 (34%), Gaps = 26/306 (8%)
Query: 246 ADIATLGGTKTVGRLASAGVKKAATGTRSVV----------------------SDLLASK 283
A + G KT ASA ++ AAT + S+ AS
Sbjct: 196 AAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASS 255
Query: 284 TVSEAVTATKGIAEGGKATVKQLNSARPSPAVAYKAGPSTMDPKPSVNTPSAAVVTEGTK 343
+ S A ++ KA +AR S A ++ + K + + ++A T +
Sbjct: 256 SASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQ 315
Query: 344 RSMLFEKMAEMLSSPFAGKTFTTKSLAEATTEVADRLVAQSTNAFVKVSRRRAEGSDNYI 403
S + S + + T EAT + + A+S +A K S A+ S+
Sbjct: 316 ASASATAAGKSAESAASSASTATTKAGEATEQAS--AAARSASA-AKTSETNAKASETSA 372
Query: 404 YTALLGKADTGKPFTTKAAAMKAVKDDPRYKAVRLNSDRTKAFGVDEDKRGWYLEYSERV 463
++ A + + A++ A KD+ +A S T A + G ++
Sbjct: 373 ESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSK 432
Query: 464 DTSRLATEIEDVNVEEGFLKRSAASLFSAGQTALG-PRLGFMLNAAEGLVARVSKEADVA 522
T+ A + + SA +L A T G +L N+ +A K A
Sbjct: 433 STAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSA 492
Query: 523 FKDISK 528
+ + K
Sbjct: 493 YDNAEK 498
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.314 0.131 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 863,663,283
Number of Sequences: 1038431
Number of extensions: 35954457
Number of successful extensions: 77323
Number of sequences better than 5.0e-02: 4
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 77322
Number of HSP's gapped (non-prelim): 6
length of query: 1484
length of database: 328,470,697
effective HSP length: 134
effective length of query: 1350
effective length of database: 189,320,943
effective search space: 255583273050
effective search space used: 255583273050
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 98 (42.4 bits)
Query= orf46 [10533-10678]
(144 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064764.1| (NC_002519) gp26.1 [Roseophage SIO1] >gi|9... 171 1e-042
sp|O76743|GLH4_CAEEL ATP-dependent RNA helicase glh-4 (Germ... 37 0.028
ref|NP_491207.1| (NM_058806) helicase [Caenorhabditis elega... 37 0.028
pir||T32759 hypothetical protein T12F5.3 - Caenorhabditis e... 37 0.028
>ref|NP_064764.1| (NC_002519) gp26.1 [Roseophage SIO1]
gb|AAG02608.1|AF189021_28 (AF189021) Roseophage SIO1 complete genome
Length = 144
Score = 171 bits (433), Expect = 1e-042
Identities = 93/144 (64%), Positives = 93/144 (64%)
Query: 1 MVAVTGTVLAIAGTTAGIVGTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXIRSNILXXXX 60
MVAVTGTVLAIAGTTAGIVGTV IRSNIL
Sbjct: 1 MVAVTGTVLAIAGTTAGIVGTVKAGKAARRSARAQQQAQEVQAKRQRRAAIRSNILASAR 60
Query: 61 XXXXXXXXXXXXXXXXXXXIGAGRSQLGAELGFGSQLSGLSANISKFDMQAQTYGDIAKL 120
IGAGRSQLGAELGFGSQLSGLSANISKFDMQAQTYGDIAKL
Sbjct: 61 AKASAQAAGTSQSSGLSGAIGAGRSQLGAELGFGSQLSGLSANISKFDMQAQTYGDIAKL 120
Query: 121 GFSAAGNAPTIGGYAEDLYGFFKK 144
GFSAAGNAPTIGGYAEDLYGFFKK
Sbjct: 121 GFSAAGNAPTIGGYAEDLYGFFKK 144
>sp|O76743|GLH4_CAEEL ATP-dependent RNA helicase glh-4 (Germline helicase-4)
pir||T43326 germline RNA helicase-4 - Caenorhabditis elegans
gb|AAC28387.1| (AF079508) germline RNA helicase-4 [Caenorhabditis elegans]
Length = 1156
Score = 37.4 bits (85), Expect = 0.028
Identities = 17/42 (40%), Positives = 22/42 (51%)
Query: 92 GFGSQLSGLSANISKFDMQAQTYGDIAKLGFSAAGNAPTIGG 133
GFG + +AN + FD Q +G AK GF G AP+ G
Sbjct: 84 GFGGNAAAAAANKTSFDQQGNGFGGAAKHGFGGVGGAPSSFG 125
>ref|NP_491207.1| (NM_058806) helicase [Caenorhabditis elegans]
gb|AAB96745.2| (AF039718) C. elegans GLH-4 protein (corresponding sequence
T12F5.3) [Caenorhabditis elegans]
Length = 1156
Score = 37.4 bits (85), Expect = 0.028
Identities = 17/42 (40%), Positives = 22/42 (51%)
Query: 92 GFGSQLSGLSANISKFDMQAQTYGDIAKLGFSAAGNAPTIGG 133
GFG + +AN + FD Q +G AK GF G AP+ G
Sbjct: 84 GFGGNAAAAAANKTSFDQQGNGFGGAAKHGFGGVGGAPSSFG 125
>pir||T32759 hypothetical protein T12F5.3 - Caenorhabditis elegans
Length = 1172
Score = 37.4 bits (85), Expect = 0.028
Identities = 17/42 (40%), Positives = 22/42 (51%)
Query: 92 GFGSQLSGLSANISKFDMQAQTYGDIAKLGFSAAGNAPTIGG 133
GFG + +AN + FD Q +G AK GF G AP+ G
Sbjct: 100 GFGGNAAAAAANKTSFDQQGNGFGGAAKHGFGGVGGAPSSFG 141
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.318 0.137 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 52,065,103
Number of Sequences: 1038431
Number of extensions: 1339487
Number of successful extensions: 3611
Number of sequences better than 5.0e-02: 4
Number of HSP's better than 0.1 without gapping: 4
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3600
Number of HSP's gapped (non-prelim): 10
length of query: 144
length of database: 328,470,697
effective HSP length: 120
effective length of query: 24
effective length of database: 203,858,977
effective search space: 4892615448
effective search space used: 4892615448
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 83 (36.6 bits)
Query= orf47 [10678-10831]
(152 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064765.1| (NC_002519) gp26.2 [Roseophage SIO1] >gi|9... 299 6e-081
ref|NP_349180.1| (NC_003030) Predicted acetyltransferase [C... 47 8e-005
ref|NP_561792.1| (NC_003366) hypothetical protein [Clostrid... 45 3e-004
ref|NP_655925.1| (NC_003995) Acetyltransf, Acetyltransferas... 44 5e-004
ref|NP_656088.1| (NC_003995) Acetyltransf, Acetyltransferas... 42 0.002
ref|NP_656432.1| (NC_003995) Acetyltransf, Acetyltransferas... 38 0.030
ref|NP_288995.1| (NC_002655) Z3699 gene product [Escherichi... 38 0.030
ref|NP_248202.1| (NC_000909) protease synthase and sporulat... 38 0.030
>ref|NP_064765.1| (NC_002519) gp26.2 [Roseophage SIO1]
gb|AAG02610.1|AF189021_30 (AF189021) Roseophage SIO1 complete genome
Length = 153
Score = 299 bits (766), Expect = 6e-081
Identities = 152/152 (100%), Positives = 152/152 (100%)
Query: 1 VLKTTATKIRTATPDDVFDILILAKEFSKEAPQSHKWNKEKTEQFILSALQNTNMTIFVI 60
VLKTTATKIRTATPDDVFDILILAKEFSKEAPQSHKWNKEKTEQFILSALQNTNMTIFVI
Sbjct: 2 VLKTTATKIRTATPDDVFDILILAKEFSKEAPQSHKWNKEKTEQFILSALQNTNMTIFVI 61
Query: 61 DVDGEIEGALVGLLSELYMSYTVQATELAWFVSKDYRGKPASLKLIKAFEKWAKESGAKQ 120
DVDGEIEGALVGLLSELYMSYTVQATELAWFVSKDYRGKPASLKLIKAFEKWAKESGAKQ
Sbjct: 62 DVDGEIEGALVGLLSELYMSYTVQATELAWFVSKDYRGKPASLKLIKAFEKWAKESGAKQ 121
Query: 121 IGMGDIEGISSLEKLYNRLGYERAETVYLKEL 152
IGMGDIEGISSLEKLYNRLGYERAETVYLKEL
Sbjct: 122 IGMGDIEGISSLEKLYNRLGYERAETVYLKEL 153
>ref|NP_349180.1| (NC_003030) Predicted acetyltransferase [Clostridium
acetobutylicum]
gb|AAK80520.1|AE007755_3 (AE007755) Predicted acetyltransferase [Clostridium acetobutylicum]
Length = 167
Score = 46.6 bits (109), Expect = 8e-005
Identities = 39/147 (26%), Positives = 71/147 (47%), Gaps = 15/147 (10%)
Query: 2 LKTTATKIRTATPDDVFDILILAKEFSKEAP------QSHKWNKEKTEQFILSALQNTNM 55
+++ + IR A D + I+ L K+ KE+P + + E +++I +NT +
Sbjct: 1 MQSESLVIRRAEEGDAYQIIRLVKQVIKESPFFERTPEEFNFTVEDEQEYI----KNTEL 56
Query: 56 TIFVIDVDGEIEGALVGLLSELYMSYTVQATELAWF-VSKDYRGKPASLKLIKAFEKWAK 114
I V++VDG+I G+ S L M ++ T L + K+Y GK +IK +WA+
Sbjct: 57 FI-VVEVDGKIVGSATLKRSSLSM---LRHTALFGITILKEYSGKGIGSLIIKRVFEWAE 112
Query: 115 ESGAKQIGMGDIEGISSLEKLYNRLGY 141
E+ ++I + LY + G+
Sbjct: 113 ENAIEKIDLEVFHDNFKAISLYKKFGF 139
>ref|NP_561792.1| (NC_003366) hypothetical protein [Clostridium perfringens]
dbj|BAB80582.1| (AP003188) hypothetical protein [Clostridium perfringens str. 13]
Length = 147
Score = 44.7 bits (104), Expect = 3e-004
Identities = 35/121 (28%), Positives = 54/121 (43%), Gaps = 4/121 (3%)
Query: 22 ILAKEFSKEAPQSHKWNKEKTEQFILSALQNTNMTIFVIDVDGEIEGALVGLLSELYMSY 81
+ K FSKE P + W+ E ++ + + V+ D +IEG ++G L + Y
Sbjct: 18 LFVKTFSKE-PWNEPWDFENAKKRLNDVVLTPGFRGAVLRNDEKIEGVILGNLEQWYDGE 76
Query: 82 TVQATELAWFVSKDYRGKPASLKLIKAFEKWAKESGAKQIGMGDIEGISSLEKLYNRLGY 141
E +FV +GK KL+ A E E I ++G S+ E YN+ GY
Sbjct: 77 HFCVKE--FFVDSSSQGKGTGKKLLNALEDILMEKEVGVIHFWTMKG-STAEAFYNKRGY 133
Query: 142 E 142
E
Sbjct: 134 E 134
>ref|NP_655925.1| (NC_003995) Acetyltransf, Acetyltransferase (GNAT) family [Bacillus
anthracis A2012] [Bacillus anthracis str. A2012]
Length = 165
Score = 43.9 bits (102), Expect = 5e-004
Identities = 33/141 (23%), Positives = 63/141 (44%), Gaps = 8/141 (5%)
Query: 9 IRTATPDDVFDILILAKEFSKEA------PQSHKWNKEKTEQFILSALQNTNMTIFVIDV 62
IR +D L L+K+ +E P K+ E+ EQ I ++N TIFV
Sbjct: 2 IREIQIEDAASFLQLSKQLDEETKFMLYEPGERKFTAEQQEQMIHRFIENKYTTIFVAVE 61
Query: 63 DGEIEGALVGLLSELYMSYTVQATELAWFVSKDYRGKPASLKLIKAFEKWAKESGAKQIG 122
+ I G ++ + + V + + + ++Y G+ +L K EKWA+ ++
Sbjct: 62 EERIVGFILVNGNNIQRKRHVASIVIG--ILQEYSGRGIGTRLFKEVEKWARLQDIWRLE 119
Query: 123 MGDIEGISSLEKLYNRLGYER 143
+ + + + LY + G+E+
Sbjct: 120 LTVMAHNTRAQALYKKAGFEK 140
>ref|NP_656088.1| (NC_003995) Acetyltransf, Acetyltransferase (GNAT) family [Bacillus
anthracis A2012] [Bacillus anthracis str. A2012]
Length = 146
Score = 42.0 bits (97), Expect = 0.002
Identities = 34/119 (28%), Positives = 60/119 (49%), Gaps = 12/119 (10%)
Query: 40 EKTEQFILSALQNTNMTIFVIDVDGEIEGALVGLLSELYMSYTVQATELAW-----FVSK 94
E + F+ + ++ IFV DGE G ++LY S++ + + W FV
Sbjct: 32 EGAKVFLRNRMERKESVIFVAVEDGEYIG-----FTQLYPSFSSISMKELWILNDLFVQA 86
Query: 95 DYRGKPASLKLIKAFEKWAKESGAKQIGM-GDIEGISSLEKLYNRLGYERAETVYLKEL 152
RG KL++A +++A E+GAK + + +I+ +S+ ++LY GY R + EL
Sbjct: 87 AKRGAGTGKKLLEAAKEFALENGAKGVKLQTEIDNLSA-QRLYAENGYLRDNRYFHYEL 144
>ref|NP_656432.1| (NC_003995) Acetyltransf, Acetyltransferase (GNAT) family [Bacillus
anthracis A2012] [Bacillus anthracis str. A2012]
Length = 144
Score = 38.1 bits (87), Expect = 0.030
Identities = 29/101 (28%), Positives = 52/101 (50%), Gaps = 4/101 (3%)
Query: 50 LQNTNMTIFVIDVDGEIE-GALVGLLSELYMSYTVQATELAWFVSKDYRGKPASLKLIKA 108
L+ ++ + +DG+ G ++G + + T++ LA V +YRG S KL +
Sbjct: 44 LEGNSLEHSFLALDGDKPVGVILGGIKDYENIKTMRCGTLA--VHPNYRGVGVSQKLFEL 101
Query: 109 FEKWAKESGAKQIGMGDIEGISSLEKLYNRLGYERAETVYL 149
++ A ++ KQ+ + I G + YN+LGYE T+YL
Sbjct: 102 HKEEALQNECKQLFLEVIVGNDRAIRFYNKLGYENC-TIYL 141
>ref|NP_288995.1| (NC_002655) Z3699 gene product [Escherichia coli O157:H7 EDL933]
ref|NP_311332.1| (NC_002695) hypothetical protein [Escherichia coli O157:H7]
gb|AAG57552.1|AE005473_7 (AE005473) Z3699 gene product [Escherichia coli O157:H7 EDL933]
dbj|BAB36728.1| (AP002561) hypothetical protein [Escherichia coli O157:H7]
Length = 178
Score = 38.1 bits (87), Expect = 0.030
Identities = 34/146 (23%), Positives = 64/146 (43%), Gaps = 12/146 (8%)
Query: 4 TTATKIRTATPDDVFDILILAKEFSKEAPQSHKWNKEKTEQFILSALQNTNMTIFVIDVD 63
T +IR +D +++ L + P WN + + + N ++++F++
Sbjct: 35 TNTMEIRVFRQEDFEEVITLWERCDLLRP----WNDPEMD---IERKMNHDVSLFLV--- 84
Query: 64 GEIEGALVGLLSELYMSYTVQATELAWFVSKDYRGKPASLKLIKAFEKWAKESGAKQIGM 123
E+ G +VG + Y + A L V ++RG+ + L+ EK G +I +
Sbjct: 85 AEVNGEVVGTVMGGYDGHRGSAYYLG--VHPEFRGRGIANALLNRLEKKLIARGCPKIQI 142
Query: 124 GDIEGISSLEKLYNRLGYERAETVYL 149
E + +Y RLGYE A+ + L
Sbjct: 143 NVPEDNDMVLGMYERLGYEHADVLSL 168
>ref|NP_248202.1| (NC_000909) protease synthase and sporulation negative regulator
Pai1, putative [Methanococcus jannaschii]
[Methanocaldococcus jannaschii]
sp|Q58604|YC07_METJA Hypothetical protein MJ1207
pir||F64450 hypothetical protein MJ1207 - Methanococcus jannaschii
gb|AAB99211.1| (U67562) protease synthase and sporulation negative regulator Pai1,
putative [Methanococcus jannaschii] [Methanocaldococcus
jannaschii]
Length = 226
Score = 38.1 bits (87), Expect = 0.030
Identities = 35/143 (24%), Positives = 64/143 (44%), Gaps = 7/143 (4%)
Query: 9 IRTATPDDVFDILILAKEFSKEAPQSHKWNKEKTEQFILSALQNTNMTIFVIDVDGEIEG 68
IR T DD+ D L L + + + + K+ + ++ FV +V+G+ G
Sbjct: 77 IRNVTKDDLDDFLELYFKAYRGFDKYYYKKKKWARWYFKWLMKRDEDGFFVCEVNGKPVG 136
Query: 69 ALV---GLLSELYMSYTVQATELAWFVSKDYRGKPASLKLIKAFEKWAKESGAKQIGMG- 124
+ +S + + E+ FV D+RG+ LI ++AK+ G + + +
Sbjct: 137 FVACDCNWISNIEKREVAEIHEI--FVDPDFRGRGIGTALINKAIEYAKKRGRRIVELWV 194
Query: 125 DIEGISSLEKLYNRLGYERAETV 147
+E ++E Y RLG+E E V
Sbjct: 195 GVENKGAIE-FYKRLGFEEKEVV 216
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.315 0.132 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 90,689,644
Number of Sequences: 1038431
Number of extensions: 3383609
Number of successful extensions: 6526
Number of sequences better than 5.0e-02: 10
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 6524
Number of HSP's gapped (non-prelim): 10
length of query: 152
length of database: 328,470,697
effective HSP length: 110
effective length of query: 42
effective length of database: 214,243,287
effective search space: 8998218054
effective search space used: 8998218054
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 86 (37.7 bits)
Query= orf48 [10824-11540]
(715 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064766.1| (NC_002519) gp27 [Roseophage SIO1] >gi|994... 1397 0.0
ref|NP_640315.1| (NC_003907) hypothetical protein [Vibrioph... 77 1e-012
ref|NP_640313.1| (NC_003907) hypothetical protein [Vibrioph... 51 6e-005
>ref|NP_064766.1| (NC_002519) gp27 [Roseophage SIO1]
gb|AAG02611.1|AF189021_31 (AF189021) Roseophage SIO1 complete genome
Length = 715
Score = 1397 bits (3615), Expect = 0.0
Identities = 695/715 (97%), Positives = 695/715 (97%)
Query: 1 MPQSLTQRTVNTFIKGLITEASELTFPENASVDELNCSLGRDGTRRRRKAVTLEDNHVLS 60
MPQSLTQRTVNTFIKGLITEASELTFPENASVDELNCSLGRDGTRRRRKAVTLEDNHVLS
Sbjct: 1 MPQSLTQRTVNTFIKGLITEASELTFPENASVDELNCSLGRDGTRRRRKAVTLEDNHVLS 60
Query: 61 DVVVPEGALVQTLDWYNVAGQVNLEFLVVQVNNILYFYEKSTDPLSANKYSGSVDLNTHS 120
DVVVPEGALVQTLDWYNVAGQVNLEFLVVQVNNILYFYEKSTDPLSANKYSGSVDLNTHS
Sbjct: 61 DVVVPEGALVQTLDWYNVAGQVNLEFLVVQVNNILYFYEKSTDPLSANKYSGSVDLNTHS 120
Query: 121 ASNNLSPSEERVQVTSLNGYLIVASPAINTFYLGFNTSTEAFTATSISFKERDFEWQGSD 180
ASNNLSPSEERVQVTSLNGYLIVASPAINTFYLGFNTSTEAFTATSISFKERDFEWQGSD
Sbjct: 121 ASNNLSPSEERVQVTSLNGYLIVASPAINTFYLGFNTSTEAFTATSISFKERDFEWQGSD 180
Query: 181 VDVTSLYFGEGTSVSNQRIYDTYNVGWVGPKGSAALNTYGSYIVYPALTHPWYSGKDANG 240
VDVTSLYFGEGTSVSNQRIYDTYNVGWVGPKGSAALNTYGSYIVYPALTHPWYSGKDANG
Sbjct: 181 VDVTSLYFGEGTSVSNQRIYDTYNVGWVGPKGSAALNTYGSYIVYPALTHPWYSGKDANG 240
Query: 241 AFNKADWLEIYTGSSLASNGHYVLDVFNKARTGLTTEVETGRFRSVAAYAGRVFYAGIDS 300
AFNKADWLEIYTGSSLASNGHYVLDVFNKARTGLTTEVETGRFRSVAAYAGRVFYAGIDS
Sbjct: 241 AFNKADWLEIYTGSSLASNGHYVLDVFNKARTGLTTEVETGRFRSVAAYAGRVFYAGIDS 300
Query: 301 AKNGGKVYFSRLTERMSDVGNCYQVNDPTSEVLSDLLDTDGGVVRIPDAHNIRKLHVLGA 360
AKNGGKVYFSRLTERMSDVGNCYQVNDPTSEVLSDLLDTDGGVVRIPDAHNIRKLHVLGA
Sbjct: 301 AKNGGKVYFSRLTERMSDVGNCYQVNDPTSEVLSDLLDTDGGVVRIPDAHNIRKLHVLGA 360
Query: 361 SLLVFAENGVWAVAGVDNVFRATEYAITRISDVGLSNENSFVVADGIPIWWGKTGIYAVQ 420
SLLVFAENGVWAVAGVDNVFRATEYAITRISDVGLSNENSFVVADGIPIWWGKTGIYAVQ
Sbjct: 361 SLLVFAENGVWAVAGVDNVFRATEYAITRISDVGLSNENSFVVADGIPIWWGKTGIYAVQ 420
Query: 421 QSENLNTPTAQNLSLSTIQTLWNNISNAKKAQVTVEYDKINQRVFWFYPDNDESVDYKYN 480
QSENLNTPTAQNLSLSTIQTLWNNISNAKKAQVTVEYDKINQRVFWFYPDNDESVDYKYN
Sbjct: 421 QSENLNTPTAQNLSLSTIQTLWNNISNAKKAQVTVEYDKINQRVFWFYPDNDESVDYKYN 480
Query: 481 NILVMDLALQAFYPWRVEDEAXXXXXXXXXXXXXXXXXXXXETQVVNGADVVVNGSDNVV 540
NILVMDLALQAFYPWRVEDEA ETQVVNGADVVVNGSDNVV
Sbjct: 481 NILVMDLALQAFYPWRVEDEASSTSYIIGTSYYGGLGSTSTETQVVNGADVVVNGSDNVV 540
Query: 541 ATLYRDYLEGDSEIKLLVRDGTTGKMTFATFRGDTYLDWGSADYKSFAEAGYDFMGDITT 600
ATLYRDYLEGDSEIKLLVRDGTTGKMTFATFRGDTYLDWGSADYKSFAEAGYDFMGDITT
Sbjct: 541 ATLYRDYLEGDSEIKLLVRDGTTGKMTFATFRGDTYLDWGSADYKSFAEAGYDFMGDITT 600
Query: 601 FKNAPYVTTYMRVTEDGYVASGAGYEFINPSSCLMSVSWNLSKSGSTPREIYKLKDVPVV 660
FKNAPYVTTYMRVTEDGYVASGAGYEFINPSSCLMSVSWNLSKSGSTPREIYKLKDVPVV
Sbjct: 601 FKNAPYVTTYMRVTEDGYVASGAGYEFINPSSCLMSVSWNLSKSGSTPREIYKLKDVPVV 660
Query: 661 NPNDLSSINYPTDTVVTKSKVRGRGRSMKFRFESVAGKDFHLVGYEVIGAKNNSY 715
NPNDLSSINYPTDTVVTKSKVRGRGRSMKFRFESVAGKDFHLVGYEVIGAKNNSY
Sbjct: 661 NPNDLSSINYPTDTVVTKSKVRGRGRSMKFRFESVAGKDFHLVGYEVIGAKNNSY 715
>ref|NP_640315.1| (NC_003907) hypothetical protein [Vibriophage VpV262]
gb|AAM28405.1| (AY095314) hypothetical protein [Vibriophage VpV262]
Length = 911
Score = 76.6 bits (187), Expect = 1e-012
Identities = 56/217 (25%), Positives = 104/217 (47%), Gaps = 8/217 (3%)
Query: 282 RFRSVAAYAGRVFYAGIDSAKNGG-KVYFSRLTERMSDVGNCYQVNDPTSEVLSDLLDTD 340
R R + G V++ D KNG ++ S+L + ++ C+Q DPT+E ++DL+ TD
Sbjct: 353 RPRCSGYHNGHVYFG--DRDKNGKTRILVSQLVNSLDNIPKCFQDADPTAEEINDLIATD 410
Query: 341 GGVVRIPDAHNIRKLHVLGASLLVFAENGVWAVAGVD-NVFRATEYAITRISDVGLSNEN 399
G + + LL+ NGVWA+ G AT++ + +++ V ++
Sbjct: 411 GFTMYPVGMGAPITMVEFNKRLLLLCTNGVWAIRGTSGGGATATDFTLDKVASVEFNSPQ 470
Query: 400 SFVVADGIPIWWGKTGIYAVQQSENLNTPTAQNLSLSTIQTLWNNISNAKKAQVTVEYDK 459
S V ++W + GI A+ ++ T+ NL+ +TI ++++ V +
Sbjct: 471 SVVDIGTAIVFWSERGIIAIGVND-FGDLTSNNLTENTIDEYYDSLDRDIIKNVKGTFIN 529
Query: 460 INQRVFWFYPD-NDESVDYKYNN--ILVMDLALQAFY 493
RV+W P+ D + +YK + +LV++L FY
Sbjct: 530 DENRVYWVVPNKQDSNGEYKTDGELVLVLNLDTGGFY 566
>ref|NP_640313.1| (NC_003907) hypothetical protein [Vibriophage VpV262]
gb|AAM28407.1| (AY095314) hypothetical protein [Vibriophage VpV262]
Length = 98
Score = 50.8 bits (120), Expect = 6e-005
Identities = 29/83 (34%), Positives = 48/83 (56%), Gaps = 7/83 (8%)
Query: 632 SCLMSVSWNLS-----KSGSTPREIYKLKDVPVVNPNDLSSINYPTDTVVTKSKVRGRGR 686
S LM+ S++ S + +T +E YK + S+ YP + V TK ++RG+GR
Sbjct: 8 SSLMTASFDWSIDSRTRKVTTQQEAYKYDSRRLAQGGAESA--YPYEVVDTKLRIRGQGR 65
Query: 687 SMKFRFESVAGKDFHLVGYEVIG 709
+++ RF+S GKD L+GY ++G
Sbjct: 66 AVRIRFQSNGGKDLRLLGYSILG 88
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.315 0.133 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 454,402,024
Number of Sequences: 1038431
Number of extensions: 19338038
Number of successful extensions: 44987
Number of sequences better than 5.0e-02: 3
Number of HSP's better than 0.1 without gapping: 2
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 44979
Number of HSP's gapped (non-prelim): 4
length of query: 715
length of database: 328,470,697
effective HSP length: 128
effective length of query: 587
effective length of database: 195,551,529
effective search space: 114788747523
effective search space used: 114788747523
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 95 (41.2 bits)
Query= orf49 [11529-11635]
(105 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.305 0.126 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 59,900,751
Number of Sequences: 1038431
Number of extensions: 1787727
Number of successful extensions: 2265
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2265
Number of HSP's gapped (non-prelim): 0
length of query: 105
length of database: 328,470,697
effective HSP length: 81
effective length of query: 24
effective length of database: 244,357,786
effective search space: 5864586864
effective search space used: 5864586864
T: 11
A: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf50 [11630-11880]
(249 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064767.1| (NC_002519) gp28 [Roseophage SIO1] >gi|994... 410 e-114
ref|NP_640312.1| (NC_003907) hypothetical protein [Vibrioph... 72 6e-012
>ref|NP_064767.1| (NC_002519) gp28 [Roseophage SIO1]
gb|AAG02612.1|AF189021_32 (AF189021) Roseophage SIO1 complete genome
Length = 206
Score = 410 bits (1054), Expect = e-114
Identities = 206/206 (100%), Positives = 206/206 (100%)
Query: 44 MIATRFIPEHSQTLKLTSFSSSARPTHFSFPTRVKNIEFLDYNVSKAVGGVEYRRLKYLS 103
MIATRFIPEHSQTLKLTSFSSSARPTHFSFPTRVKNIEFLDYNVSKAVGGVEYRRLKYLS
Sbjct: 1 MIATRFIPEHSQTLKLTSFSSSARPTHFSFPTRVKNIEFLDYNVSKAVGGVEYRRLKYLS 60
Query: 104 PDEFFGLSDGRDSLASNVKQVADVGSDSILLIRNDAMPMYYTSFDDDTVVLDSYDASVDA 163
PDEFFGLSDGRDSLASNVKQVADVGSDSILLIRNDAMPMYYTSFDDDTVVLDSYDASVDA
Sbjct: 61 PDEFFGLSDGRDSLASNVKQVADVGSDSILLIRNDAMPMYYTSFDDDTVVLDSYDASVDA 120
Query: 164 ILTSAKTRAYGVKYPTFDSFSDTFVPDIDDTMFPFLLAEAKSTAMSLFKSGADPKIEQTA 223
ILTSAKTRAYGVKYPTFDSFSDTFVPDIDDTMFPFLLAEAKSTAMSLFKSGADPKIEQTA
Sbjct: 121 ILTSAKTRAYGVKYPTFDSFSDTFVPDIDDTMFPFLLAEAKSTAMSLFKSGADPKIEQTA 180
Query: 224 RRQKVYVQNDMHKVNTGRAKNNYGRN 249
RRQKVYVQNDMHKVNTGRAKNNYGRN
Sbjct: 181 RRQKVYVQNDMHKVNTGRAKNNYGRN 206
>ref|NP_640312.1| (NC_003907) hypothetical protein [Vibriophage VpV262]
gb|AAM28408.1| (AY095314) hypothetical protein [Vibriophage VpV262]
Length = 250
Score = 72.0 bits (175), Expect = 6e-012
Identities = 73/259 (28%), Positives = 120/259 (46%), Gaps = 23/259 (8%)
Query: 2 KMTLLEMVQNILSDMDSEEINSLV-DSNEAEQIA----KVVENTYFAMIATRFIPEHSQT 56
K TLL++V+ + S+E+ SL DS E + + +V+E+ + F+ +
Sbjct: 3 KRTLLQIVKKMAQKTGSDEVTSLSEDSIEIQDMVDCALEVLEDIIYRN-DWEFLKDRPAQ 61
Query: 57 LKLTSFSSSARPTHFSFPTRVKNIEFLDYNVSKAVGGVE--YRRLKYLSPDEFFGLSDGR 114
L+ + S P V+ I+ L Y A GV+ +R L+Y+ P EF
Sbjct: 62 LE-----AGTNAIELSIPDNVRKIQTLRYRYEDA--GVQNCFRTLRYMYPHEFMERLQNN 114
Query: 115 DSLASNVKQVADVGSDSILLIRNDAMPMYYTSFDDDTVVLDSYDASVDAILTSAKTRA-Y 173
+ V G + L + + P Y+TSFD+ VVLDSYDA+ + A A
Sbjct: 115 KPTDPDTTTVTINGVE--LYPKTNRHPRYWTSFDEQNVVLDSYDATQNPTGVDATDSAII 172
Query: 174 GVKYPTF-DSFSDTFVPDIDDTMFPFLLAEAKSTAMSLFKSGADPKIEQTARR---QKVY 229
Y F S +D++V I +++F EA + A F+ +P+ E+ +RR Q++
Sbjct: 173 ATLYLDFTGSDADSWVAPIPESLFTLWEQEAVAEAFVQFRQTENPRAERRSRRTYVQQIK 232
Query: 230 VQNDMHKVNTGRAKNNYGR 248
+ HK + G + NYGR
Sbjct: 233 KEPVTHK-DEGSDEVNYGR 250
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.316 0.131 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 142,260,782
Number of Sequences: 1038431
Number of extensions: 5306939
Number of successful extensions: 13198
Number of sequences better than 5.0e-02: 2
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 13194
Number of HSP's gapped (non-prelim): 2
length of query: 249
length of database: 328,470,697
effective HSP length: 118
effective length of query: 131
effective length of database: 205,935,839
effective search space: 26977594909
effective search space used: 26977594909
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 90 (39.3 bits)
Query= orf51 [11881-12072]
(190 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064768.1| (NC_002519) gp29 [Roseophage SIO1] >gi|994... 379 e-104
>ref|NP_064768.1| (NC_002519) gp29 [Roseophage SIO1]
gb|AAG02613.1|AF189021_33 (AF189021) Roseophage SIO1 complete genome
Length = 190
Score = 379 bits (972), Expect = e-104
Identities = 190/190 (100%), Positives = 190/190 (100%)
Query: 1 MANVEHSSLTGSALHEPKGTATANSGEVYVANGSGSGTWQPIHRHLGAATAFSSTSPYAY 60
MANVEHSSLTGSALHEPKGTATANSGEVYVANGSGSGTWQPIHRHLGAATAFSSTSPYAY
Sbjct: 1 MANVEHSSLTGSALHEPKGTATANSGEVYVANGSGSGTWQPIHRHLGAATAFSSTSPYAY 60
Query: 61 TIDTDTVEKFLSFPVSSSHVEGFTVLTSPNLRFRYDDPTEITSLINVTMSSSQAGGVGHA 120
TIDTDTVEKFLSFPVSSSHVEGFTVLTSPNLRFRYDDPTEITSLINVTMSSSQAGGVGHA
Sbjct: 61 TIDTDTVEKFLSFPVSSSHVEGFTVLTSPNLRFRYDDPTEITSLINVTMSSSQAGGVGHA 120
Query: 121 VQWALFKNGVEIVGSRAIRTISSGSWGSISVTGVTTLNQNDYIEIKTKADADNVDVNYAN 180
VQWALFKNGVEIVGSRAIRTISSGSWGSISVTGVTTLNQNDYIEIKTKADADNVDVNYAN
Sbjct: 121 VQWALFKNGVEIVGSRAIRTISSGSWGSISVTGVTTLNQNDYIEIKTKADADNVDVNYAN 180
Query: 181 IYVSIIGMSA 190
IYVSIIGMSA
Sbjct: 181 IYVSIIGMSA 190
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.312 0.127 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 120,457,684
Number of Sequences: 1038431
Number of extensions: 4771127
Number of successful extensions: 16164
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 16163
Number of HSP's gapped (non-prelim): 1
length of query: 190
length of database: 328,470,697
effective HSP length: 114
effective length of query: 76
effective length of database: 210,089,563
effective search space: 15966806788
effective search space used: 15966806788
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 88 (38.5 bits)
Query= orf52 [12111-12425]
(313 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_640318.1| (NC_003907) hypothetical protein [Vibrioph... 211 1e-053
>ref|NP_640318.1| (NC_003907) hypothetical protein [Vibriophage VpV262]
gb|AAM28402.1| (AY095314) hypothetical protein [Vibriophage VpV262]
Length = 322
Score = 211 bits (536), Expect = 1e-053
Identities = 119/315 (37%), Positives = 185/315 (57%), Gaps = 16/315 (5%)
Query: 4 NTVATLALAKRAEVWSAELKEILRDELQGMKYVKWLDQFPDGDTFKIPSLGDATINNYSE 63
NT T AL +E+W+ E+++IL ++L + + +D FPDGD IPS+G + + E
Sbjct: 6 NTSNTQALIV-SEIWADEIEDILHEKLLDVNIARVVD-FPDGDKLTIPSVGTPVVRSRPE 63
Query: 64 DTAVTYDPIDDAQFTFSITEYLQAGNYITNKAMQDVYYANEIMSQFVPIQERALMERLES 123
T+D +D + + + + + AGN I+ K QD + + + + Q RA+MER ++
Sbjct: 64 QGDFTFDNLDTGEISIILRDEVYAGNAISKKLRQDSRWISNVGAMLPAEQARAIMERYQT 123
Query: 124 DIMALGGQQTV--DNANTINGVDHRMLGSGTGGKIGVADFAKANLALKTAKVPQKNLVAI 181
D++ALG Q ++ N INGV HR +G+GT + V DF++ N + +K+P ++ I
Sbjct: 124 DLLALGNAQFAGQNDPNVINGVPHRFVGTGTDQTMDVTDFSRVNYVMTQSKMPMGGMIGI 183
Query: 182 VDPSVEFELNTLSQLTNVSNNPRWEGVVRDGIATGMSFIANIYGFDVYTSNYLKTEAAET 241
+DPSV L T++ ++N+SNNPRWEG+V GIA M F+ ++YG D++ SN L +A ET
Sbjct: 184 IDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQFVRSVYGIDLFVSNLL-ADANET 242
Query: 242 I--GGTTVNNAI--TNMFFS-ADQAVLPFVGAWRQMPEVDT---EYNKDFQRTEFVTTAR 293
I GG + NMF + +D +LPFV AW++MP + +YN D TTAR
Sbjct: 243 INAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDDYNDDLNT---ATTAR 299
Query: 294 YGLKLYRPENLATVM 308
+G L R ENL V+
Sbjct: 300 WGNGLVRDENLVCVL 314
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.316 0.132 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 193,399,631
Number of Sequences: 1038431
Number of extensions: 7730005
Number of successful extensions: 19042
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 19037
Number of HSP's gapped (non-prelim): 2
length of query: 313
length of database: 328,470,697
effective HSP length: 121
effective length of query: 192
effective length of database: 202,820,546
effective search space: 38941544832
effective search space used: 38941544832
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 91 (39.7 bits)
Query= orf53 [12429-12685]
(255 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064769.1| (NC_002519) gp30 [Roseophage SIO1] >gi|994... 490 e-138
ref|NP_640319.1| (NC_003907) hypothetical protein [Vibrioph... 66 3e-010
ref|NP_173855.1| (NM_102291) unknown protein; protein id: A... 43 0.003
gb|AAF97958.1|AC000103_8 (AC000103) F21J9.12 [Arabidopsis t... 43 0.003
ref|NP_629712.1| (NC_003888) putative chromosome associated... 42 0.006
pir||I51116 NF-180 - sea lamprey >gi|632549|gb|AAA80106.1| ... 42 0.006
gb|AAB71984.1| (U73162) M-protein [Streptococcus equi] 42 0.008
gb|AAF73810.1|AF154038_1 (AF154038) surface protein PspC [S... 41 0.011
dbj|BAB86220.1| (AP003922) contains ESTs AU093915(E1276),AU... 40 0.019
>ref|NP_064769.1| (NC_002519) gp30 [Roseophage SIO1]
gb|AAG02614.1|AF189021_35 (AF189021) Roseophage SIO1 complete genome
Length = 255
Score = 490 bits (1261), Expect = e-138
Identities = 255/255 (100%), Positives = 255/255 (100%)
Query: 1 MSVFSEEQVTPATQSEQVSSFEEPTSPSVVNELVGEGRKFNDVEALPKGKLEADRFIEQM 60
MSVFSEEQVTPATQSEQVSSFEEPTSPSVVNELVGEGRKFNDVEALPKGKLEADRFIEQM
Sbjct: 1 MSVFSEEQVTPATQSEQVSSFEEPTSPSVVNELVGEGRKFNDVEALPKGKLEADRFIEQM 60
Query: 61 KQENAALKADLEKQAYRLGVTEHLKETASASTAELSDPNNNIGGTADVANTKPSSSEADI 120
KQENAALKADLEKQAYRLGVTEHLKETASASTAELSDPNNNIGGTADVANTKPSSSEADI
Sbjct: 61 KQENAALKADLEKQAYRLGVTEHLKETASASTAELSDPNNNIGGTADVANTKPSSSEADI 120
Query: 121 ESLVEQTLRKREQESVAKSNIALVESEREKAYGTEAAATVQQKASELGLPMAELQSMAAK 180
ESLVEQTLRKREQESVAKSNIALVESEREKAYGTEAAATVQQKASELGLPMAELQSMAAK
Sbjct: 121 ESLVEQTLRKREQESVAKSNIALVESEREKAYGTEAAATVQQKASELGLPMAELQSMAAK 180
Query: 181 SPAAFMQLMGKPAPRSNPLVQGSIRTEGSTMQASSERDFGYYQRLRKENSSLYYKPSTQR 240
SPAAFMQLMGKPAPRSNPLVQGSIRTEGSTMQASSERDFGYYQRLRKENSSLYYKPSTQR
Sbjct: 181 SPAAFMQLMGKPAPRSNPLVQGSIRTEGSTMQASSERDFGYYQRLRKENSSLYYKPSTQR 240
Query: 241 AMMADADRLGDNFYR 255
AMMADADRLGDNFYR
Sbjct: 241 AMMADADRLGDNFYR 255
>ref|NP_640319.1| (NC_003907) hypothetical protein [Vibriophage VpV262]
gb|AAM28401.1| (AY095314) hypothetical protein [Vibriophage VpV262]
Length = 244
Score = 66.2 bits (160), Expect = 3e-010
Identities = 62/237 (26%), Positives = 99/237 (41%), Gaps = 24/237 (10%)
Query: 18 VSSFEEPTSPSVVNELVGEGRKFNDVEALPKGKLEADRFIEQMKQENAALKADLEKQAYR 77
V+ +E S +++ELVGEG+K+ VE K + I ++ E A L+ E Q+
Sbjct: 15 VNPYEGKDSKEIMDELVGEGKKYKSVEDAVKALAYSQHHITTLESETAKLR---EAQSQA 71
Query: 78 LGVTEHLKETASASTAELSDPNNNIGGTADVANTKPSSSEADIESLVEQTLRKREQESVA 137
+ E L A + +N D A KP + + D+E+ V++ L + + A
Sbjct: 72 KTIDEVL---AKLQQKQQPSDDNKGSDENDPAAQKP-TDDIDVEATVKRLLEQHTSTAQA 127
Query: 138 KSNIALVESEREKAYGTEAAATVQQKASELGLPMAELQSMAAKSPAAFMQLMG------- 190
++N V + YGT+A + ELG+ L+ MA SPAA ++L+G
Sbjct: 128 EANHKQVVDAVKAKYGTKAFDVWDKAEKELGI---NLEQMAQTSPAATLKLLGISGESAP 184
Query: 191 -------KPAPRSNPLVQGSIRTEGSTMQASSERDFGYYQRLRKENSSLYYKPSTQR 240
K + P QG EGS G R + N L Y ++
Sbjct: 185 AQSAATFKGDAKPTPTDQGERPPEGSKRLVDYMLSKGEINRSQAYNLKLKYSADPEK 241
>ref|NP_173855.1| (NM_102291) unknown protein; protein id: At1g24460.1 [Arabidopsis
thaliana]
Length = 1730
Score = 43.1 bits (100), Expect = 0.003
Identities = 31/128 (24%), Positives = 60/128 (46%), Gaps = 7/128 (5%)
Query: 43 VEALPKGKLEADRFIEQMKQENAALKADLEKQAYRLGVTEHLKETASASTAELSDPNNNI 102
++ + ++E IE++K E AL + L + L + E TA + + L++ N N+
Sbjct: 845 IQEVQLARVEEQEEIEKVKSEVDALTSKLAETQTALKLVEDALSTAEDNISRLTEENRNV 904
Query: 103 GGTADVANTKPSSSEADIESLVEQTLRKREQESV-------AKSNIALVESEREKAYGTE 155
+ A + + AD S+ + +S A+ NI+ + SE+E+A G
Sbjct: 905 QAAKENAELELQKAVADASSVASELDEVLATKSTLEAALMQAERNISDIISEKEEAQGRT 964
Query: 156 AAATVQQK 163
A A ++Q+
Sbjct: 965 ATAEMEQE 972
>gb|AAF97958.1|AC000103_8 (AC000103) F21J9.12 [Arabidopsis thaliana]
Length = 1864
Score = 43.1 bits (100), Expect = 0.003
Identities = 31/128 (24%), Positives = 60/128 (46%), Gaps = 7/128 (5%)
Query: 43 VEALPKGKLEADRFIEQMKQENAALKADLEKQAYRLGVTEHLKETASASTAELSDPNNNI 102
++ + ++E IE++K E AL + L + L + E TA + + L++ N N+
Sbjct: 875 IQEVQLARVEEQEEIEKVKSEVDALTSKLAETQTALKLVEDALSTAEDNISRLTEENRNV 934
Query: 103 GGTADVANTKPSSSEADIESLVEQTLRKREQESV-------AKSNIALVESEREKAYGTE 155
+ A + + AD S+ + +S A+ NI+ + SE+E+A G
Sbjct: 935 QAAKENAELELQKAVADASSVASELDEVLATKSTLEAALMQAERNISDIISEKEEAQGRT 994
Query: 156 AAATVQQK 163
A A ++Q+
Sbjct: 995 ATAEMEQE 1002
>ref|NP_629712.1| (NC_003888) putative chromosome associated protein [Streptomyces
coelicolor A3(2)]
pir||T35661 probable chromosome associated protein - Streptomyces coelicolor
emb|CAA22420.1| (AL034447) putative chromosome associated protein [Streptomyces
coelicolor A3(2)]
Length = 1186
Score = 42.0 bits (97), Expect = 0.006
Identities = 41/179 (22%), Positives = 78/179 (42%), Gaps = 7/179 (3%)
Query: 32 ELVGEGRKFNDVEALPKGKLEADRFIEQMKQENAALKADLEKQAYRLGVTEHLKETASAS 91
EL E + + EA + LEA E ++ AA +ADLE++ L + E + A+ +
Sbjct: 329 ELEREAARVREQEAELEAALEA---AEHALEDTAAHRADLERE---LAMEERRLKDAARA 382
Query: 92 TAELSDPNNNIGGTADVANTKPSSSEADIESLVEQTLRKREQESVAKSNIALVESEREKA 151
A+ + +GG A ++ ++++A+IE L + ++ + A+ + +E +
Sbjct: 383 IADRRENLARLGGQVGAARSRAAAAQAEIERLAQARDESGQRAAAAQEEYEALRAEVDGL 442
Query: 152 YGTEAAATVQQKASELGLPMAELQSMAAKSPA-AFMQLMGKPAPRSNPLVQGSIRTEGS 209
+ + A+ L AE AA+ A A + R L G R +G+
Sbjct: 443 DAGDQELAERHDAARRALTEAEAALSAAREAATAAERQRAATQARHEALALGLRRKDGT 501
>pir||I51116 NF-180 - sea lamprey
gb|AAA80106.1| (U19361) NF-180 [Petromyzon marinus]
Length = 1110
Score = 42.0 bits (97), Expect = 0.006
Identities = 43/176 (24%), Positives = 78/176 (43%), Gaps = 12/176 (6%)
Query: 6 EEQVTPATQSEQVSSFEEPTSPSVVNELVGEGRKFNDVEALPKGKLEADRFIEQM----- 60
EE + A +E + EE + + E E + +VEA K ++EA+ +E+
Sbjct: 578 EEAIEKAEAAEAKAEVEEEEAEAEEEE--EEEAEEEEVEAETKEEVEAEAEVEEEGEAAE 635
Query: 61 --KQENAALKADLEKQAYRLGVTEHLKETASASTAELSDPNNNIGGTADV-ANTKPSSSE 117
+E A + ++ + + E +E A A+ AE + G DV A +K E
Sbjct: 636 EEAEEEEAEEEEVTSKKAKTQEAEVEEEEAEAAEAEAEEEAEEEAGEEDVEAESKEEEEE 695
Query: 118 ADIESLVEQTLRKREQESVAKSNIALVESEREKAYGTEAAATVQQKASELGLPMAE 173
E+ E+ + E+E V + + ++E +A E AA +++A+E AE
Sbjct: 696 DSKEADAEED--EAEEEEVKEEEVTKSDAEEAEAEAEEEAAKSEEEAAEEAKDEAE 749
Score = 39.3 bits (90), Expect = 0.042
Identities = 50/219 (22%), Positives = 87/219 (38%), Gaps = 11/219 (5%)
Query: 5 SEEQVTPATQSEQVSSFEEPTSPSVVNELVGEGRKFNDVEALPKGKLEADRFIEQMKQEN 64
+EE+ A E+V + E E E + + E K + +E+ + E
Sbjct: 608 AEEEEVEAETKEEVEAEAEVEEEGEAAEEEAEEEEAEEEEVTSKKAKTQEAEVEEEEAEA 667
Query: 65 AALKAD--LEKQAYRLGVTEHLKETASASTAELSDPNNNIGGTADVAN---TKPSSSEAD 119
A +A+ E++A V KE + E +D + +V TK + EA+
Sbjct: 668 AEAEAEEEAEEEAGEEDVEAESKEEEEEDSKE-ADAEEDEAEEEEVKEEEVTKSDAEEAE 726
Query: 120 IESLVEQTLRKREQESVAKSNIALVESEREKAYGTEAAA--TVQQKASELGLPMAELQSM 177
E+ E + E AK E+E E TEAA ++AS+ P E++
Sbjct: 727 AEAEEEAAKSEEEAAEEAKDEAEEEEAEEEAVEETEAATEEAEAKEASDDEKPEEEVKE- 785
Query: 178 AAKSPAAFMQLMGKPAPRSNPLVQGSIRTEGSTMQASSE 216
+++P A + P P++ P + + E T + E
Sbjct: 786 -SEAPVA-PEAKKAPEPKAAPKKKAPAKVESPTSEPEDE 822
>gb|AAB71984.1| (U73162) M-protein [Streptococcus equi]
Length = 534
Score = 41.6 bits (96), Expect = 0.008
Identities = 36/145 (24%), Positives = 67/145 (45%), Gaps = 16/145 (11%)
Query: 38 RKFNDVEALPKGKLEADRFIEQMKQENAALKADLEKQAYRLGVTEHLKETASASTAELSD 97
+K +++ L K +AD + A+L+ +LEK L V+E L E+ AEL
Sbjct: 302 QKLAEIDQLTADKAKADAELAAANDTIASLQTELEKAKTELAVSERLIESGKREIAELQK 361
Query: 98 PNNNIGGTADVANTKPSSSEADIESLVEQTLRKREQESVAKSNIALVESERE--KAYGTE 155
+ +D A + ++ A++E +Q++ + + +A +E E E KA +
Sbjct: 362 QKD----ASDKALVESQANVAELE----------KQKAASDAKVAELEKEVEAAKAEVAD 407
Query: 156 AAATVQQKASELGLPMAELQSMAAK 180
A + +K EL E +++ AK
Sbjct: 408 LKAQLAKKEEELEAVKKEKEALEAK 432
>gb|AAF73810.1|AF154038_1 (AF154038) surface protein PspC [Streptococcus pneumoniae]
Length = 660
Score = 41.2 bits (95), Expect = 0.011
Identities = 59/251 (23%), Positives = 103/251 (40%), Gaps = 35/251 (13%)
Query: 15 SEQVSSFEEPTSPSVVNELVGEGRKFNDV---------EALPKGKLEADRFIEQM--KQE 63
SEQ S + + V+ EL +K ND E LP L+ ++ + + K E
Sbjct: 253 SEQDKS-KRRANREVLGELATPDKKENDAKSSDSSVGEETLPSPSLKPEKKVAEAEKKVE 311
Query: 64 NAALKADLEKQAYRLGVTEHLKETASASTAELSDPNNNIGGTADVANTKPSSSEADIESL 123
A KA+ +K+ R T + T EL ++A + +A++E +
Sbjct: 312 EAKKKAEDQKEEDRRNYP-----TNTYKTLEL-----------EIAESDVEVKKAELELV 355
Query: 124 VEQTLRKREQESVAKSNIALVESEREKAYGTEAAATVQQKASELGLPMAELQSMAAKSPA 183
E+ R +E V ++ A VES++ +A E T ++KA E A + + PA
Sbjct: 356 KEEAKEPRNEEKVKQAK-AEVESKKAEATRLEKIKTDRKKAEEEAKRKAAEEDKVKEKPA 414
Query: 184 AFMQLMGKPAPRSNPLVQGSIRTEGSTMQ--ASSERDFGYYQRLRKENSSLYYK--PSTQ 239
Q PAP++ P + E + A + + Y +R +E + L + P T+
Sbjct: 415 E--QPQPAPAPKAAPAPKPENPAEQPKAEKPADQQAEEDYARRSEEEYNRLTQQQPPKTE 472
Query: 240 RAMMADADRLG 250
+ + G
Sbjct: 473 KPAQPSTPKTG 483
>dbj|BAB86220.1| (AP003922) contains ESTs AU093915(E1276),AU162319(E60301)~similar
to cytosolic factor~unknown protein [Oryza sativa
(japonica cultivar-group)]
dbj|BAB92436.1| (AP003297) contains ESTs AU093915(E1276),AU162319(E60301)~unknown
protein [Oryza sativa (japonica cultivar-group)]
Length = 613
Score = 40.4 bits (93), Expect = 0.019
Identities = 56/236 (23%), Positives = 91/236 (37%), Gaps = 37/236 (15%)
Query: 24 PTSPSVVNELVGEGRKFNDVEALPKGKLEADRFIEQMKQENAALKADLEKQAYRLGVTEH 83
P P E E K + +A + K E E+ K E AA A + TE
Sbjct: 115 PPPPKAKVEAAVEETKAEETKAEEEPKAEEPAKEEEPKAEVAAAAAAPPEAG-----TEE 169
Query: 84 LKETASASTAELSDPNNNIGGTADVANTKPSSSEADIESLV------------EQTLRKR 131
K AS+ A+ +P A+ A +P+ E+ E+ E+T+
Sbjct: 170 PKAEASSEEAKTEEPK------AEAAADEPAKEESKAEAAPAEEAKPAEPEPEEKTVVVT 223
Query: 132 EQESVAKSNIALVESEREKAYGTEAAATVQQKASELGLPMAELQSMAAKSPAAFMQLMGK 191
E+E+ K+ A+ E+ A AAAT + A P E+Q+ AA P +
Sbjct: 224 EEEAATKTVEAIEETVVPAAAAPAAAATEEAAA-----PEPEVQAAAAPEPVLIWGV--- 275
Query: 192 PAPRSNPLVQGSIRTEGSTMQASSERDFGYYQRLRKENSSLYYKPSTQRAMMADAD 247
PLV RT+ ++ R+F + + S++ ++ + DAD
Sbjct: 276 ------PLVGDDERTDTVLLKFLRAREFKVKEAMAMLRSAVLWRKRFGIESLLDAD 325
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.305 0.121 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 142,223,582
Number of Sequences: 1038431
Number of extensions: 5308819
Number of successful extensions: 15320
Number of sequences better than 5.0e-02: 16
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 15
Number of HSP's that attempted gapping in prelim test: 15304
Number of HSP's gapped (non-prelim): 27
length of query: 255
length of database: 328,470,697
effective HSP length: 119
effective length of query: 136
effective length of database: 204,897,408
effective search space: 27866047488
effective search space used: 27866047488
T: 11
A: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (22.0 bits)
S2: 90 (39.3 bits)
Query= orf54 [12685-12766]
(80 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.311 0.126 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 50,561,216
Number of Sequences: 1038431
Number of extensions: 1724941
Number of successful extensions: 4771
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4771
Number of HSP's gapped (non-prelim): 0
length of query: 80
length of database: 328,470,697
effective HSP length: 56
effective length of query: 24
effective length of database: 270,318,561
effective search space: 6487645464
effective search space used: 6487645464
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf55 [12765-12865]
(99 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_597932.1| (NC_002519) gp5-like protein [Roseophage S... 143 3e-034
ref|NP_064741.1| (NC_002519) gp5 [Roseophage SIO1] >gi|9944... 123 5e-028
>ref|NP_597932.1| (NC_002519) gp5-like protein [Roseophage SIO1]
Length = 99
Score = 143 bits (361), Expect = 3e-034
Identities = 75/87 (86%), Positives = 75/87 (86%)
Query: 1 MGARHFAERAQRVQNLTTMYQIKASDPTVAAHLSGKEFARLLADELGEPALFKENVSVSE 60
MGARHFAERAQRVQNLTTMYQIKASDPTVAAHLSGKEFARLLADELGEPALFKENVSVSE
Sbjct: 1 MGARHFAERAQRVQNLTTMYQIKASDPTVAAHLSGKEFARLLADELGEPALFKENVSVSE 60
Query: 61 QMETQKVATXXXXXXXXXXXXMVDQGS 87
QMETQKVAT MVDQGS
Sbjct: 61 QMETQKVATEAQIEFEAEQEEMVDQGS 87
>ref|NP_064741.1| (NC_002519) gp5 [Roseophage SIO1]
gb|AAG02587.1|AF189021_5 (AF189021) Roseophage SIO1 complete genome
Length = 271
Score = 123 bits (308), Expect = 5e-028
Identities = 62/62 (100%), Positives = 62/62 (100%)
Query: 1 MGARHFAERAQRVQNLTTMYQIKASDPTVAAHLSGKEFARLLADELGEPALFKENVSVSE 60
MGARHFAERAQRVQNLTTMYQIKASDPTVAAHLSGKEFARLLADELGEPALFKENVSVSE
Sbjct: 178 MGARHFAERAQRVQNLTTMYQIKASDPTVAAHLSGKEFARLLADELGEPALFKENVSVSE 237
Query: 61 QM 62
QM
Sbjct: 238 QM 239
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.313 0.125 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 36,726,995
Number of Sequences: 1038431
Number of extensions: 976071
Number of successful extensions: 1934
Number of sequences better than 5.0e-02: 2
Number of HSP's better than 0.1 without gapping: 2
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 1932
Number of HSP's gapped (non-prelim): 2
length of query: 99
length of database: 328,470,697
effective HSP length: 75
effective length of query: 24
effective length of database: 250,588,372
effective search space: 6014120928
effective search space used: 6014120928
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 84 (37.0 bits)
Query= orf56 [12868-13042]
(170 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,038,431 sequences; 328,470,697 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064741.1| (NC_002519) gp5 [Roseophage SIO1] >gi|9944... 307 3e-083
ref|NP_640321.1| (NC_003907) hypothetical protein [Vibrioph... 118 2e-026
>ref|NP_064741.1| (NC_002519) gp5 [Roseophage SIO1]
gb|AAG02587.1|AF189021_5 (AF189021) Roseophage SIO1 complete genome
Length = 271
Score = 307 bits (787), Expect = 3e-083
Identities = 154/159 (96%), Positives = 155/159 (96%)
Query: 1 MGPLDNLVGMQYRIDHLENLKADVFDQIAYPVLKIRGDVEDFDFEPNARIYLGDEGDVGY 60
MGPLDNLVGMQYRIDHLENLKADVFDQIAYPVLKIRGDVEDFDFEPNARIYLGDEGDVGY
Sbjct: 1 MGPLDNLVGMQYRIDHLENLKADVFDQIAYPVLKIRGDVEDFDFEPNARIYLGDEGDVGY 60
Query: 61 LVPDSTALNADFQIQNIEAKMEMMAGAPREAMGIRSAGEKTAFEVGQLMTAAGRIFQHKT 120
LVPDSTALNADFQIQNIEAKMEMMAGAPREAMGIRSAGEKTAFEVGQLMTAAGRIFQHKT
Sbjct: 61 LVPDSTALNADFQIQNIEAKMEMMAGAPREAMGIRSAGEKTAFEVGQLMTAAGRIFQHKT 120
Query: 121 AHFERVFLEPILNAMLETARRNMDYEDTAKVLNRSLLLH 159
AHFERVFLEPILNAMLETARRNMDYEDTAKVLN L+
Sbjct: 121 AHFERVFLEPILNAMLETARRNMDYEDTAKVLNEDTGLY 159
>ref|NP_640321.1| (NC_003907) hypothetical protein [Vibriophage VpV262]
gb|AAM28399.1| (AY095314) hypothetical protein [Vibriophage VpV262]
Length = 599
Score = 118 bits (296), Expect = 2e-026
Identities = 64/158 (40%), Positives = 89/158 (55%), Gaps = 2/158 (1%)
Query: 1 MGPLDNLVGMQYRIDHLENLKADVFDQIAYPVLKIRGDVEDFDFE--PNARIYLGDEGDV 58
+GPL L GMQY++D EN + D+ D+ +P LK GDV + PN + + GDV
Sbjct: 326 IGPLHRLTGMQYKLDKRENFREDLHDRFLHPSLKKVGDVREKGMRGGPNHVFEVEETGDV 385
Query: 59 GYLVPDSTALNADFQIQNIEAKMEMMAGAPREAMGIRSAGEKTAFEVGQLMTAAGRIFQH 118
Y+ P + L D Q+ ME ++GAP+E++G R+AGEKT FEV L ++F+
Sbjct: 386 QYMTPPAEVLQPDNQLSITLQLMEDLSGAPKESIGQRTAGEKTKFEVQLLDQGQNKVFRR 445
Query: 119 KTAHFERVFLEPILNAMLETARRNMDYEDTAKVLNRSL 156
K FER L P+LN LE R ++D DT K N L
Sbjct: 446 KVKKFERELLTPVLNDYLEQGRNHLDASDTIKTFNSEL 483
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Aug 23, 2002 1:09 AM
Number of letters in database: 328,470,697
Number of sequences in database: 1,038,431
Lambda K H
0.321 0.137 4.65e-269
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 99,556,760
Number of Sequences: 1038431
Number of extensions: 3647551
Number of successful extensions: 8434
Number of sequences better than 5.0e-02: 2
Number of HSP's better than 0.1 without gapping: 2
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 8431
Number of HSP's gapped (non-prelim): 2
length of query: 170
length of database: 328,470,697
effective HSP length: 112
effective length of query: 58
effective length of database: 212,166,425
effective search space: 12305652650
effective search space used: 12305652650
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 87 (38.1 bits)