National Center for Biotechnology Information (NCBI)
welcome to the blast network service.
PEPTIDE SEQUENCE DATABASES
nr Non-redundant GenBank CDS translations+PDB+SwissProt+PIR
month All new or revised GenBank CDS translation+PDB+SwissProt+PIR
sequences released in the last 30 days
pdb PDB protein sequences
yeast Yeast (Saccharomyces cerevisiae) protein sequences.
kabat Kabat Sequences of Proteins of Immunological Interest
alu * Translations of Select Alu Repeats from REPBASE
swissprot SwissProt sequences
NUCLEOTIDE SEQUENCE DATABASES
nr Non-redundant GenBank+EMBL+DDBJ+PDB sequences
(but no EST, STS, GSS, or HTGS sequences)
month All new or revised GenBank+EMBL+DDBJ+PDB sequences released in
the last 30 days
yeast Yeast (Saccharomyces cerevisiae) genomic nucleotide sequences.
est + Non-redundant Database of GenBank+EMBL+DDBJ EST Division
sts + Non-redundant Database of GenBank+EMBL+DDBJ STS Division
htgs High Throughput Genomic Sequences
pdb PDB nucleotide sequences
vector Vector subset of GenBank
mito * Database of mitochondrial sequences, Rel. 1.0, July 1995
gss Genome Survey Sequences (includes single-pass genomic data, exon-
trapped sequences, and Alu PCR primers.
kabat Kabat Sequences of Nucleic Acid of Immunological Interest
epd Eukaryotic Promotor Database
alu *+ Select Alu Repeats from REPBASE
BLASTP 2.2.2 [Dec-14-2001]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= orf1 [1-19]
(14 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.324 0.144 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,367,654
Number of Sequences: 891141
Number of extensions: 42312
Number of successful extensions: 11
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 11
Number of HSP's gapped (non-prelim): 0
length of query: 14
length of database: 279,076,897
effective HSP length: 0
effective length of query: 24
effective length of database: 279,076,897
effective search space: 6697845528
effective search space used: 6697845528
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 84 (37.0 bits)
Query= orf2 [147-284]
(136 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064738.1| (NC_002519) gp1 [Roseophage SIO1] >gi|9944... 281 1e-075
ref|NP_064742.1| (NC_002519) gp4 [Roseophage SIO1] >gi|9944... 279 4e-075
>ref|NP_064738.1| (NC_002519) gp1 [Roseophage SIO1]
gb|AAG02583.1|AF189021_1 (AF189021) Roseophage SIO1 complete genome
Length = 136
Score = 281 bits (718), Expect = 1e-075
Identities = 136/136 (100%), Positives = 136/136 (100%)
Query: 1 MHCDIAVLDDVVVFENAYTNEGRNKVKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLM 60
MHCDIAVLDDVVVFENAYTNEGRNKVKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLM
Sbjct: 1 MHCDIAVLDDVVVFENAYTNEGRNKVKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLM 60
Query: 61 GMEEDIYSKEGELVGKENIYEVMEKAVEDNGDGTGEFLWPRQLRKDGKFFGFDVQILAKK 120
GMEEDIYSKEGELVGKENIYEVMEKAVEDNGDGTGEFLWPRQLRKDGKFFGFDVQILAKK
Sbjct: 61 GMEEDIYSKEGELVGKENIYEVMEKAVEDNGDGTGEFLWPRQLRKDGKFFGFDVQILAKK 120
Query: 121 RGQYLDRVQFRAQYYQ 136
RGQYLDRVQFRAQYYQ
Sbjct: 121 RGQYLDRVQFRAQYYQ 136
>ref|NP_064742.1| (NC_002519) gp4 [Roseophage SIO1]
gb|AAG02586.1|AF189021_4 (AF189021) Roseophage SIO1 complete genome
Length = 353
Score = 279 bits (713), Expect = 4e-075
Identities = 135/135 (100%), Positives = 135/135 (100%)
Query: 1 MHCDIAVLDDVVVFENAYTNEGRNKVKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLM 60
MHCDIAVLDDVVVFENAYTNEGRNKVKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLM
Sbjct: 1 MHCDIAVLDDVVVFENAYTNEGRNKVKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLM 60
Query: 61 GMEEDIYSKEGELVGKENIYEVMEKAVEDNGDGTGEFLWPRQLRKDGKFFGFDVQILAKK 120
GMEEDIYSKEGELVGKENIYEVMEKAVEDNGDGTGEFLWPRQLRKDGKFFGFDVQILAKK
Sbjct: 61 GMEEDIYSKEGELVGKENIYEVMEKAVEDNGDGTGEFLWPRQLRKDGKFFGFDVQILAKK 120
Query: 121 RGQYLDRVQFRAQYY 135
RGQYLDRVQFRAQYY
Sbjct: 121 RGQYLDRVQFRAQYY 135
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.317 0.137 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 79,242,119
Number of Sequences: 891141
Number of extensions: 3382608
Number of successful extensions: 7698
Number of sequences better than 5.0e-02: 2
Number of HSP's better than 0.1 without gapping: 2
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 7696
Number of HSP's gapped (non-prelim): 2
length of query: 136
length of database: 279,076,897
effective HSP length: 112
effective length of query: 24
effective length of database: 179,269,105
effective search space: 4302458520
effective search space used: 4302458520
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 83 (36.6 bits)
Query= orf3 [273-507]
(233 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064739.1| (NC_002519) gp2 [Roseophage SIO1] >gi|9944... 479 e-134
ref|NP_064742.1| (NC_002519) gp4 [Roseophage SIO1] >gi|9944... 396 e-110
>ref|NP_064739.1| (NC_002519) gp2 [Roseophage SIO1]
gb|AAG02584.1|AF189021_2 (AF189021) Roseophage SIO1 complete genome
Length = 247
Score = 479 bits (1232), Expect = e-134
Identities = 233/233 (100%), Positives = 233/233 (100%)
Query: 1 VFSLEHSITNDPTDPDSQPIAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVDFAYSVS 60
VFSLEHSITNDPTDPDSQPIAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVDFAYSVS
Sbjct: 15 VFSLEHSITNDPTDPDSQPIAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVDFAYSVS 74
Query: 61 KRADYTAIVVIGVDSENNVYVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAECTAAQS 120
KRADYTAIVVIGVDSENNVYVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAECTAAQS
Sbjct: 75 KRADYTAIVVIGVDSENNVYVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAECTAAQS 134
Query: 121 AIVSELKDNYIKPNGLALKIDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRGGNCQVL 180
AIVSELKDNYIKPNGLALKIDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRGGNCQVL
Sbjct: 135 AIVSELKDNYIKPNGLALKIDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRGGNCQVL 194
Query: 181 EEELVSYNPAHDDCKDCLAAAVEVAVKPSSAVKRKRSQDNNVVFHPKFGGVAF 233
EEELVSYNPAHDDCKDCLAAAVEVAVKPSSAVKRKRSQDNNVVFHPKFGGVAF
Sbjct: 195 EEELVSYNPAHDDCKDCLAAAVEVAVKPSSAVKRKRSQDNNVVFHPKFGGVAF 247
>ref|NP_064742.1| (NC_002519) gp4 [Roseophage SIO1]
gb|AAG02586.1|AF189021_4 (AF189021) Roseophage SIO1 complete genome
Length = 353
Score = 396 bits (1018), Expect = e-110
Identities = 190/190 (100%), Positives = 190/190 (100%)
Query: 10 NDPTDPDSQPIAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVDFAYSVSKRADYTAIV 69
NDPTDPDSQPIAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVDFAYSVSKRADYTAIV
Sbjct: 136 NDPTDPDSQPIAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVDFAYSVSKRADYTAIV 195
Query: 70 VIGVDSENNVYVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAECTAAQSAIVSELKDN 129
VIGVDSENNVYVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAECTAAQSAIVSELKDN
Sbjct: 196 VIGVDSENNVYVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAECTAAQSAIVSELKDN 255
Query: 130 YIKPNGLALKIDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRGGNCQVLEEELVSYNP 189
YIKPNGLALKIDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRGGNCQVLEEELVSYNP
Sbjct: 256 YIKPNGLALKIDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRGGNCQVLEEELVSYNP 315
Query: 190 AHDDCKDCLA 199
AHDDCKDCLA
Sbjct: 316 AHDDCKDCLA 325
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.318 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 133,970,557
Number of Sequences: 891141
Number of extensions: 5397828
Number of successful extensions: 11647
Number of sequences better than 5.0e-02: 2
Number of HSP's better than 0.1 without gapping: 2
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 11645
Number of HSP's gapped (non-prelim): 2
length of query: 233
length of database: 279,076,897
effective HSP length: 116
effective length of query: 117
effective length of database: 175,704,541
effective search space: 20557431297
effective search space used: 20557431297
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 89 (38.9 bits)
Query= orf4 [527-810]
(282 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064740.1| (NC_002519) gp3 [Roseophage SIO1] >gi|9944... 552 e-156
>ref|NP_064740.1| (NC_002519) gp3 [Roseophage SIO1]
gb|AAG02585.1|AF189021_3 (AF189021) Roseophage SIO1 complete genome
Length = 282
Score = 552 bits (1423), Expect = e-156
Identities = 271/271 (100%), Positives = 271/271 (100%)
Query: 1 MAGTTIDIESMIDPHSLAVEIANRWTSWNNARSEKVKEWKELRNYIYATDTRTTSNNKLP 60
MAGTTIDIESMIDPHSLAVEIANRWTSWNNARSEKVKEWKELRNYIYATDTRTTSNNKLP
Sbjct: 1 MAGTTIDIESMIDPHSLAVEIANRWTSWNNARSEKVKEWKELRNYIYATDTRTTSNNKLP 60
Query: 61 WSNSTTTPKLTQIADNLHANYFAALFPQKRWFRFEATDADSDTKIKRSIIQAYMQNKLRQ 120
WSNSTTTPKLTQIADNLHANYFAALFPQKRWFRFEATDADSDTKIKRSIIQAYMQNKLRQ
Sbjct: 61 WSNSTTTPKLTQIADNLHANYFAALFPQKRWFRFEATDADSDTKIKRSIIQAYMQNKLRQ 120
Query: 121 SDFVNTTSKLVNDYIQYGNCFATVDFERKVTKYEDGDRIVNYVGPKVVRISPFDICFNPL 180
SDFVNTTSKLVNDYIQYGNCFATVDFERKVTKYEDGDRIVNYVGPKVVRISPFDICFNPL
Sbjct: 121 SDFVNTTSKLVNDYIQYGNCFATVDFERKVTKYEDGDRIVNYVGPKVVRISPFDICFNPL 180
Query: 181 AANFSDTPKIVRSVLTLGEIQRMVENDSSKGYMADIFNKMLGNRGSARGNEVDINKSEGF 240
AANFSDTPKIVRSVLTLGEIQRMVENDSSKGYMADIFNKMLGNRGSARGNEVDINKSEGF
Sbjct: 181 AANFSDTPKIVRSVLTLGEIQRMVENDSSKGYMADIFNKMLGNRGSARGNEVDINKSEGF 240
Query: 241 VADGFASLTDYYESDYVEVLTFYGDIYDKVR 271
VADGFASLTDYYESDYVEVLTFYGDIYDKVR
Sbjct: 241 VADGFASLTDYYESDYVEVLTFYGDIYDKVR 271
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.317 0.133 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 149,213,115
Number of Sequences: 891141
Number of extensions: 5980115
Number of successful extensions: 15702
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 15701
Number of HSP's gapped (non-prelim): 1
length of query: 282
length of database: 279,076,897
effective HSP length: 119
effective length of query: 163
effective length of database: 173,031,118
effective search space: 28204072234
effective search space used: 28204072234
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 90 (39.3 bits)
Query= orf5 [843-1115]
(271 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064741.1| (NC_002519) gp5 [Roseophage SIO1] >gi|9944... 537 e-152
>ref|NP_064741.1| (NC_002519) gp5 [Roseophage SIO1]
gb|AAG02587.1|AF189021_5 (AF189021) Roseophage SIO1 complete genome
Length = 271
Score = 537 bits (1384), Expect = e-152
Identities = 271/271 (100%), Positives = 271/271 (100%)
Query: 1 MGPLDNLVGMQYRIDHLENLKADVFDQIAYPVLKIRGDVEDFDFEPNARIYLGDEGDVGY 60
MGPLDNLVGMQYRIDHLENLKADVFDQIAYPVLKIRGDVEDFDFEPNARIYLGDEGDVGY
Sbjct: 1 MGPLDNLVGMQYRIDHLENLKADVFDQIAYPVLKIRGDVEDFDFEPNARIYLGDEGDVGY 60
Query: 61 LVPDSTALNADFQIQNIEAKMEMMAGAPREAMGIRSAGEKTAFEVGQLMTAAGRIFQHKT 120
LVPDSTALNADFQIQNIEAKMEMMAGAPREAMGIRSAGEKTAFEVGQLMTAAGRIFQHKT
Sbjct: 61 LVPDSTALNADFQIQNIEAKMEMMAGAPREAMGIRSAGEKTAFEVGQLMTAAGRIFQHKT 120
Query: 121 AHFERVFLEPILNAMLETARRNMDYEDTAKVLNEDTGLYFFTQITRDDIKANGKIVPMGA 180
AHFERVFLEPILNAMLETARRNMDYEDTAKVLNEDTGLYFFTQITRDDIKANGKIVPMGA
Sbjct: 121 AHFERVFLEPILNAMLETARRNMDYEDTAKVLNEDTGLYFFTQITRDDIKANGKIVPMGA 180
Query: 181 RHFAERAQRVQNLTTMYQIKASDPTVAAHLSGKEFARLLADELGEPALFKENVSVSEQMG 240
RHFAERAQRVQNLTTMYQIKASDPTVAAHLSGKEFARLLADELGEPALFKENVSVSEQMG
Sbjct: 181 RHFAERAQRVQNLTTMYQIKASDPTVAAHLSGKEFARLLADELGEPALFKENVSVSEQMG 240
Query: 241 KTVFTVVMRWVVGNKFLFQNLTVTTTVVIHL 271
KTVFTVVMRWVVGNKFLFQNLTVTTTVVIHL
Sbjct: 241 KTVFTVVMRWVVGNKFLFQNLTVTTTVVIHL 271
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.321 0.137 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 133,477,418
Number of Sequences: 891141
Number of extensions: 5085275
Number of successful extensions: 12941
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 12940
Number of HSP's gapped (non-prelim): 1
length of query: 271
length of database: 279,076,897
effective HSP length: 118
effective length of query: 153
effective length of database: 173,922,259
effective search space: 26610105627
effective search space used: 26610105627
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 90 (39.3 bits)
Query= orf6 [1053-1502]
(448 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064742.1| (NC_002519) gp4 [Roseophage SIO1] >gi|9944... 732 0.0
ref|NP_064739.1| (NC_002519) gp2 [Roseophage SIO1] >gi|9944... 396 e-109
ref|NP_064738.1| (NC_002519) gp1 [Roseophage SIO1] >gi|9944... 279 5e-074
sp|P10310|VDMB_BPT3 DNA MATURASE B (DNA PACKAGING PROTEIN B... 54 4e-006
ref|NP_042010.1| (NC_001604) gene 19, DNA maturation protei... 53 6e-006
ref|NP_298858.1| (NC_002488) hypothetical protein [Xylella ... 46 7e-004
>ref|NP_064742.1| (NC_002519) gp4 [Roseophage SIO1]
gb|AAG02586.1|AF189021_4 (AF189021) Roseophage SIO1 complete genome
Length = 353
Score = 732 bits (1889), Expect = 0.0
Identities = 353/353 (100%), Positives = 353/353 (100%)
Query: 96 MHCDIAVLDDVVVFENAYTNEGRNKVKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLM 155
MHCDIAVLDDVVVFENAYTNEGRNKVKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLM
Sbjct: 1 MHCDIAVLDDVVVFENAYTNEGRNKVKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLM 60
Query: 156 GMEEDIYSKEGELVGKENIYEVMEKAVEDNGDGTGEFLWPRQLRKDGKFFGFDVQILAKK 215
GMEEDIYSKEGELVGKENIYEVMEKAVEDNGDGTGEFLWPRQLRKDGKFFGFDVQILAKK
Sbjct: 61 GMEEDIYSKEGELVGKENIYEVMEKAVEDNGDGTGEFLWPRQLRKDGKFFGFDVQILAKK 120
Query: 216 RGQYLDRVQFRAQYYNDPTDPDSQPIAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVD 275
RGQYLDRVQFRAQYYNDPTDPDSQPIAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVD
Sbjct: 121 RGQYLDRVQFRAQYYNDPTDPDSQPIAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVD 180
Query: 276 FAYSVSKRADYTAIVVIGVDSENNVYVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAE 335
FAYSVSKRADYTAIVVIGVDSENNVYVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAE
Sbjct: 181 FAYSVSKRADYTAIVVIGVDSENNVYVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAE 240
Query: 336 CTAAQSAIVSELKDNYIKPNGLALKIDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRG 395
CTAAQSAIVSELKDNYIKPNGLALKIDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRG
Sbjct: 241 CTAAQSAIVSELKDNYIKPNGLALKIDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRG 300
Query: 396 GNCQVLEEELVSYNPAHDDCKDCLAHLFTDRYVLFEECWLTKFVCKKTGKLFA 448
GNCQVLEEELVSYNPAHDDCKDCLAHLFTDRYVLFEECWLTKFVCKKTGKLFA
Sbjct: 301 GNCQVLEEELVSYNPAHDDCKDCLAHLFTDRYVLFEECWLTKFVCKKTGKLFA 353
>ref|NP_064739.1| (NC_002519) gp2 [Roseophage SIO1]
gb|AAG02584.1|AF189021_2 (AF189021) Roseophage SIO1 complete genome
Length = 247
Score = 396 bits (1018), Expect = e-109
Identities = 190/190 (100%), Positives = 190/190 (100%)
Query: 231 NDPTDPDSQPIAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVDFAYSVSKRADYTAIV 290
NDPTDPDSQPIAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVDFAYSVSKRADYTAIV
Sbjct: 24 NDPTDPDSQPIAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVDFAYSVSKRADYTAIV 83
Query: 291 VIGVDSENNVYVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAECTAAQSAIVSELKDN 350
VIGVDSENNVYVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAECTAAQSAIVSELKDN
Sbjct: 84 VIGVDSENNVYVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAECTAAQSAIVSELKDN 143
Query: 351 YIKPNGLALKIDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRGGNCQVLEEELVSYNP 410
YIKPNGLALKIDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRGGNCQVLEEELVSYNP
Sbjct: 144 YIKPNGLALKIDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRGGNCQVLEEELVSYNP 203
Query: 411 AHDDCKDCLA 420
AHDDCKDCLA
Sbjct: 204 AHDDCKDCLA 213
>ref|NP_064738.1| (NC_002519) gp1 [Roseophage SIO1]
gb|AAG02583.1|AF189021_1 (AF189021) Roseophage SIO1 complete genome
Length = 136
Score = 279 bits (713), Expect = 5e-074
Identities = 135/135 (100%), Positives = 135/135 (100%)
Query: 96 MHCDIAVLDDVVVFENAYTNEGRNKVKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLM 155
MHCDIAVLDDVVVFENAYTNEGRNKVKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLM
Sbjct: 1 MHCDIAVLDDVVVFENAYTNEGRNKVKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLM 60
Query: 156 GMEEDIYSKEGELVGKENIYEVMEKAVEDNGDGTGEFLWPRQLRKDGKFFGFDVQILAKK 215
GMEEDIYSKEGELVGKENIYEVMEKAVEDNGDGTGEFLWPRQLRKDGKFFGFDVQILAKK
Sbjct: 61 GMEEDIYSKEGELVGKENIYEVMEKAVEDNGDGTGEFLWPRQLRKDGKFFGFDVQILAKK 120
Query: 216 RGQYLDRVQFRAQYY 230
RGQYLDRVQFRAQYY
Sbjct: 121 RGQYLDRVQFRAQYY 135
>sp|P10310|VDMB_BPT3 DNA MATURASE B (DNA PACKAGING PROTEIN B) (GP19)
pir||JVBPB3 DNA maturase B - phage T3
gb|AAA92528.1| (M14784) DNA-packaging protein B [Bacteriophage T3]
Length = 586
Score = 53.5 bits (127), Expect = 4e-006
Identities = 35/116 (30%), Positives = 55/116 (47%), Gaps = 10/116 (8%)
Query: 6 VAWELTKNPTLRVLYISATSNLAQKQLSFIKNIFESDIHQKYWPEHLHKDESKREKWTTS 65
V W L ++P L++L +SA+ A FIKNI I + + L +R+ +
Sbjct: 72 VVWTLWRDPQLKILIVSASKERADLNSIFIKNI----IDLLPFLDELKPSPGQRDSVISF 127
Query: 66 EIALDHPDRKKEAIRDPSIFTGGLTTSLTGMHCDIAVLDDVVVFENAYTNEGRNKV 121
++ PD PS+ + G+T LTG DI + DDV + N+ T R K+
Sbjct: 128 DVGPAKPDHS------PSVKSVGITGQLTGSRADIIIADDVEIPSNSATQGAREKL 177
>ref|NP_042010.1| (NC_001604) gene 19, DNA maturation protein [Enterobacteria phage
T7]
sp|P03694|VDMB_BPT7 DNA MATURASE B (DNA PACKAGING PROTEIN B) (GP19)
pir||JVBPB7 DNA maturase B - phage T7
emb|CAA24440.1| (V01146) gene 19, DNA maturation protein [Enterobacteria phage T7]
Length = 586
Score = 53.1 bits (126), Expect = 6e-006
Identities = 44/153 (28%), Positives = 68/153 (43%), Gaps = 16/153 (10%)
Query: 6 VAWELTKNPTLRVLYISATSNLAQKQLSFIKNIFESDIHQKYWPEHLHKDESKREKWTTS 65
V W L ++P L++L +SA+ A FIKNI I + L +R+ +
Sbjct: 72 VVWSLWRDPQLKILIVSASKERADANSIFIKNI----IDLLPFLSELKPRPGQRDSVISF 127
Query: 66 EIALDHPDRKKEAIRDPSIFTGGLTTSLTGMHCDIAVLDDVVVFENAYTNEGRNK----V 121
++ +PD PS+ + G+T LTG DI + DDV + N+ T R K V
Sbjct: 128 DVGPANPDHS------PSVKSVGITGQLTGSRADIIIADDVEIPSNSATMGAREKLWTLV 181
Query: 122 KSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDL 154
+ +LL + S +GT LY +L
Sbjct: 182 QEFAALLKPLPSSRV--IYLGTPQTEMTLYKEL 212
>ref|NP_298858.1| (NC_002488) hypothetical protein [Xylella fastidiosa 9a5c]
ref|NP_298963.1| (NC_002488) hypothetical protein [Xylella fastidiosa 9a5c]
pir||H82649 hypothetical protein XF1674 XF1569 [imported] - Xylella fastidiosa
(strain 9a5c)
gb|AAF84378.1|AE003986_8 (AE003986) hypothetical protein [Xylella fastidiosa 9a5c]
gb|AAF84483.1|AE003993_2 (AE003993) hypothetical protein [Xylella fastidiosa 9a5c]
Length = 316
Score = 46.2 bits (108), Expect = 7e-004
Identities = 34/132 (25%), Positives = 62/132 (46%), Gaps = 14/132 (10%)
Query: 10 LTKNPTLRVLYISATSNLAQKQLSFIKNIFESDIHQKYWPEHLHKDESKREKWTTSEIAL 69
L NP R L+IS + +LA + I +SD ++ WP + D +++W + +
Sbjct: 85 LALNPYARFLHISYSDDLALLNSETAREIVQSDEYRALWPLEIADDAKSKKRW---NVVV 141
Query: 70 DHPDRKKEAIRDPSIFTGGLTTSLTGMHC-----DIAVLDDVVVFENAYTNEGRNKVKSQ 124
D K+A ++ GG T H ++DD + E+AY+ GR+K +
Sbjct: 142 D----GKKAGGVYAVSLGGQVTGFRAGHMAPGWQGAIIIDDPLKVEDAYSKTGRSKANRK 197
Query: 125 YSLLSSIEGSEA 136
L+S+++ +A
Sbjct: 198 --LVSTVKSRKA 207
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.318 0.136 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 263,060,621
Number of Sequences: 891141
Number of extensions: 11719138
Number of successful extensions: 25811
Number of sequences better than 5.0e-02: 9
Number of HSP's better than 0.1 without gapping: 3
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 25802
Number of HSP's gapped (non-prelim): 9
length of query: 448
length of database: 279,076,897
effective HSP length: 123
effective length of query: 325
effective length of database: 169,466,554
effective search space: 55076630050
effective search space used: 55076630050
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 92 (40.0 bits)
Query= orf7 [1562-1751]
(188 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064743.1| (NC_002519) gp5.1 [Roseophage SIO1] >gi|99... 396 e-110
>ref|NP_064743.1| (NC_002519) gp5.1 [Roseophage SIO1]
gb|AAG02588.1|AF189021_6 (AF189021) Roseophage SIO1 complete genome
Length = 188
Score = 396 bits (1017), Expect = e-110
Identities = 188/188 (100%), Positives = 188/188 (100%)
Query: 1 MTLHLENISFVAVPKTATIAIEKAFSPYARGYTPHTHEPVNVVKRDGGQDCLGLIRNPHD 60
MTLHLENISFVAVPKTATIAIEKAFSPYARGYTPHTHEPVNVVKRDGGQDCLGLIRNPHD
Sbjct: 1 MTLHLENISFVAVPKTATIAIEKAFSPYARGYTPHTHEPVNVVKRDGGQDCLGLIRNPHD 60
Query: 61 WIKSYYLYLKHSPYFYSASSAWGIGEKTFEEFVGRFCDGHRLWPEPMRLQSMYLLRNGVS 120
WIKSYYLYLKHSPYFYSASSAWGIGEKTFEEFVGRFCDGHRLWPEPMRLQSMYLLRNGVS
Sbjct: 61 WIKSYYLYLKHSPYFYSASSAWGIGEKTFEEFVGRFCDGHRLWPEPMRLQSMYLLRNGVS 120
Query: 121 TDFIYRFENLLDAVAHLGAACGQMPKMSRHNVSPKCDVILSKKMTYLFEDAASEDFDLYE 180
TDFIYRFENLLDAVAHLGAACGQMPKMSRHNVSPKCDVILSKKMTYLFEDAASEDFDLYE
Sbjct: 121 TDFIYRFENLLDAVAHLGAACGQMPKMSRHNVSPKCDVILSKKMTYLFEDAASEDFDLYE 180
Query: 181 SGADGVMT 188
SGADGVMT
Sbjct: 181 SGADGVMT 188
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.322 0.138 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 110,639,055
Number of Sequences: 891141
Number of extensions: 4469421
Number of successful extensions: 7499
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 7498
Number of HSP's gapped (non-prelim): 1
length of query: 188
length of database: 279,076,897
effective HSP length: 113
effective length of query: 75
effective length of database: 178,377,964
effective search space: 13378347300
effective search space used: 13378347300
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 87 (38.1 bits)
Query= orf8 [1750-1887]
(136 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_102239.1| (NC_002678) unknown protein [Mesorhizobium... 44 2e-004
>ref|NP_102239.1| (NC_002678) unknown protein [Mesorhizobium loti]
dbj|BAB48025.1| (AP002994) unknown protein [Mesorhizobium loti]
Length = 130
Score = 44.3 bits (103), Expect = 2e-004
Identities = 21/57 (36%), Positives = 32/57 (55%), Gaps = 3/57 (5%)
Query: 1 MKYKSARYINSTKIDCEIEHPVHGWIPFTCDPSDA---GSEFDVAALHAEMAANPET 54
M+Y++A + +D EI HP +GWIPFT P D+ G E +A + +P+T
Sbjct: 2 MEYRNAAFNQFGTVDLEINHPEYGWIPFTASPDDSEELGREIFASASPIAASYDPKT 58
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.316 0.133 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 78,172,583
Number of Sequences: 891141
Number of extensions: 2869789
Number of successful extensions: 7223
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 7222
Number of HSP's gapped (non-prelim): 1
length of query: 136
length of database: 279,076,897
effective HSP length: 112
effective length of query: 24
effective length of database: 179,269,105
effective search space: 4302458520
effective search space used: 4302458520
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 83 (36.6 bits)
Query= orf9 [1887-2085]
(197 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064744.1| (NC_002519) gp6 [Roseophage SIO1] >gi|9944... 362 1e-099
ref|NP_570387.1| (NC_003390) unknown [Cyanophage P60] >gi|1... 38 0.049
>ref|NP_064744.1| (NC_002519) gp6 [Roseophage SIO1]
gb|AAG02589.1|AF189021_7 (AF189021) Roseophage SIO1 complete genome
Length = 249
Score = 362 bits (929), Expect = 1e-099
Identities = 170/197 (86%), Positives = 170/197 (86%)
Query: 1 MLEVTSDGSMVYQRYTTYNSVHTVYVRTYYSGTWYNWAKQWDNLNDGSGSGLDADLLDGQ 60
MLEVTSDGSMVYQRYTTYNSVHTVYVRTYYSGTWYNWAKQWDNLNDGSGSGLDADLLDGQ
Sbjct: 53 MLEVTSDGSMVYQRYTTYNSVHTVYVRTYYSGTWYNWAKQWDNLNDGSGSGLDADLLDGQ 112
Query: 61 QGSYYYPASNPNGYTAYTNSDVDTHLNTGTATTDQVLSWNGTDYDWIDGGGAPPTDLHAV 120
QGSYYYPASNPNGYTAYTNSDVDTHLNTGTATTDQVLSWNGTDYDWIDGGGAPPTDLHAV
Sbjct: 113 QGSYYYPASNPNGYTAYTNSDVDTHLNTGTATTDQVLSWNGTDYDWIDGGGAPPTDLHAV 172
Query: 121 GTYTVACTGVNVAQGGTTAGSNLRYRNIYALDRFXXXXXXXXXXXXXGTWRNVGGPVXXX 180
GTYTVACTGVNVAQGGTTAGSNLRYRNIYALDRF GTWRNVGGPV
Sbjct: 173 GTYTVACTGVNVAQGGTTAGSNLRYRNIYALDRFTVLTTTSSTVTLTGTWRNVGGPVQYV 232
Query: 181 XXXXXXXXXXXLWVRIS 197
LWVRIS
Sbjct: 233 SGRGGGQRGVSLWVRIS 249
>ref|NP_570387.1| (NC_003390) unknown [Cyanophage P60]
gb|AAL73325.1| (AF338467) unknown [Cyanophage P60]
Length = 126
Score = 38.1 bits (87), Expect = 0.049
Identities = 23/59 (38%), Positives = 32/59 (53%), Gaps = 5/59 (8%)
Query: 46 DGSGSGLDADLLDGQQGSYYYPASNPNGYTAYTN--SDVDTHLNTGTATTDQVLSWNGT 102
DG+GSGLDAD LDG+ GS+Y +N SD++ +N Q+LS + T
Sbjct: 61 DGTGSGLDADKLDGKNGSHYLNYNNATNKPDIRGNLSDLNDDIN---VAVQQILSAHST 116
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.314 0.133 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 117,729,832
Number of Sequences: 891141
Number of extensions: 5421487
Number of successful extensions: 7949
Number of sequences better than 5.0e-02: 2
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 7948
Number of HSP's gapped (non-prelim): 2
length of query: 197
length of database: 279,076,897
effective HSP length: 114
effective length of query: 83
effective length of database: 177,486,823
effective search space: 14731406309
effective search space used: 14731406309
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 87 (38.1 bits)
Query= orf10 [2116-2657]
(540 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064745.1| (NC_002519) gp7 [Roseophage SIO1] >gi|9944... 531 e-150
ref|NP_570387.1| (NC_003390) unknown [Cyanophage P60] >gi|1... 97 3e-019
ref|NP_519008.1| (NC_003295) PROBABLE HEMAGGLUTININ-RELATED... 44 0.004
gb|AAC04580.1| (AF048699) surface layer protein [Campylobac... 41 0.036
>ref|NP_064745.1| (NC_002519) gp7 [Roseophage SIO1]
gb|AAG02590.1|AF189021_8 (AF189021) Roseophage SIO1 complete genome
Length = 540
Score = 531 bits (1368), Expect = e-150
Identities = 283/540 (52%), Positives = 283/540 (52%)
Query: 1 MAKKPTVTTLQSGFNSTEVLNENFENIRDAFDNTLSLDGSTPNAMQADLDLNGNNLIGAT 60
MAKKPTVTTLQSGFNSTEVLNENFENIRDAFDNTLSLDGSTPNAMQADLDLNGNNLIGAT
Sbjct: 1 MAKKPTVTTLQSGFNSTEVLNENFENIRDAFDNTLSLDGSTPNAMQADLDLNGNNLIGAT 60
Query: 61 GLLINGTDYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120
GLLINGTDY
Sbjct: 61 GLLINGTDYLADVEAAKAAALVAQAAAELAENNAETAEVNAEASETAAGLSATAAATSAT 120
Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
Sbjct: 121 NAGASETAASASATASATSATNSATSASQAATSASAAAVSEGNAATSETNAANSATSASG 180
Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
Sbjct: 181 SASTATTQASAASVSATNAATSASNAATSATNAAASQAAAATSETNAAASESTVTTSATN 240
Query: 241 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300
Sbjct: 241 AATSEANAATSASTATTQATNAATSASTASTSATNAATSETNAASSASSAASSATSAQAS 300
Query: 301 KXXXXXXXXXXXXRYLGQKVSDPTLDNDGNALVAGALYFNTTDGIMKVYDGSVWLAAYAS 360
K RYLGQKVSDPTLDNDGNALVAGALYFNTTDGIMKVYDGSVWLAAYAS
Sbjct: 301 KDAALAALDSFDDRYLGQKVSDPTLDNDGNALVAGALYFNTTDGIMKVYDGSVWLAAYAS 360
Query: 361 LSGALIATNNLSDVLDVTASRTNLGLGXXXXXXXXXXXXXXQGALAVSAVQPNDSPSFGS 420
LSGALIATNNLSDVLDVTASRTNLGLG QGALAVSAVQPNDSPSFGS
Sbjct: 361 LSGALIATNNLSDVLDVTASRTNLGLGTAATTASTDYATAAQGALAVSAVQPNDSPSFGS 420
Query: 421 VTVTGTVDGRNVAADGSKLDGIEAGATADQTASEILTSIKTVDGSGSGLDADLLDGIHAS 480
VTVTGTVDGRNVAADGSKLDGIEAGATADQTASEILTSIKTVDGSGSGLDADLLDGIHAS
Sbjct: 421 VTVTGTVDGRNVAADGSKLDGIEAGATADQTASEILTSIKTVDGSGSGLDADLLDGIHAS 480
Query: 481 SFLQGNQTITLSGDVSGSGTTSILVTVADDSHNHIIANVDGLQTALDGKWSKGADIGGGC 540
SFLQGNQTITLSGDVSGSGTTSILVTVADDSHNHIIANVDGLQTALDGKWSKGADIGGGC
Sbjct: 481 SFLQGNQTITLSGDVSGSGTTSILVTVADDSHNHIIANVDGLQTALDGKWSKGADIGGGC 540
>ref|NP_570387.1| (NC_003390) unknown [Cyanophage P60]
gb|AAL73325.1| (AF338467) unknown [Cyanophage P60]
Length = 126
Score = 97.4 bits (241), Expect = 3e-019
Identities = 50/80 (62%), Positives = 62/80 (77%), Gaps = 1/80 (1%)
Query: 419 GSVTVTGTVDGRNVAADGSKLDGIEAGATADQTASEILTSIKTVDGSGSGLDADLLDGIH 478
G++TVTGTVDGR+V+ DG+KLDG+EA AT DQTASEILT + TVDG+GSGLDAD LDG +
Sbjct: 17 GNITVTGTVDGRDVSGDGTKLDGMEANATGDQTASEILTLLLTVDGTGSGLDADKLDGKN 76
Query: 479 ASSFLQGNQTITLSGDVSGS 498
S +L N T D+ G+
Sbjct: 77 GSHYLNYNNA-TNKPDIRGN 95
>ref|NP_519008.1| (NC_003295) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD14589.1| (AL646061) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 3501
Score = 43.9 bits (102), Expect = 0.004
Identities = 48/200 (24%), Positives = 78/200 (39%), Gaps = 13/200 (6%)
Query: 320 VSDPTLDNDGNALVAGALYFNTTDGIMKVYDGSVWLAAYASLSGALIATNNLSDVLDVTA 379
VS ++DN G +VAG + ++ G V A+++ A TN + A
Sbjct: 851 VSGGSVDNSGGTVVAGGTAVVQSGSTLRNAAGRVQAKGSAAVTAAGAITNTGGQIEADGA 910
Query: 380 SRTNLGLGXXXXXXXXXXXXXXQGALAVSAVQPNDSPSFGSVTVTGTVDGR-NVAADGSK 438
S T G GA V A ++ + G V GT+ G +V G
Sbjct: 911 SATLQVTGASLDNTNGRIANAGSGATTVGAASITNA-NTGGVAGAGTIGGNGDVTVSGQT 969
Query: 439 LDGIEAG---ATADQTASEILTSIKTVDGSGSGLDADLLDGIHASSFLQGNQTITLSGDV 495
L + G A D T L + ++V+ SG L A + ++ G + +G +
Sbjct: 970 LSNTQGGQIVAGHDLT----LATTRSVNNSGGTLSA----ANNVTANAAGAAVVNQAGSI 1021
Query: 496 SGSGTTSILVTVADDSHNHI 515
G+G S+ V D++ I
Sbjct: 1022 RGNGAVSLNVASLDNTSGKI 1041
>gb|AAC04580.1| (AF048699) surface layer protein [Campylobacter fetus]
Length = 1112
Score = 40.8 bits (94), Expect = 0.036
Identities = 55/211 (26%), Positives = 85/211 (40%), Gaps = 18/211 (8%)
Query: 328 DGNALVAGALYFNTTDGIMKVYDGSVWLAAYASLSGALIATNNLSDVLDVTASRTNLGLG 387
+ N V+G TT V D ++ +L +T N S ++ LG
Sbjct: 669 EANINVSGEFKLGTTTAKGDV-DFNIENVGALTLGAITSSTGNASVIISSATGNVTLGAV 727
Query: 388 XXXXXXXXXXXXXXQGALAVSAVQPNDSPSFGSVTVTGTVDGRNVAADGSKLDGIEAGAT 447
G + + A+ D S V GT++ +A G+K++ T
Sbjct: 728 SATQGNLTLNAGNTLGNITIGALA-GDIVSVDLGGVLGTIN----SASGNKVEITSNEVT 782
Query: 448 ADQTASEILTSIK--TVDGSGSGLDADLLDGIHASSFLQ-----GNQTITLSGDVSGSGT 500
SEI ++ T G+ L+A ++ G A L QTIT SGD+SG GT
Sbjct: 783 --YVGSEISKNVVEITAAAGGTDLNAQVIGGAAADDALTIIGKGDTQTITASGDLSG-GT 839
Query: 501 TSILVTVADDSHNHIIANVDGLQ--TALDGK 529
++ +T A + I+ V GL TA+D K
Sbjct: 840 LTLTLTDATKLSSLDISGVKGLSAATAIDLK 870
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.310 0.130 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 153,413,845
Number of Sequences: 891141
Number of extensions: 5614774
Number of successful extensions: 13398
Number of sequences better than 5.0e-02: 4
Number of HSP's better than 0.1 without gapping: 2
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 13388
Number of HSP's gapped (non-prelim): 9
length of query: 540
length of database: 279,076,897
effective HSP length: 125
effective length of query: 415
effective length of database: 167,684,272
effective search space: 69588972880
effective search space used: 69588972880
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 93 (40.4 bits)
Query= orf11 [2657-2821]
(163 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.314 0.131 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 92,640,067
Number of Sequences: 891141
Number of extensions: 3709785
Number of successful extensions: 6810
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 6810
Number of HSP's gapped (non-prelim): 0
length of query: 163
length of database: 279,076,897
effective HSP length: 111
effective length of query: 52
effective length of database: 180,160,246
effective search space: 9368332792
effective search space used: 9368332792
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 86 (37.7 bits)
Query= orf12 [3111-3163]
(51 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.313 0.136 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 25,762,515
Number of Sequences: 891141
Number of extensions: 659615
Number of successful extensions: 1246
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 1246
Number of HSP's gapped (non-prelim): 0
length of query: 51
length of database: 279,076,897
effective HSP length: 27
effective length of query: 24
effective length of database: 255,016,090
effective search space: 6120386160
effective search space used: 6120386160
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf13 [3162-3359]
(196 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064747.1| (NC_002519) gp9 [Roseophage SIO1] >gi|9944... 375 e-103
>ref|NP_064747.1| (NC_002519) gp9 [Roseophage SIO1]
gb|AAG02592.1|AF189021_10 (AF189021) Roseophage SIO1 complete genome
Length = 196
Score = 375 bits (963), Expect = e-103
Identities = 183/196 (93%), Positives = 183/196 (93%)
Query: 1 MTNHVRNILKLYRQATQEDTINGVEWYARAERMAKAIASDAGLPLPTVIGVMAALSPNNR 60
MTNHVRNILKLYRQATQEDTINGVEWYARAERMAKAIASDAGLPLPTVIGVMAALSPNNR
Sbjct: 1 MTNHVRNILKLYRQATQEDTINGVEWYARAERMAKAIASDAGLPLPTVIGVMAALSPNNR 60
Query: 61 WERNCRDAATMCKAWQNGDSMDSFKVSCYNTMKAKAWAILDLGLTDDEDILSHLNGQKIR 120
WERNCRDAATMCKAWQNGDSMDSFKVSCYNTMKAKAWAILDLGLTDDEDILSHLNGQKIR
Sbjct: 61 WERNCRDAATMCKAWQNGDSMDSFKVSCYNTMKAKAWAILDLGLTDDEDILSHLNGQKIR 120
Query: 121 SFYSNIRGLDEVTIDGHALNIARGKREGLTSDKTNMGKREYRELQXXXXXXXXXXXXXPH 180
SFYSNIRGLDEVTIDGHALNIARGKREGLTSDKTNMGKREYRELQ PH
Sbjct: 121 SFYSNIRGLDEVTIDGHALNIARGKREGLTSDKTNMGKREYRELQAAYVRAAKRVKVKPH 180
Query: 181 VLQAITWTTWKRVHNI 196
VLQAITWTTWKRVHNI
Sbjct: 181 VLQAITWTTWKRVHNI 196
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.318 0.132 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 101,033,487
Number of Sequences: 891141
Number of extensions: 3729891
Number of successful extensions: 7084
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 7083
Number of HSP's gapped (non-prelim): 1
length of query: 196
length of database: 279,076,897
effective HSP length: 114
effective length of query: 82
effective length of database: 177,486,823
effective search space: 14553919486
effective search space used: 14553919486
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 87 (38.1 bits)
Query= orf14 [3366-3509]
(142 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064748.1| (NC_002519) gp10 [Roseophage SIO1] >gi|994... 277 1e-074
>ref|NP_064748.1| (NC_002519) gp10 [Roseophage SIO1]
gb|AAG02593.1|AF189021_11 (AF189021) Roseophage SIO1 complete genome
Length = 142
Score = 277 bits (709), Expect = 1e-074
Identities = 142/142 (100%), Positives = 142/142 (100%)
Query: 1 MTNTQHKNILKHLKTAKGLTVREALIEYSISSLTKRVHELRGLGYDIESVRKKHPCNRSK 60
MTNTQHKNILKHLKTAKGLTVREALIEYSISSLTKRVHELRGLGYDIESVRKKHPCNRSK
Sbjct: 1 MTNTQHKNILKHLKTAKGLTVREALIEYSISSLTKRVHELRGLGYDIESVRKKHPCNRSK 60
Query: 61 IYTLLSTRGARAMSMCGEIENTENAIKSLRAKEFNIQFNSIGLTDKERLTIVNTIREELL 120
IYTLLSTRGARAMSMCGEIENTENAIKSLRAKEFNIQFNSIGLTDKERLTIVNTIREELL
Sbjct: 61 IYTLLSTRGARAMSMCGEIENTENAIKSLRAKEFNIQFNSIGLTDKERLTIVNTIREELL 120
Query: 121 EETVRLNKLLELANKAEGSHRA 142
EETVRLNKLLELANKAEGSHRA
Sbjct: 121 EETVRLNKLLELANKAEGSHRA 142
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.315 0.130 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 61,874,848
Number of Sequences: 891141
Number of extensions: 2011039
Number of successful extensions: 6293
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 6292
Number of HSP's gapped (non-prelim): 1
length of query: 142
length of database: 279,076,897
effective HSP length: 118
effective length of query: 24
effective length of database: 173,922,259
effective search space: 4174134216
effective search space used: 4174134216
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 83 (36.6 bits)
Query= orf15 [3610-3660]
(49 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.312 0.134 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 28,642,583
Number of Sequences: 891141
Number of extensions: 851263
Number of successful extensions: 1177
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 1177
Number of HSP's gapped (non-prelim): 0
length of query: 49
length of database: 279,076,897
effective HSP length: 25
effective length of query: 24
effective length of database: 256,798,372
effective search space: 6163160928
effective search space used: 6163160928
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf16 [3660-3883]
(222 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064749.1| (NC_002519) RP thymidylate synthase [Roseo... 461 e-129
ref|NP_228259.1| (NC_000853) thy1 protein [Thermotoga marit... 112 3e-024
sp|P15808|THY1_DICDI Thy1 protein >gi|279686|pir||YXDOTC th... 104 7e-022
ref|NP_360042.1| (NC_003103) thy1 protein [Rickettsia conor... 103 1e-021
ref|NP_219444.1| (NC_000919) thymidylate synthase-complemen... 102 3e-021
ref|NP_045749.1| (NC_001857) thymidylate synthase-complemen... 98 7e-020
ref|NP_570363.1| (NC_003390) unknown [Cyanophage P60] >gi|1... 87 1e-016
ref|NP_213443.1| (NC_000918) thymidylate synthase complemen... 73 2e-012
pir||T35843 hypothetical protein SC9A10.07 SC9A10.07 - Stre... 71 7e-012
ref|NP_049030.1| (NC_000852) similar to Synechocystis ORF s... 65 4e-010
ref|NP_280185.1| (NC_002607) Vng1325c [Halobacterium sp. NR... 45 7e-004
sp|Q9HQ52|YD25_HALN1 Hypothetical protein Vng1325c 45 7e-004
ref|NP_578269.1| (NC_003413) hypothetical protein [Pyrococc... 42 0.003
sp|Q9C4Y9|THY1_HALSA Thymidylate synthase complementing pro... 41 0.010
ref|NP_542556.1| (NC_003345) putative phage C31 protein gp1... 40 0.021
>ref|NP_064749.1| (NC_002519) RP thymidylate synthase [Roseophage SIO1]
gb|AAG02594.1|AF189021_12 (AF189021) Roseophage SIO1 complete genome
Length = 222
Score = 461 bits (1185), Expect = e-129
Identities = 222/222 (100%), Positives = 222/222 (100%)
Query: 1 MTQIEATYIDHMGSDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSGRDTKLIKYLAKHK 60
MTQIEATYIDHMGSDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSGRDTKLIKYLAKHK
Sbjct: 1 MTQIEATYIDHMGSDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSGRDTKLIKYLAKHK 60
Query: 61 HISPFGHAFASFHVKAPIFVARQLVKHKFLRWNEISRRYVDDEPEFYTPDVWRGRSADKK 120
HISPFGHAFASFHVKAPIFVARQLVKHKFLRWNEISRRYVDDEPEFYTPDVWRGRSADKK
Sbjct: 61 HISPFGHAFASFHVKAPIFVARQLVKHKFLRWNEISRRYVDDEPEFYTPDVWRGRSADKK 120
Query: 121 QGSDGVVNPEYNPQYLDNKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFAD 180
QGSDGVVNPEYNPQYLDNKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFAD
Sbjct: 121 QGSDGVVNPEYNPQYLDNKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFAD 180
Query: 181 MCRLRCKEDTQYESRVVADQISEKMADLYPVSWAALMEGEKQ 222
MCRLRCKEDTQYESRVVADQISEKMADLYPVSWAALMEGEKQ
Sbjct: 181 MCRLRCKEDTQYESRVVADQISEKMADLYPVSWAALMEGEKQ 222
>ref|NP_228259.1| (NC_000853) thy1 protein [Thermotoga maritima]
sp|Q9WYT0|Y449_THEMA Hypothetical protein TM0449
pir||B72375 thy1 protein - Thermotoga maritima (strain MSB8)
gb|AAD35532.1|AE001723_2 (AE001723) thy1 protein [Thermotoga maritima]
Length = 220
Score = 112 bits (280), Expect = 3e-024
Identities = 73/215 (33%), Positives = 110/215 (50%), Gaps = 26/215 (12%)
Query: 9 IDHMGSDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSG--RDTKLIKYLAKHKHISPFG 66
+D MG+DLS V AARVSF D GL RD LI+YL KH H +PF
Sbjct: 14 VDVMGNDLSAVRAARVSF---------------DMGLKDEERDRHLIEYLMKHGHETPFE 58
Query: 67 HAFASFHVKAPIFVARQLVKHKFLRWNEISRRYVDDEPEFYTPDVWRGRSADKKQGSDGV 126
H +FHVKAPIFVARQ +H+ +NE+S RY EFY P S ++ +G
Sbjct: 59 HIVFTFHVKAPIFVARQWFRHRIASYNELSGRYSKLSYEFYIP------SPERLEGYKTT 112
Query: 127 VNPEYNPQYLD---NKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFADMCR 183
+ PE + + +K YL+ ++ G+ E AR++LP + T ++W+ + + +
Sbjct: 113 IPPERVTEKISEIVDKAYRTYLELIESGVPREVARIVLPLNLYTRFFWTVNARSLMNFLN 172
Query: 184 LRCKEDTQYESRVVADQISEKMADLYPVSWAALME 218
LR Q+E + A I+ + P ++ A ++
Sbjct: 173 LRADSHAQWEIQQYALAIARIFKEKCPWTFEAFLK 207
>sp|P15808|THY1_DICDI Thy1 protein
pir||YXDOTC thymidylate synthase-complementing protein - slime mold
(Dictyostelium discoideum)
gb|AAA33257.1| (M27713) Thy1 protein [Dictyostelium discoideum]
Length = 260
Score = 104 bits (259), Expect = 7e-022
Identities = 70/206 (33%), Positives = 101/206 (48%), Gaps = 20/206 (9%)
Query: 14 SDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSGRDTKLIKYLAKHKHISPFGHAFASFH 73
+D S+ AARVS+G ++ V DKGL I+YL +H+H SPF FH
Sbjct: 59 ADFSICQAARVSYGAGTKKVT------EDKGL-------IRYLYRHQHTSPFEMVEFKFH 105
Query: 74 VKAPIFVARQLVKHKFLRWNEISRRYVDDEPEFYTPDV--WRGRSADKKQGSDGVVNPEY 131
P+F+ARQ ++H+ NE S RY +FY P + R +S +QG + + P+
Sbjct: 106 CVMPVFIARQWIRHRTANVNEYSARYSVLPDKFYHPSIEEVRKQSTSNRQGGEEALEPKT 165
Query: 132 NPQYLDNKIKF-----AYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFADMCRLRC 186
++LD K Y + L+ G+S E R+ LP S TEWYW L RLR
Sbjct: 166 AQEFLDYLDKVEENYKTYNELLEKGLSRELGRIGLPVSIYTEWYWKIDLHNLFHFLRLRM 225
Query: 187 KEDTQYESRVVADQISEKMADLYPVS 212
+Q E R A+ I + + PV+
Sbjct: 226 DSHSQKEIRDYANTIFALIRPIVPVA 251
>ref|NP_360042.1| (NC_003103) thy1 protein [Rickettsia conorii]
gb|AAL02943.1| (AE008605) thy1 protein [Rickettsia conorii]
Length = 294
Score = 103 bits (257), Expect = 1e-021
Identities = 77/231 (33%), Positives = 109/231 (46%), Gaps = 34/231 (14%)
Query: 9 IDHMGSDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSGRDTKLIKYLAKHKHISPFGHA 68
ID+MG D ++V AARVS+GK ++ + +DKGL I YL +H H +PF
Sbjct: 32 IDYMGDDSAIVQAARVSYGKGTKQL------NQDKGL-------INYLLRHYHTTPFEMC 78
Query: 69 FASFHVKAPIFVARQLVKHKFLRWNEISRRYVDDEPEFYTPDVWR--GRSADKKQGSDGV 126
FH+K PIF+ARQ ++H+ NE S RY EFY P+ +SA KQ +G
Sbjct: 79 DIKFHIKLPIFIARQWIRHRTASVNEYSARYSILGNEFYLPEPANIASQSAVNKQCREGD 138
Query: 127 VNPEYNPQ----YLDNKIKFAYLQALDI---------------GISPEQARMLLPQSTMT 167
P+ + L+ + Y ++ GI+ E ARM L + T
Sbjct: 139 SLPKEVAEKVLAILEEDARQCYGHYKELMNADEEGNIIDENATGIARELARMNLTLNYYT 198
Query: 168 EWYWSGSLDAFADMCRLRCKEDTQYESRVVADQISEKMADLYPVSWAALME 218
EWYW +L RLR QYE RV A+++ E + P + A E
Sbjct: 199 EWYWKINLHNLLHFLRLRADPHAQYEIRVYAEKMLEIVKAWVPFVYEAFEE 249
>ref|NP_219444.1| (NC_000919) thymidylate synthase-complementing protein (thy1)
[Treponema pallidum]
sp|O83971|YA07_TREPA Hypothetical protein TP1007
pir||A71255 probable thymidylate synthase-complementing protein (thy1) -
syphilis spirochete
gb|AAC65955.1| (AE001268) thymidylate synthase-complementing protein (thy1)
[Treponema pallidum]
Length = 305
Score = 102 bits (254), Expect = 3e-021
Identities = 77/223 (34%), Positives = 105/223 (46%), Gaps = 24/223 (10%)
Query: 9 IDHMGSDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSGRDTKLIKYLAKHKHISPFGHA 68
ID++G D +V AARVS+G G RD D LI +L ++KH SPF
Sbjct: 55 IDYLGGDARIVQAARVSYG-------AGTRTARD------DAALIDFLLRNKHTSPFEQV 101
Query: 69 FASFHVKAPIFVARQLVKHKFLRWNEISRRYVDDEPEFYTPDVW--RGRSADKKQG--SD 124
+FHV+APIFVARQ ++H+ R +E+S RY + Y P +S KQG S+
Sbjct: 102 VLTFHVRAPIFVARQWMRHRTARISEVSSRYSLLSHDCYVPQETSVAVQSTRNKQGRASE 161
Query: 125 GVVNPEYNPQY------LDNKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAF 178
G ++PE + AY + I+ E AR+ +P S TEWYW L
Sbjct: 162 G-ISPEQQQEVRAAFEAQQKAACAAYDALIQKNIARELARINVPLSLYTEWYWQIDLHNL 220
Query: 179 ADMCRLRCKEDTQYESRVVADQISEKMADLYPVSWAALMEGEK 221
RLR Q E R A+ I E + P + A+ EK
Sbjct: 221 FHFLRLRASAHAQAEIRAYAEVIIEITRAVAPCATASFENHEK 263
>ref|NP_045749.1| (NC_001857) thymidylate synthase-complementing protein (thy1)
[Borrelia burgdorferi]
sp|O50965|YP76_BORBU Hypothetical protein BBA76
pir||D70216 thymidylate synthase-complementing protein (thy1) homolog - Lyme
disease spirochete plasmid A/lp54
gb|AAC66242.1| (AE000790) thymidylate synthase-complementing protein (thy1)
[Borrelia burgdorferi]
Length = 265
Score = 97.8 bits (242), Expect = 7e-020
Identities = 71/222 (31%), Positives = 105/222 (46%), Gaps = 36/222 (16%)
Query: 9 IDHMGSDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSGRDTKLIKYLAKHKHISPFGHA 68
ID MG D +V AAR+S+ R++ + +D +LI YL ++ H SP
Sbjct: 16 IDFMGDDRRIVKAARISY--------------REESVKRKDAELIDYLIRNGHTSPLEQV 61
Query: 69 FASFHVKAPIFVARQLVKHKFLRWNEISRRYVDDEPEFYTPDVWRGRSADKK-QGSDGVV 127
+FHVKAPIFVARQ ++H+ R NE+S Y EFY P D K Q S
Sbjct: 62 VFTFHVKAPIFVARQWMRHRTARINEVSGCYSLAREEFYVP-----LEEDLKCQTSSNSS 116
Query: 128 NPEYNP-QYLDNKIKF-------AYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFA 179
E+ + L +KIK Y ++ I E +R++LP S TEWYW L+
Sbjct: 117 EKEFKSLEKLSDKIKHHQKHSYELYQDMINANIPKELSRIVLPLSLYTEWYWQIDLNNLF 176
Query: 180 DMCRLR--------CKEDTQYESRVVADQISEKMADLYPVSW 213
+LR KE++ E R A + + ++ P+++
Sbjct: 177 HFIKLRLALDSPKEIKENSPKEMREYAKALISIVREIVPIAF 218
>ref|NP_570363.1| (NC_003390) unknown [Cyanophage P60]
gb|AAL73301.1| (AF338467) unknown [Cyanophage P60]
Length = 235
Score = 87.0 bits (214), Expect = 1e-016
Identities = 60/210 (28%), Positives = 108/210 (50%), Gaps = 7/210 (3%)
Query: 15 DLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSGRDTKLIKYLAKHKHISPFGHAFASFHV 74
++S V+ V+ + Y + +D L +LI+YL KH H SPF A +
Sbjct: 25 NMSKVSLISVTPDAEELIAYLARVSNKDAKLGDPADRLIRYLIKHGHWSPFEMAHMVVEI 84
Query: 75 KAPIFVARQLVKHKFLRWNEISRRY-VDDEPEFYTPDVWRGRSADKKQGSDGV---VNPE 130
+ +A Q+++H+ + E S+RY V D EF P++ R + +++ D + + E
Sbjct: 85 ETTRAIAPQILRHRSFSFQEFSQRYAVADLGEFEVPELRRQDTKNRQNSIDDLPAEIVAE 144
Query: 131 YNPQYLD--NKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFADMCRLRCKE 188
+ + + + Y + L G++ E AR +LP +T T+ Y SG++ ++ C+LRC
Sbjct: 145 FQSEIDEVFETTRVLYQRMLSYGVAKECARGILPLNTPTKLYMSGTIRSWIHYCQLRCGN 204
Query: 189 DTQYESRVVADQISEKM-ADLYPVSWAALM 217
TQ E R++A++ + A+L V+ AA M
Sbjct: 205 GTQEEHRLIAEECKNLLYANLPNVAKAAFM 234
>ref|NP_213443.1| (NC_000918) thymidylate synthase complementing protein [Aquifex
aeolicus]
sp|O66883|Y640_AQUAE Hypothetical protein AQ_640
pir||G70356 thymidylate synthase complementing protein - Aquifex aeolicus
gb|AAC06847.1| (AE000700) thymidylate synthase complementing protein [Aquifex
aeolicus]
Length = 317
Score = 72.8 bits (177), Expect = 2e-012
Identities = 49/141 (34%), Positives = 76/141 (53%), Gaps = 6/141 (4%)
Query: 69 FASFHVKAPIFVARQLVKHKFLRWNEISRRYVDDE-PEFYTPDVWRGRSADKKQGS--DG 125
F +F V+ P+FVARQ ++H+F +NE+S+RYV E EFY P R ++ KQ S +
Sbjct: 164 FYTFVVECPLFVARQWMRHRFGSYNEVSKRYVGKEFLEFYLPKYIRKQAEKNKQASVDEP 223
Query: 126 VVNPEYNPQYLDNKIKFA---YLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFADMC 182
+ E + ++N I + Y + ++ G + E AR +LPQ T +YW+ + +
Sbjct: 224 ISESEVFIKKIENLISKSVKLYEEIIEKGGAKELARGVLPQFMKTRFYWTVPRISLDNFI 283
Query: 183 RLRCKEDTQYESRVVADQISE 203
LR E Q E R A+ I E
Sbjct: 284 TLRTHEGAQKEIREFAEAIKE 304
Score = 47.4 bits (111), Expect = 1e-004
Identities = 28/64 (43%), Positives = 34/64 (52%), Gaps = 12/64 (18%)
Query: 12 MGSDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSGRDTKLIKYLAKHKHISPFGHAFAS 71
MGSD +V ARVSF K S Y + RD +LI+YL KH+H SPF H +
Sbjct: 7 MGSDQRIVRCARVSFAKDS---YVDEK---------RDKRLIRYLFKHRHASPFEHNIIA 54
Query: 72 FHVK 75
F K
Sbjct: 55 FEWK 58
>pir||T35843 hypothetical protein SC9A10.07 SC9A10.07 - Streptomyces coelicolor
emb|CAA20294.1| (AL031260) hypothetical protein SC9A10.07 [Streptomyces coelicolor
A3(2)]
Length = 246
Score = 71.2 bits (173), Expect = 7e-012
Identities = 64/230 (27%), Positives = 102/230 (43%), Gaps = 23/230 (10%)
Query: 2 TQIEATYIDHMGSDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSGRDTKLIKYLAKHKH 61
+ I + +D V+ AARVS + + R KGL I YL + +H
Sbjct: 15 SDITVELVKSAATDSDVLFAARVSTAGEQSLDELKKDPERSKGL-------INYLMRDRH 67
Query: 62 ISPFGHAFASFHVKAPIFVARQLVKHKF-LRWNEISRRYVDDEPEFYTPDVWRGRSADKK 120
SPF H +F V APIFV R+ ++H+ +NE S RY + +P FY PD R +
Sbjct: 68 GSPFEHNSMTFFVSAPIFVFREFMRHRVGWSYNEESGRYRELQPVFYAPDASRKLVQQGR 127
Query: 121 QGSDGVV--NPEYNP---QYLDNKIKFAYL---QALDIGISPEQARMLLPQSTMTEWYWS 172
G V PE + +++ + AY Q L G++ E AR +LP + Y +
Sbjct: 128 PGKYVFVEGTPEQHELVGSAMEDSYRQAYATYQQMLAAGVAREVARAVLPVGLYSSMYAT 187
Query: 173 GSLDAFADMCRLRCKED-------TQYESRVVADQISEKMADLYPVSWAA 215
+ + LR + + Q E + +++ + A L P++ AA
Sbjct: 188 CNARSLMHFLGLRTQHELAKVPSFPQREIEMAGEKMEAEWARLMPLTHAA 237
>ref|NP_049030.1| (NC_000852) similar to Synechocystis ORF s111635, corresponds to
GenBank Accession Number D90903 [Paramecium bursaria
Chlorella virus 1]
sp|O41156|A674_CHVP1 Putative protein A674R
pir||T18176 hypothetical protein A674R - Chlorella virus PBCV-1
gb|AAC96983.1| (U42580) similar to Synechocystis ORF s111635, corresponds to
GenBank Accession Number D90903 [Paramecium bursaria
Chlorella virus 1]
Length = 216
Score = 65.5 bits (158), Expect = 4e-010
Identities = 50/167 (29%), Positives = 77/167 (45%), Gaps = 11/167 (6%)
Query: 52 LIKYLAKHKHISPFGHAFASFHVKAPIFVARQLVKHKFLRWNEISRRYVDDEPEFYTPDV 111
L+KY +HKH S F AF + +K +A Q+++H+ + E S+RY TP
Sbjct: 44 LLKYCIRHKHWSIFETAFMTLELKTSRGIAAQVLRHRSFHFQEFSQRYAS---VMETPPP 100
Query: 112 WRGRSADKKQGSDGV-VNPEYNPQYLDNKIKFAYLQ-------ALDIGISPEQARMLLPQ 163
+ R D K + + PE + + + + Y Q AL+ GI+ E AR +LP
Sbjct: 101 HQARFQDHKNRQNSLDTVPEDDQTWWATEQEKLYAQSMELYNKALEKGIAKECARFILPL 160
Query: 164 STMTEWYWSGSLDAFADMCRLRCKEDTQYESRVVADQISEKMADLYP 210
ST T Y SG++ + LR TQ E +A+ E +P
Sbjct: 161 STPTTIYMSGTIRDWIHYIELRTSNGTQREHIDLANACKEIFIKEFP 207
>ref|NP_280185.1| (NC_002607) Vng1325c [Halobacterium sp. NRC-1]
gb|AAG19665.1| (AE005054) Vng1325c [Halobacterium sp. NRC-1]
Length = 266
Score = 44.7 bits (104), Expect = 7e-004
Identities = 53/207 (25%), Positives = 81/207 (38%), Gaps = 30/207 (14%)
Query: 30 SEWVYCGQSDGRDKGLSGRDT-----KLIKYLAKHKHISPFGHAFASFHVKAPIFVAR-- 82
S+WV D + G T LI L H PF H A+F ++ V+R
Sbjct: 45 SDWVGDTPLDTAMASVDGDTTDEKLSNLIAQLLTRGHYGPFEHPSATFAIEG---VSRSC 101
Query: 83 --QLVKHKFLRWNEISRRYV--DD--------------EPEFYTPDVWRGRSADKKQ-GS 123
QL +H+ ++ S RYV DD P PD W GR+ D
Sbjct: 102 MAQLTRHRHASFDVQSMRYVAFDDVDPAAVAEGELVVTPPSATDPD-WVGRNQDAGDIDE 160
Query: 124 DGVVNPEYNPQYLDNKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFADMCR 183
+ + E Q + Y + L +G+ PE AR +LP T + + + +
Sbjct: 161 ETMAEREAVFQASVRRAVEDYQELLGLGMPPEDARFVLPIGTEVNVVITLNPRSLMHVAD 220
Query: 184 LRCKEDTQYESRVVADQISEKMADLYP 210
+R D Q+E R + +Q+ + A P
Sbjct: 221 MRAAADAQWEIRELTEQLLDAAAQWCP 247
>sp|Q9HQ52|YD25_HALN1 Hypothetical protein Vng1325c
Length = 247
Score = 44.7 bits (104), Expect = 7e-004
Identities = 53/207 (25%), Positives = 81/207 (38%), Gaps = 30/207 (14%)
Query: 30 SEWVYCGQSDGRDKGLSGRDT-----KLIKYLAKHKHISPFGHAFASFHVKAPIFVAR-- 82
S+WV D + G T LI L H PF H A+F ++ V+R
Sbjct: 26 SDWVGDTPLDTAMASVDGDTTDEKLSNLIAQLLTRGHYGPFEHPSATFAIEG---VSRSC 82
Query: 83 --QLVKHKFLRWNEISRRYV--DD--------------EPEFYTPDVWRGRSADKKQ-GS 123
QL +H+ ++ S RYV DD P PD W GR+ D
Sbjct: 83 MAQLTRHRHASFDVQSMRYVAFDDVDPAAVAEGELVVTPPSATDPD-WVGRNQDAGDIDE 141
Query: 124 DGVVNPEYNPQYLDNKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFADMCR 183
+ + E Q + Y + L +G+ PE AR +LP T + + + +
Sbjct: 142 ETMAEREAVFQASVRRAVEDYQELLGLGMPPEDARFVLPIGTEVNVVITLNPRSLMHVAD 201
Query: 184 LRCKEDTQYESRVVADQISEKMADLYP 210
+R D Q+E R + +Q+ + A P
Sbjct: 202 MRAAADAQWEIRELTEQLLDAAAQWCP 228
>ref|NP_578269.1| (NC_003413) hypothetical protein [Pyrococcus furiosus DSM 3638]
gb|AAL80664.1| (AE010177) hypothetical protein [Pyrococcus furiosus DSM 3638]
Length = 244
Score = 42.4 bits (98), Expect = 0.003
Identities = 45/167 (26%), Positives = 77/167 (45%), Gaps = 23/167 (13%)
Query: 61 HISPFGHAFASFHVKAPIFVA-RQLVKHKFLRWNEISRRYV-----DDEPEFYTPD-VWR 113
H S HA +F ++ V QLV+H+ + + S+RY+ + E F P+ + +
Sbjct: 54 HESILEHATFTFSIEGCSRVCTHQLVRHRIASYTQQSQRYIVLDEENVEETFVIPESIKK 113
Query: 114 GRSADKKQGSDGVVNPEYNPQYLDNKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSG 173
R +K + + I Y ++++ G+ E AR +LPQ+ T+ +
Sbjct: 114 DRELYEKW-----------KKVMAETISL-YKESINRGVHQEDARFILPQAVKTKIIVTM 161
Query: 174 SLDAFADMCRLRCKEDTQYESRVVADQISEKMA---DLYP-VSWAAL 216
+L LR E Q+E R VA ++ E+MA D+ P + WA L
Sbjct: 162 NLRELKHFFGLRLCERAQWEIREVAWKMLEEMAKRDDIRPIIKWAKL 208
>sp|Q9C4Y9|THY1_HALSA Thymidylate synthase complementing protein
gb|AAK00648.1|AF227235_2 (AF227235) thymidylate synthase complementing protein
[Halobacterium salinarum]
Length = 247
Score = 40.8 bits (94), Expect = 0.010
Identities = 50/206 (24%), Positives = 79/206 (38%), Gaps = 28/206 (13%)
Query: 30 SEWVYCGQSDGRDKGLSGRDT-----KLIKYLAKHKHISPFGHAFASFHVKAPIFVAR-- 82
S+WV D + G T LI L H PF H A+F ++ V+R
Sbjct: 26 SDWVGDTPLDTAMASVDGDTTDEKLSNLIAQLLTRGHYGPFEHPSATFAIEG---VSRSC 82
Query: 83 --QLVKHKFLRWNEISRRYV--DD-------------EPEFYTPDVWRGRSADKKQ-GSD 124
QL +H+ ++ S RYV DD P T W GR+ D +
Sbjct: 83 MAQLTRHRHASFDVQSMRYVAFDDVDPAAVAEGELVVTPPSATDPDWVGRNQDAGDIDEE 142
Query: 125 GVVNPEYNPQYLDNKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFADMCRL 184
+ + Q + Y + L +G+ P AR +LP T + + + + +
Sbjct: 143 TMAERQAVFQASVRRAVEDYQELLGLGMPPVDARFVLPIGTEVNVVITLNPRSLMHVADM 202
Query: 185 RCKEDTQYESRVVADQISEKMADLYP 210
R D Q+E R + +Q+ + A P
Sbjct: 203 RAAADAQWEIRELTEQLLDAAAQWCP 228
>ref|NP_542556.1| (NC_003345) putative phage C31 protein gp16-like protein [Halovirus
HF2]
gb|AAL54978.1| (AF222060) putative phage C31 protein gp16-like protein [Halovirus
HF2]
Length = 234
Score = 39.7 bits (91), Expect = 0.021
Identities = 41/180 (22%), Positives = 76/180 (41%), Gaps = 18/180 (10%)
Query: 52 LIKYLAKHKHISPFGHAFASFHVKAPIFVAR-QLVKHKFLRWNEISRRYVDDEPEFYTPD 110
L+ L + H PF H A F V+ A Q+ +H+ + ++ S+RY D F +
Sbjct: 50 LMGELLRRGHFGPFEHIQAFFAVEGLSRSAMAQVTRHRHMSFDVQSQRYCD----FSDKN 105
Query: 111 VWRGRSADKKQGSDGVVNPEYNPQYLD------------NKIKFAYLQALDIGISPEQAR 158
+ K ++ VV +Y+ D N ++ Y + ++ G++ E AR
Sbjct: 106 IVIPPGDKKNPAAEDVVLGQYDGPAEDFTGADALEGHFVNSVEL-YEKLIEQGMAKEDAR 164
Query: 159 MLLPQSTMTEWYWSGSLDAFADMCRLRCKEDTQYESRVVADQISEKMADLYPVSWAALME 218
+LP + +S + LR Q+E+R A ++ E+ + P+ + A E
Sbjct: 165 FVLPIGVEVDMTFSANARTLMHFFDLRKNMKAQWEAREFATKVLEECKEWSPLVFNAYEE 224
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.319 0.134 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 126,622,887
Number of Sequences: 891141
Number of extensions: 5160355
Number of successful extensions: 9815
Number of sequences better than 5.0e-02: 26
Number of HSP's better than 0.1 without gapping: 13
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 9775
Number of HSP's gapped (non-prelim): 29
length of query: 222
length of database: 279,076,897
effective HSP length: 116
effective length of query: 106
effective length of database: 175,704,541
effective search space: 18624681346
effective search space used: 18624681346
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 88 (38.5 bits)
Query= orf17 [3881-3986]
(103 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064750.1| (NC_002519) gp12 [Roseophage SIO1] >gi|994... 227 1e-059
>ref|NP_064750.1| (NC_002519) gp12 [Roseophage SIO1]
gb|AAG02595.1|AF189021_14 (AF189021) Roseophage SIO1 complete genome
Length = 114
Score = 227 bits (579), Expect = 1e-059
Identities = 103/104 (99%), Positives = 103/104 (99%), Gaps = 1/104 (0%)
Query: 1 MTNRKFTPPTEFPAEYVDGFGCKVTILGRSYYNKERPLVGFDDEGCACNYAENGAYWPDD 60
MTNRKFTPPTEFPAEYVDGFGCKVTILGRSYYNKERPLVGFDDEGCACNYAENGAYWPDD
Sbjct: 11 MTNRKFTPPTEFPAEYVDGFGCKVTILGRSYYNKERPLVGFDDEGCACNYAENGAYWPDD 70
Query: 61 GGKYDLHDIQKRITTWHNVYEGWVGASNKVN-GVTENRLCVIPH 103
GGKYDLHDIQKRITTWHNVYEGWVGASNKVN GVTENRLCVIPH
Sbjct: 71 GGKYDLHDIQKRITTWHNVYEGWVGASNKVNXGVTENRLCVIPH 114
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.319 0.141 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 71,074,634
Number of Sequences: 891141
Number of extensions: 3050981
Number of successful extensions: 5477
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 5475
Number of HSP's gapped (non-prelim): 1
length of query: 103
length of database: 279,076,897
effective HSP length: 79
effective length of query: 24
effective length of database: 208,676,758
effective search space: 5008242192
effective search space used: 5008242192
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 83 (36.6 bits)
Query= orf18 [4002-4080]
(77 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.313 0.130 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 28,090,096
Number of Sequences: 891141
Number of extensions: 471460
Number of successful extensions: 549
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 549
Number of HSP's gapped (non-prelim): 0
length of query: 77
length of database: 279,076,897
effective HSP length: 53
effective length of query: 24
effective length of database: 231,846,424
effective search space: 5564314176
effective search space used: 5564314176
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf19 [4078-4201]
(122 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064751.1| (NC_002519) gp13 [Roseophage SIO1] >gi|994... 188 7e-048
>ref|NP_064751.1| (NC_002519) gp13 [Roseophage SIO1]
gb|AAG02596.1|AF189021_15 (AF189021) Roseophage SIO1 complete genome
Length = 122
Score = 188 bits (478), Expect = 7e-048
Identities = 93/122 (76%), Positives = 93/122 (76%)
Query: 1 MSEAPERIWAWWDDAYDVGLLNKHGDKRYTPNDAKEYVRADRIEELETQLSEARQVGKEW 60
MSEAPERIWAWWDDAYDVGLLNKHGDKRYTPNDAKEYVRADRIEELETQLSEARQVGKEW
Sbjct: 1 MSEAPERIWAWWDDAYDVGLLNKHGDKRYTPNDAKEYVRADRIEELETQLSEARQVGKEW 60
Query: 61 FEEQXXXXXXXXXXXXXXXXXIDXXXXXXXXXXXXESLVDWANLHVLKARTLLAELKGET 120
FEEQ ID ESLVDWANLHVLKARTLLAELKGET
Sbjct: 61 FEEQKKARKDANAAEAKLAKAIDFVAGVAGGFKFGESLVDWANLHVLKARTLLAELKGET 120
Query: 121 DE 122
DE
Sbjct: 121 DE 122
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.312 0.131 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 51,037,116
Number of Sequences: 891141
Number of extensions: 1336714
Number of successful extensions: 2644
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2643
Number of HSP's gapped (non-prelim): 1
length of query: 122
length of database: 279,076,897
effective HSP length: 98
effective length of query: 24
effective length of database: 191,745,079
effective search space: 4601881896
effective search space used: 4601881896
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 83 (36.6 bits)
Query= orf20 [4198-4281]
(82 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.320 0.137 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 47,017,146
Number of Sequences: 891141
Number of extensions: 1582336
Number of successful extensions: 4120
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4120
Number of HSP's gapped (non-prelim): 0
length of query: 82
length of database: 279,076,897
effective HSP length: 58
effective length of query: 24
effective length of database: 227,390,719
effective search space: 5457377256
effective search space used: 5457377256
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf21 [4277-4356]
(78 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.322 0.139 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 43,583,563
Number of Sequences: 891141
Number of extensions: 1461831
Number of successful extensions: 2718
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2718
Number of HSP's gapped (non-prelim): 0
length of query: 78
length of database: 279,076,897
effective HSP length: 54
effective length of query: 24
effective length of database: 230,955,283
effective search space: 5542926792
effective search space used: 5542926792
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf22 [4356-4419]
(62 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.330 0.150 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 17,664,815
Number of Sequences: 891141
Number of extensions: 181493
Number of successful extensions: 200
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 200
Number of HSP's gapped (non-prelim): 0
length of query: 62
length of database: 279,076,897
effective HSP length: 38
effective length of query: 24
effective length of database: 245,213,539
effective search space: 5885124936
effective search space used: 5885124936
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf23 [4462-4980]
(517 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064752.1| (NC_002519) RPprimase/helicase [Roseophage... 1040 0.0
ref|NP_570328.1| (NC_003390) RP primase/helicase [Cyanophag... 335 9e-091
ref|NP_523315.1| (NC_003298) DNA primase/helicase [Bacterio... 276 4e-073
ref|NP_052087.1| (NC_001271) DNA primase/helicase [Bacterio... 274 2e-072
ref|NP_041975.1| (NC_001604) gene 4A, primase/helicase [14,... 273 3e-072
ref|NP_523316.1| (NC_003298) helicase [Bacteriophage T3] >g... 255 9e-067
pdb|1CR1|A Chain A, Crystal Structure Of The Helicase Domai... 156 4e-037
gb|AAD41902.1|AF159357_1 (AF159357) DNA primase [Enterobact... 154 2e-036
emb|CAA24348.1| (V01127) 4A protein [Enterobacteria phage T7] 66 1e-009
gb|AAF52820.1| (AE003626) CG5924 gene product [Drosophila m... 66 1e-009
ref|NP_391924.1| (NC_000964) replicative DNA helicase [Baci... 63 8e-009
ref|NP_214015.1| (NC_000918) replicative DNA helicase [Aqui... 62 1e-008
ref|NP_563550.1| (NC_003366) replicative DNA helicase [Clos... 56 8e-007
ref|NP_072756.1| (NC_000908) replicative DNA helicase (dnaB... 54 4e-006
ref|NP_108071.1| (NC_002678) replicative DNA helicase (dnaB... 49 1e-004
ref|NP_439720.1| (NC_000907) replicative DNA helicase (dnaB... 49 1e-004
ref|NP_543071.1| (NC_003356) putative helicase [Bacteriopha... 46 0.001
ref|NP_458617.1| (NC_003198) DNA helicase [Salmonella enter... 45 0.002
>ref|NP_064752.1| (NC_002519) RPprimase/helicase [Roseophage SIO1]
gb|AAG02597.1|AF189021_16 (AF189021) Roseophage SIO1 complete genome
Length = 522
Score = 1040 bits (2689), Expect = 0.0
Identities = 517/517 (100%), Positives = 517/517 (100%)
Query: 1 MDYEHKKHQPCQSCGSSDGAYPYEDGIYCHVCKTKTFNDDGDTQVTEQKTLPPIRGKIMA 60
MDYEHKKHQPCQSCGSSDGAYPYEDGIYCHVCKTKTFNDDGDTQVTEQKTLPPIRGKIMA
Sbjct: 6 MDYEHKKHQPCQSCGSSDGAYPYEDGIYCHVCKTKTFNDDGDTQVTEQKTLPPIRGKIMA 65
Query: 61 VPSRGLNKATAEKYRALTSDDKISLLYTTEGKVTSFKERGLSEKTFKFNGPAQTDLFGQS 120
VPSRGLNKATAEKYRALTSDDKISLLYTTEGKVTSFKERGLSEKTFKFNGPAQTDLFGQS
Sbjct: 66 VPSRGLNKATAEKYRALTSDDKISLLYTTEGKVTSFKERGLSEKTFKFNGPAQTDLFGQS 125
Query: 121 AFSKGGKSVTITEGEFDAMAAYQMLYMSEPCVSVINGSSGAVKDCKRNYEWLDSFEKINI 180
AFSKGGKSVTITEGEFDAMAAYQMLYMSEPCVSVINGSSGAVKDCKRNYEWLDSFEKINI
Sbjct: 126 AFSKGGKSVTITEGEFDAMAAYQMLYMSEPCVSVINGSSGAVKDCKRNYEWLDSFEKINI 185
Query: 181 AFDNDKAGQDAAMAVAELFDPRKVRLVTMTLNDPNDYIKQGRERDFIDAHKKAAPFTPDG 240
AFDNDKAGQDAAMAVAELFDPRKVRLVTMTLNDPNDYIKQGRERDFIDAHKKAAPFTPDG
Sbjct: 186 AFDNDKAGQDAAMAVAELFDPRKVRLVTMTLNDPNDYIKQGRERDFIDAHKKAAPFTPDG 245
Query: 241 IIAGSSLYDLVSTPPEYDCVPYPFSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYS 300
IIAGSSLYDLVSTPPEYDCVPYPFSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYS
Sbjct: 246 IIAGSSLYDLVSTPPEYDCVPYPFSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYS 305
Query: 301 LIQQDKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTDTIYTKEEFDNAYENTLGTGRVF 360
LIQQDKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTDTIYTKEEFDNAYENTLGTGRVF
Sbjct: 306 LIQQDKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTDTIYTKEEFDNAYENTLGTGRVF 365
Query: 361 LYDSFGSNSVERIVSMVRYLARSCDCKFIILDHISIVVSDHAKDERKALDEIATKLKTLT 420
LYDSFGSNSVERIVSMVRYLARSCDCKFIILDHISIVVSDHAKDERKALDEIATKLKTLT
Sbjct: 366 LYDSFGSNSVERIVSMVRYLARSCDCKFIILDHISIVVSDHAKDERKALDEIATKLKTLT 425
Query: 421 VELDICLLMVSHLNRDKNRKPPEEGGTINLQDIRGTAGIGQLSNIIIALERNTQADDELE 480
VELDICLLMVSHLNRDKNRKPPEEGGTINLQDIRGTAGIGQLSNIIIALERNTQADDELE
Sbjct: 426 VELDICLLMVSHLNRDKNRKPPEEGGTINLQDIRGTAGIGQLSNIIIALERNTQADDELE 485
Query: 481 RNTTRVRVIKNRFTGETGVADSLSYSRHTGRLKSYEG 517
RNTTRVRVIKNRFTGETGVADSLSYSRHTGRLKSYEG
Sbjct: 486 RNTTRVRVIKNRFTGETGVADSLSYSRHTGRLKSYEG 522
>ref|NP_570328.1| (NC_003390) RP primase/helicase [Cyanophage P60]
gb|AAL73266.1| (AF338467) RP primase/helicase [Cyanophage P60]
Length = 531
Score = 335 bits (858), Expect = 9e-091
Identities = 204/519 (39%), Positives = 306/519 (58%), Gaps = 25/519 (4%)
Query: 4 EHKKHQPCQSCGSSDGAYPYEDG-IYCHVCKTKTFNDDGDTQVTEQKTLP--PIRGKIMA 60
E +H+PC +C SSD Y DG YC C +++ G Q T P P+ G
Sbjct: 8 EFTRHEPCPNCPSSDALARYSDGGAYCFSCG---YSERGTGQSTHTSRAPMIPLEGDFHR 64
Query: 61 VPSRGLNKATAEKYRALTSDDKISL-LYTTEGKVTSFKERGLSEKTFKFNGP-AQTDLFG 118
+PSR L + T K+ +D +I Y ++ +K RG EK F++ G LFG
Sbjct: 65 IPSRNLTEETCRKFNVRVNDGRIQFPYYDNSRQLVGYKHRG-KEKDFRWQGKNPDNQLFG 123
Query: 119 QSAFSKGGKSVTITEGEFDAMAAYQMLYMSEPCVSVINGSSGAVKDCKRNYEWLDSFEKI 178
Q + KG K++ +TEGEFDA++ +Q + P VSV NG+ A K +L +FE+I
Sbjct: 124 QQLWGKG-KAIVVTEGEFDALSVFQAR-KNWPVVSVPNGAQSAKKALAAQLTYLLNFEEI 181
Query: 179 NIAFDNDKAGQDAAMAVAELFDPRKVRLVTMTLNDPNDYIKQGRERDFIDAHKKA----A 234
+ FDND AG AA LF +V T ++ D + +D+ DA +A +
Sbjct: 182 ILMFDNDDAGIQAAEECVSLFPADRV--FTAFIDGYKDASEAVSAKDY-DAVTQAIWNKS 238
Query: 235 PFTPDGIIAGSSLYDLVSTPPEYDCVPYPFSGLNRMTKGLRTGELITFVAGTGVGKTQVM 294
+TP II G +L+D+V+TP YPF+ LN T GLR GEL+T AG+G GK+ +
Sbjct: 239 TYTPKSIIDGRTLFDVVTTPLHGRDAEYPFAALNSTTGGLRLGELVTITAGSGTGKSTLC 298
Query: 295 REILYSLIQQDKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTDTIYTKEEFDNAYENTL 354
EI SLI QD+ VG + LEE V+ TGL +M++ A KPLHL + + T +E A+++TL
Sbjct: 299 GEIAVSLINQDQ-RVGYIALEESVKRTGLRLMTVAANKPLHLNNELPT-DELRTAFDSTL 356
Query: 355 GTGRVFLYDSFGSNSVERIVSMVRYLARSCDCKFIILDHISIVVS-DHAKDERKALDEIA 413
GTGRVFL D FGS + +++ +R+L ++ + ++I+LDH+SI++S + + DERK +D +
Sbjct: 357 GTGRVFLRDGFGSVDPDSLLNDIRFLTKAHEVQWIVLDHLSILLSGNESTDERKMIDIVM 416
Query: 414 TKLKTLTVELDICLLMVSHLNRDKNRKPPEEGGTINLQDIRGTAGIGQLSNIIIALERNT 473
TKL++ E I ++++SHL R++ K E+G ++L +RG+ I QLS+++IALER+
Sbjct: 417 TKLRSFVEETGIGMILISHLRRNQGDKGHEDGAQVSLSQLRGSHSIAQLSDLVIALERDI 476
Query: 474 QADDELERNTTRVRVIKNRFTGETGVADSLSYSRHTGRL 512
D N +++RV+KNRF G+TG A LSYS TGR+
Sbjct: 477 SKGD----NRSQLRVLKNRFNGQTGPAGGLSYSVETGRM 511
>ref|NP_523315.1| (NC_003298) DNA primase/helicase [Bacteriophage T3]
sp|P20315|PRIM_BPT3 DNA PRIMASE/HELICASE
pir||S07508 DNA primase - phage T3
emb|CAA35135.1| (X17255) 4A [Bacteriophage T3]
emb|CAC86278.1| (AJ318471) DNA primase/helicase [Bacteriophage T3]
Length = 566
Score = 276 bits (706), Expect = 4e-073
Identities = 180/541 (33%), Positives = 275/541 (50%), Gaps = 35/541 (6%)
Query: 8 HQPCQSCGSSDGAYPYEDGI-YCHVCKTKT-FNDDGDTQVTEQK-------------TLP 52
H PC++CGSSDG Y DG +C VC+ + N++ + +++ ++
Sbjct: 12 HAPCENCGSSDGNSVYSDGHEWCFVCEHRVPANEEREAKLSTRRRTGGSKPMSYDVWNFG 71
Query: 53 PIRGKIMAVPSRGLNKATAEK--YRALTSDDK---ISLLYTTEGKVTSFKERGLSEKTFK 107
G+ + +RG++K T +K Y D++ ++ G + S K R +K FK
Sbjct: 72 DSNGRYSDLTARGISKETCQKAGYWLAKVDNRMYQVADYRDQNGSIVSQKVRD-KDKNFK 130
Query: 108 FNGPAQTDLFGQSAFSKGGKSVTITEGEFDAMAAYQMLYMSEPCVSVINGSSGAVKDCKR 167
G ++D GGK + +TEGE DA+ ++ P VS+ +G+S A K C
Sbjct: 131 TTGSHKSDALFLKHLWSGGKKIVVTEGEIDALTVMELQDCKYPVVSLGHGASAAKKTCAA 190
Query: 168 NYEWLDSFEKINIAFDNDKAGQDAAMAVAELFDPRKVRLVTMTLNDPNDYIKQGRERDFI 227
NYE+ D FE+I + FD D AG+ A A++ KVR+ + D N+ G ++ +
Sbjct: 191 NYEYFDQFEQIILMFDMDDAGRKAVEEAAQVLPAGKVRVAVLPCKDANECHIMGEDKAIL 250
Query: 228 DAHKKAAPFTPDGIIAGSSLYDLV-STPPEYDCVPYPF---SGLNRMTKGLRTGELITFV 283
+ A P+ PDG+++ SL D V D V F GLN T G R GE++
Sbjct: 251 EQVWNANPWVPDGVVSALSLKDRVKEAMTSEDAVGLLFDGCQGLNDRTLGARGGEVVMVT 310
Query: 284 AGTGVGKTQVMREILYSLIQQDKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTD----T 339
+G+G+GK+ +R+ + ++ VG LEE V DT MM + + L +D
Sbjct: 311 SGSGMGKSTFVRQQALAWGKRMGKRVGLAMLEESVEDTIQDMMGLNNKVRLRQSDEVKKA 370
Query: 340 IYTKEEFDNAYENTLGTGRVFLYDSFGSNSVERIVSMVRYLARSCDCKFIILDHISIVVS 399
I FD Y+ G LYDSF +R+++ + Y+ C I+LDHISIVVS
Sbjct: 371 IAEDGRFDEWYDELFGDDTFHLYDSFAEAEADRLLAKLAYMRTGLGCDVIVLDHISIVVS 430
Query: 400 -DHAKDERKALDEIATKLKTLTVELDICLLMVSHLNRDKNRKPPEEGGTINLQDIRGTAG 458
DERK +D + TKLK + L+++ HL + K EEG +++ D+RG+
Sbjct: 431 ASEESDERKMIDRLMTKLKGFAKSTGVVLVVICHLKNPEKGKAHEEGRAVSITDLRGSGA 490
Query: 459 IGQLSNIIIALERNTQADDELERNTTRVRVIKNRFTGETGVADSLSYSRHTGRLK--SYE 516
+ QLS+ IIALERN Q D N VR++K RFTG+TG+A + Y+R TG L+ SY
Sbjct: 491 LRQLSDTIIALERNQQGD---MPNLVLVRLLKCRFTGDTGIAGYMEYNRETGWLEPSSYT 547
Query: 517 G 517
G
Sbjct: 548 G 548
>ref|NP_052087.1| (NC_001271) DNA primase/helicase [Bacteriophage phiYeO3-12]
emb|CAB63608.1| (AJ251805) DNA primase/helicase [Bacteriophage phiYeO3-12]
Length = 566
Score = 274 bits (700), Expect = 2e-072
Identities = 184/544 (33%), Positives = 273/544 (49%), Gaps = 41/544 (7%)
Query: 8 HQPCQSCGSSDGAYPYEDG-IYCHVCKTKTFNDDGDTQVTEQKTL-------PPI----- 54
H PC++CGSSD + DG YC+ C+ T GD Q EQ + P+
Sbjct: 12 HIPCENCGSSDANSLFSDGHTYCYACENWT---PGDEQKAEQLSTRRRTGGSKPMSYDVW 68
Query: 55 -----RGKIMAVPSRGLNKATAEK--YRALTSDDK---ISLLYTTEGKVTSFKERGLSEK 104
G+ + +RG++K T +K Y D++ ++ G + S K R +K
Sbjct: 69 NFGDSNGRYSDLTARGISKETCQKAGYWLAKVDNRMYQVADYRDQNGSIVSQKVRD-KDK 127
Query: 105 TFKFNGPAQTDLFGQSAFSKGGKSVTITEGEFDAMAAYQMLYMSEPCVSVINGSSGAVKD 164
FK G ++D GGK + +TEGE DA+ ++ P VS+ +G+S A K
Sbjct: 128 NFKTTGSHKSDALFLKHLWSGGKKIVVTEGEIDALTVMELQDCKYPVVSLGHGASAAKKT 187
Query: 165 CKRNYEWLDSFEKINIAFDNDKAGQDAAMAVAELFDPRKVRLVTMTLNDPNDYIKQGRER 224
C NYE+ D FE+I + FD D AG+ A A++ KVR+ + D N+ G ++
Sbjct: 188 CAANYEYFDQFEQIILMFDMDDAGRKAVEEAAQVLPAGKVRVAVLPYKDANECHIMGEDK 247
Query: 225 DFIDAHKKAAPFTPDGIIAGSSLYDLV-STPPEYDCVPYPF---SGLNRMTKGLRTGELI 280
++ A P+ PDG+++ SL D V D V F GLN T G R GE+I
Sbjct: 248 AILEQVWNANPWVPDGVVSALSLKDRVKEAMTSEDAVGLLFDGCQGLNDRTLGARGGEVI 307
Query: 281 TFVAGTGVGKTQVMREILYSLIQQDKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTD-- 338
+G+G+GK+ +R+ + ++ VG LEE V DT MM + + L +D
Sbjct: 308 MVTSGSGMGKSTFVRQQALAWGKRMGKRVGLAMLEESVEDTIQDMMGLNNKVRLRQSDEV 367
Query: 339 --TIYTKEEFDNAYENTLGTGRVFLYDSFGSNSVERIVSMVRYLARSCDCKFIILDHISI 396
I FD Y+ G LYDSF +R+++ + Y+ C I+LDHISI
Sbjct: 368 KKAIAEDGRFDEWYDELFGDDTFHLYDSFAEAEADRLLAKLAYMRTGLGCDVIVLDHISI 427
Query: 397 VVS-DHAKDERKALDEIATKLKTLTVELDICLLMVSHLNRDKNRKPPEEGGTINLQDIRG 455
VVS DERK +D + TKLK + L+++ HL + K EEG +++ D+RG
Sbjct: 428 VVSASEESDERKMIDRLMTKLKGFAKSTGVVLVVICHLKNPEKGKAHEEGRAVSITDLRG 487
Query: 456 TAGIGQLSNIIIALERNTQADDELERNTTRVRVIKNRFTGETGVADSLSYSRHTGRLK-- 513
+ + QLS+ IIALERN Q D + VR++K RFTG+TG+A + Y+R TG L+
Sbjct: 488 SGALRQLSDTIIALERNQQGD---MPHLVLVRLLKCRFTGDTGIAGYMEYNRETGWLEPS 544
Query: 514 SYEG 517
SY G
Sbjct: 545 SYTG 548
>ref|NP_041975.1| (NC_001604) gene 4A, primase/helicase [14,15] [Enterobacteria phage
T7]
sp|P03692|PRIM_BPT7 DNA PRIMASE/HELICASE
pir||YDBPA7 DNA primase - phage T7
emb|CAA24405.1| (V01146) gene 4A, primase/helicase [14,15] [Enterobacteria phage
T7]
Length = 566
Score = 273 bits (698), Expect = 3e-072
Identities = 179/542 (33%), Positives = 279/542 (51%), Gaps = 38/542 (7%)
Query: 8 HQPCQSCGSSDGAYPYEDG-IYCHVCKTKTFNDDGDTQVTEQKTLPP------------- 53
H PC +CGSSDG + DG +C+VC+ T ++ DT+ K P
Sbjct: 14 HIPCDNCGSSDGNSLFSDGHTFCYVCEKWTAGNE-DTKERASKRKPSGGKPMTYNVWNFG 72
Query: 54 -IRGKIMAVPSRGLNKATAEK--YRALTSDD---KISLLYTTEGKVTSFKERGLSEKTFK 107
G+ A+ +RG++K T +K Y D +++ G + S K R +K FK
Sbjct: 73 ESNGRYSALTARGISKETCQKAGYWIAKVDGVMYQVADYRDQNGNIVSQKVRD-KDKNFK 131
Query: 108 FNGPAQTD-LFGQSAFSKGGKSVTITEGEFDAMAAYQMLYMSEPCVSVINGSSGAVKDCK 166
G ++D LFG+ ++ GGK + +TEGE D + ++ P VS+ +G+S A K C
Sbjct: 132 TTGSHKSDALFGKHLWN-GGKKIVVTEGEIDMLTVMELQDCKYPVVSLGHGASAAKKTCA 190
Query: 167 RNYEWLDSFEKINIAFDNDKAGQDAAMAVAELFDPRKVRLVTMTLNDPNDYIKQGRERDF 226
NYE+ D FE+I + FD D+AG+ A A++ KVR+ + D N+ G +R+
Sbjct: 191 ANYEYFDQFEQIILMFDMDEAGRKAVEEAAQVLPAGKVRVAVLPCKDANECHLNGHDREI 250
Query: 227 IDAHKKAAPFTPDGIIAGSSLYDLV----STPPEYDCVPYPFSGLNRMTKGLRTGELITF 282
++ A P+ PDG+++ SL + + S+ + +G+N T G R GE+I
Sbjct: 251 MEQVWNAGPWIPDGVVSALSLRERIREHLSSEESVGLLFSGCTGINDKTLGARGGEVIMV 310
Query: 283 VAGTGVGKTQVMREILYSLIQQDKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTDT--- 339
+G+G+GK+ +R+ VG LEE V +T ++ + L +D+
Sbjct: 311 TSGSGMGKSTFVRQQALQWGTAMGKKVGLAMLEESVEETAEDLIGLHNRVRLRQSDSLKR 370
Query: 340 -IYTKEEFDNAYENTLGTGRVFLYDSFGSNSVERIVSMVRYLARSCDCKFIILDHISIVV 398
I +FD ++ G LYDSF +R+++ + Y+ C IILDHISIVV
Sbjct: 371 EIIENGKFDQWFDELFGNDTFHLYDSFAEAETDRLLAKLAYMRSGLGCDVIILDHISIVV 430
Query: 399 SDHAK-DERKALDEIATKLKTLTVELDICLLMVSHLNRDKNRKPPEEGGTINLQDIRGTA 457
S + DERK +D + TKLK + L+++ HL K EEG +++ D+RG+
Sbjct: 431 SASGESDERKMIDNLMTKLKGFAKSTGVVLVVICHLKNPDKGKAHEEGRPVSITDLRGSG 490
Query: 458 GIGQLSNIIIALERNTQADDELERNTTRVRVIKNRFTGETGVADSLSYSRHTGRLK--SY 515
+ QLS+ IIALERN Q D N VR++K RFTG+TG+A + Y++ TG L+ SY
Sbjct: 491 ALRQLSDTIIALERNQQGD---MPNLVLVRILKCRFTGDTGIAGYMEYNKETGWLEPSSY 547
Query: 516 EG 517
G
Sbjct: 548 SG 549
>ref|NP_523316.1| (NC_003298) helicase [Bacteriophage T3]
emb|CAA35136.1| (X17255) 4B [Bacteriophage T3]
emb|CAC86279.1| (AJ318471) helicase [Bacteriophage T3]
Length = 504
Score = 255 bits (651), Expect = 9e-067
Identities = 164/478 (34%), Positives = 246/478 (51%), Gaps = 20/478 (4%)
Query: 56 GKIMAVPSRGLNKATAEK--YRALTSDDK---ISLLYTTEGKVTSFKERGLSEKTFKFNG 110
G+ + +RG++K T +K Y D++ ++ G + S K R +K FK G
Sbjct: 13 GRYSDLTARGISKETCQKAGYWLAKVDNRMYQVADYRDQNGSIVSQKVRD-KDKNFKTTG 71
Query: 111 PAQTDLFGQSAFSKGGKSVTITEGEFDAMAAYQMLYMSEPCVSVINGSSGAVKDCKRNYE 170
++D GGK + +TEGE DA+ ++ P VS+ +G+S A K C NYE
Sbjct: 72 SHKSDALFLKHLWSGGKKIVVTEGEIDALTVMELQDCKYPVVSLGHGASAAKKTCAANYE 131
Query: 171 WLDSFEKINIAFDNDKAGQDAAMAVAELFDPRKVRLVTMTLNDPNDYIKQGRERDFIDAH 230
+ D FE+I + FD D AG+ A A++ KVR+ + D N+ G ++ ++
Sbjct: 132 YFDQFEQIILMFDMDDAGRKAVEEAAQVLPAGKVRVAVLPCKDANECHIMGEDKAILEQV 191
Query: 231 KKAAPFTPDGIIAGSSLYDLV-STPPEYDCVPYPF---SGLNRMTKGLRTGELITFVAGT 286
A P+ PDG+++ SL D V D V F GLN T G R GE++ +G+
Sbjct: 192 WNANPWVPDGVVSALSLKDRVKEAMTSEDAVGLLFDGCQGLNDRTLGARGGEVVMVTSGS 251
Query: 287 GVGKTQVMREILYSLIQQDKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTD----TIYT 342
G+GK+ +R+ + ++ VG LEE V DT MM + + L +D I
Sbjct: 252 GMGKSTFVRQQALAWGKRMGKRVGLAMLEESVEDTIQDMMGLNNKVRLRQSDEVKKAIAE 311
Query: 343 KEEFDNAYENTLGTGRVFLYDSFGSNSVERIVSMVRYLARSCDCKFIILDHISIVVS-DH 401
FD Y+ G LYDSF +R+++ + Y+ C I+LDHISIVVS
Sbjct: 312 DGRFDEWYDELFGDDTFHLYDSFAEAEADRLLAKLAYMRTGLGCDVIVLDHISIVVSASE 371
Query: 402 AKDERKALDEIATKLKTLTVELDICLLMVSHLNRDKNRKPPEEGGTINLQDIRGTAGIGQ 461
DERK +D + TKLK + L+++ HL + K EEG +++ D+RG+ + Q
Sbjct: 372 ESDERKMIDRLMTKLKGFAKSTGVVLVVICHLKNPEKGKAHEEGRAVSITDLRGSGALRQ 431
Query: 462 LSNIIIALERNTQADDELERNTTRVRVIKNRFTGETGVADSLSYSRHTGRLK--SYEG 517
LS+ IIALERN Q D N VR++K RFTG+TG+A + Y+R TG L+ SY G
Sbjct: 432 LSDTIIALERNQQGD---MPNLVLVRLLKCRFTGDTGIAGYMEYNRETGWLEPSSYTG 486
>pdb|1CR1|A Chain A, Crystal Structure Of The Helicase Domain Of The Gene 4
Protein Of Bacteriophage T7: Complex With Dttp
pdb|1CR4|A Chain A, Crystal Structure Of The Helicase Domain Of The Gene 4
Protein Of Bacteriophage T7: Complex With Dtdp
pdb|1CR2|A Chain A, Crystal Structure Of The Helicase Domain Of The Gene 4
Protein Of Bacteriophage T7: Complex With Datp
pdb|1CR0|A Chain A, Crystal Structure Of The Helicase Domain Of The Gene4
Protein Of Bacteriophage T7
Length = 296
Score = 156 bits (395), Expect = 4e-037
Identities = 94/260 (36%), Positives = 140/260 (53%), Gaps = 10/260 (3%)
Query: 265 SGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQQDKGNVGTLFLEEPVRDTGLG 324
+G+N T G R GE+I +G+G+GK+ +R+ VG LEE V +T
Sbjct: 23 TGINDKTLGARGGEVIMVTSGSGMGKSTFVRQQALQWGTAMGKKVGLAMLEESVEETAED 82
Query: 325 MMSIQAEKPLHLTDT----IYTKEEFDNAYENTLGTGRVFLYDSFGSNSVERIVSMVRYL 380
++ + L +D+ I +FD ++ G LYDSF +R+++ + Y+
Sbjct: 83 LIGLHNRVRLRQSDSLKREIIENGKFDQWFDELFGNDTFHLYDSFAEAETDRLLAKLAYM 142
Query: 381 ARSCDCKFIILDHISIVVSDHAK-DERKALDEIATKLKTLTVELDICLLMVSHLNRDKNR 439
C IILDHISIVVS + DERK +D + TKLK + L+++ HL
Sbjct: 143 RSGLGCDVIILDHISIVVSASGESDERKMIDNLMTKLKGFAKSTGVVLVVICHLKNPDKG 202
Query: 440 KPPEEGGTINLQDIRGTAGIGQLSNIIIALERNTQADDELERNTTRVRVIKNRFTGETGV 499
K EEG +++ D+RG+ + QLS+ IIALERN Q D N VR++K RFTG+TG+
Sbjct: 203 KAHEEGRPVSITDLRGSGALRQLSDTIIALERNQQGD---MPNLVLVRILKCRFTGDTGI 259
Query: 500 ADSLSYSRHTGRLK--SYEG 517
A + Y++ TG L+ SY G
Sbjct: 260 AGYMEYNKETGWLEPSSYSG 279
>gb|AAD41902.1|AF159357_1 (AF159357) DNA primase [Enterobacteria phage SP6]
Length = 661
Score = 154 bits (390), Expect = 2e-036
Identities = 140/465 (30%), Positives = 217/465 (46%), Gaps = 49/465 (10%)
Query: 91 GKVTSFKERGLSEKTFKFNGPAQT----DLFGQSAFS----KGGKSVT--ITEGEFDAMA 140
GK+ K R L K FKF + D+FG + S KG + T I GE DA+A
Sbjct: 166 GKLVGAKCRTLP-KDFKFGHLGKLFGNQDMFGMNTMSNVLDKGRRKDTLLIVGGELDALA 224
Query: 141 AYQMLYMS--------EPC-VSVINGSSGAVKDCKRNYEWLDSFEKINIAFDNDKAGQDA 191
A QML S +P V IN +++ +N E + F+KI FD D+ GQ
Sbjct: 225 AQQMLLDSAKGTKWEGQPYHVWSINKGEACLEEIVQNREHISQFKKIMWGFDGDEIGQKL 284
Query: 192 AMAVAELFDPRKVRLVTMTLN--DPNDYIKQGRERDFIDAHKKAAPFTPDGIIAGSSLYD 249
A LF P K ++ D N + G+ ++F+DA A + GS +
Sbjct: 285 NQQAARLF-PGKSYIIEYPAGCKDANKALMAGKSKEFVDAWFNAKSSDE---VFGSQIKS 340
Query: 250 LVSTPPEYDC------VPYPFSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQ 303
+ S + + +P+ LN++T G+R +LI AG+GVGKT+ +RE++ LI+
Sbjct: 341 IASQRDKLKAARPEPGLSWPWPRLNKITLGIRKHQLIIVGAGSGVGKTEFLREVVKHLIE 400
Query: 304 QDKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTDT------------IYTKEEFDNAYE 351
+ +VG + E+P+ + +K + L T YT+EE + A +
Sbjct: 401 EHGESVGIISTEDPMVKVSRAFIGKWIDKRIELPPTNDPREDGYREVFDYTEEEANAAID 460
Query: 352 NTLGTGRVFLYDSFGSNSVERIVSMVRYLARSCDCKFIILDHISIVVSDHAKDERK--AL 409
TG++F+ D G S+E+ V + II+D+++ + D K AL
Sbjct: 461 YVADTGKLFVADLEGDYSMEK-VEQTCLEFEAMGISNIIIDNLTGIKLDERNFGGKVGAL 519
Query: 410 DEIATKLKTLTVELDICLLMVSHLNRDK-NRKPPEEGGTINLQDIRGTAGIGQLSNIIIA 468
DE ++ T+ + + +VSHL R R EEGG + L D RG+ IG ++ +
Sbjct: 520 DECVKRIGTIKDRHPVTIFLVSHLTRPSGQRTSHEEGGEVILSDFRGSGAIGFWASYALG 579
Query: 469 LERNTQADDELERNTTRVRVIKNRFTG-ETGVADSLSYSRHTGRL 512
+ERNT+A+ ER TT + +K+R G TG L TGRL
Sbjct: 580 IERNTRAETLDERTTTYISCVKDRDQGIYTGTKVMLKGDVSTGRL 624
>emb|CAA24348.1| (V01127) 4A protein [Enterobacteria phage T7]
Length = 179
Score = 65.9 bits (159), Expect = 1e-009
Identities = 51/168 (30%), Positives = 81/168 (47%), Gaps = 24/168 (14%)
Query: 8 HQPCQSCGSSDGAYPYEDG-IYCHVCKTKTFNDDGDTQVTEQKTLPP------------- 53
H PC +CGSSDG + DG +C+VC+ T ++ DT+ K P
Sbjct: 14 HIPCDNCGSSDGNSLFSDGHTFCYVCEKWTAGNE-DTKERASKRKPSGGKPMTYNVWNFG 72
Query: 54 -IRGKIMAVPSRGLNKATAEK--YRALTSDD---KISLLYTTEGKVTSFKERGLSEKTFK 107
G+ A+ +RG++K T +K Y D +++ G + S K R +K FK
Sbjct: 73 ESNGRYSALTARGISKETCQKAGYWIAKVDGVMYQVADYRDQNGNIVSQKVRD-KDKNFK 131
Query: 108 FNGPAQTD-LFGQSAFSKGGKSVTITEGEFDAMAAYQMLYMSEPCVSV 154
G ++D LFG+ ++ GGK + +TEGE D + ++ P VS+
Sbjct: 132 TTGSHKSDALFGKHLWN-GGKKIVVTEGEIDMLTVMELQDCKYPVVSL 178
>gb|AAF52820.1| (AE003626) CG5924 gene product [Drosophila melanogaster]
Length = 613
Score = 65.9 bits (159), Expect = 1e-009
Identities = 74/331 (22%), Positives = 135/331 (40%), Gaps = 31/331 (9%)
Query: 190 DAAMAVAELFDPRKVRLVTMTLNDPNDYIKQGRERDFIDAHKKAAPFTPDGIIA-GSSLY 248
DAA A A D R+ L+ T +P ++ R + KA P I G+
Sbjct: 283 DAARAFALKLDERRCLLIRPTETEPAPHLALRRRLNLRHILAKATPVQHKAITTFGAMRN 342
Query: 249 DLVSTPPEYDCVP----YPFSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQQ 304
D++S + V F LN++ KG R GEL TG GKT M E L Q
Sbjct: 343 DILSELQNIEKVNGVKWKRFPVLNKLLKGHRRGELTILTGPTGSGKTTFMSEYSLDLAMQ 402
Query: 305 DKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTDTIYTKEEFDNAYENTLGTGRVFLYDS 364
V TL+ +R+T L ++ L D ++ + +E ++
Sbjct: 403 ---GVNTLWGSFEIRNTRLAATLLRQYVGYPLDDRLHEFNHWAAEFERL----PLYFMTF 455
Query: 365 FGSNSVERIVSMVRYLARSCDCKFIILDHISIV--VSDHAKDERKALDEIATKLKTLTVE 422
G ++ ++ + + + D +I+D++ + VS D+ D I ++ +
Sbjct: 456 HGQQPLKPVLEAIEHASYVHDVMHVIIDNLQFMMGVSTFRGDKFFEQDSIIAAFRSFATK 515
Query: 423 LDICLLMVSHLNRDKNRKPPEEGGTINLQDIRGTAGIGQ-LSNIIIALERNTQADDELER 481
++ + +V H +++ + + + GTA Q N++I ++ + R
Sbjct: 516 HNVHVTLVMHPRKER------QEDELTTSSVFGTAKATQEADNVLIIQDKRLTS----VR 565
Query: 482 NTTRVRVIKNRFTGETGV------ADSLSYS 506
+++ KNR++G+ G+ D LSYS
Sbjct: 566 GKKYLQIAKNRYSGDLGIMPLEFDKDGLSYS 596
>ref|NP_391924.1| (NC_000964) replicative DNA helicase [Bacillus subtilis]
sp|P37469|DNAC_BACSU Replicative DNA helicase
pir||S65970 replicative DNA helicase dnaC - Bacillus subtilis
dbj|BAA05176.1| (D26185) replicative DNA helicase [Bacillus subtilis]
emb|CAB16081.1| (Z99124) replicative DNA helicase [Bacillus subtilis]
Length = 454
Score = 62.8 bits (151), Expect = 8e-009
Identities = 61/262 (23%), Positives = 118/262 (44%), Gaps = 10/262 (3%)
Query: 260 VPYPFSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQQDKGNVGTLFLEEPVR 319
+P F+ L+RMT G + +LI A VGKT I ++ + +V LE
Sbjct: 186 IPTGFTELDRMTAGFQRNDLIIVAARPSVGKTAFALNIAQNVATKTDESVAIFSLEMGAE 245
Query: 320 DTGLGMMSIQAE-KPLHLTDTIYTKEEFD--NAYENTLGTGRVFLYDSFGSNSVERIVSM 376
+ M+ + +L T+E++ +L +++ D+ G V I +
Sbjct: 246 QLVMRMLCAEGNINAQNLRTGNLTEEDWGKLTMAMGSLSNSGIYIDDTPGIR-VSEIRAK 304
Query: 377 VRYLARSCDCKFIILDHISIVV-SDHAKDER-KALDEIATKLKTLTVELDICLLMVSHLN 434
R L + I++D++ ++ S +KD R + + EI+ +LK++ EL + ++ +S L+
Sbjct: 305 CRRLKQESGLGMILIDYLQLIQGSGRSKDNRQQEVSEISRELKSIARELQVPVIALSQLS 364
Query: 435 RDKNRKPPEEGGTINLQDIRGTAGIGQLSNIIIALERNTQADDELERNTTRVRVIKNRFT 494
R ++ + + DIR + I Q ++I+ L R+ D E E +I +
Sbjct: 365 RGVEQRQDKRP---MMSDIRESGSIEQDADIVAFLYRDDYYDKETENKNIIEIIIAKQRN 421
Query: 495 GETGVADSLSYSRHTGRLKSYE 516
G G SL++ + + + E
Sbjct: 422 GPVGTV-SLAFVKEYNKFVNLE 442
>ref|NP_214015.1| (NC_000918) replicative DNA helicase [Aquifex aeolicus]
pir||H70427 replicative DNA helicase - Aquifex aeolicus
gb|AAC07414.1| (AE000742) replicative DNA helicase [Aquifex aeolicus]
Length = 468
Score = 62.4 bits (150), Expect = 1e-008
Identities = 52/216 (24%), Positives = 105/216 (48%), Gaps = 8/216 (3%)
Query: 260 VPYPFSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQQDKGNVGTLF-LEEPV 318
+P F+ L+ T G G+LI A G+GKT M I+Y+L +D+G +F LE
Sbjct: 182 LPSGFTELDLKTTGFHPGDLIILAARPGMGKTAFMLSIIYNL-AKDEGKPSAVFSLEMSK 240
Query: 319 RDTGLGMMSIQAEKPL-HLTDTIYTKEEFDNAYENTLGTGRVFLY-DSFGSNSVERIVSM 376
+ ++S+ +E PL + + E+ + + + +Y D + + +
Sbjct: 241 EQLVMRLLSMMSEVPLFKIRSGSISNEDLKKLEASAIELAKYDIYLDDTPALTTTDLRIR 300
Query: 377 VRYLARSCDCKFIILDHISIVVSDHAKDER-KALDEIATKLKTLTVELDICLLMVSHLNR 435
R L + + +F+ +D++ ++ K R + + E++ LK L EL I ++ ++ L+R
Sbjct: 301 ARKLRKEKEVEFVAVDYLQLLRPPVRKSSRQEEVAEVSRNLKALAKELHIPVMALAQLSR 360
Query: 436 DKNRKPPEEGGTINLQDIRGTAGIGQLSNIIIALER 471
+ ++ + L D+R + I Q +++I+ L R
Sbjct: 361 EVEKRSDKRP---QLADLRESGQIEQDADLILFLHR 393
>ref|NP_563550.1| (NC_003366) replicative DNA helicase [Clostridium perfringens]
dbj|BAB82340.1| (AP003194) replicative DNA helicase [Clostridium perfringens]
Length = 444
Score = 56.2 bits (134), Expect = 8e-007
Identities = 54/256 (21%), Positives = 110/256 (42%), Gaps = 9/256 (3%)
Query: 264 FSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQQDKGNVGTLFLEEPVRDTGL 323
F L+ MT G + G+++ A +GKT I ++ +V LE
Sbjct: 189 FKDLDEMTSGFQKGDMVLIAARPSMGKTTFALNIAEHAALREGKSVVIFSLEMSKEQLAY 248
Query: 324 GMMSIQAE-KPLHLTDTIYTKEEFDN--AYENTLGTGRVFLYDSFGSNSVERIVSMVRYL 380
++ +A L L T E++DN L + ++++ D+ G SV + S R L
Sbjct: 249 KLLCSEANVDMLSLRTGKLTPEDWDNIARASGPLASAKIYIDDTAGV-SVMEMRSKCRRL 307
Query: 381 ARSCDCKFIILDHISIVVSDHAKDERKALDEIATKLKTLTVELDICLLMVSHLNRDKNRK 440
I++D++ + +S + ++ + EI+ +K + E++ ++ +S L+R ++
Sbjct: 308 KMEYGIDLILIDYLQL-MSGSGESRQQEVAEISRNIKAIAKEMECPVIALSQLSRAPEQR 366
Query: 441 PPEEGGTINLQDIRGTAGIGQLSNIIIALERNTQADDELERNTTRVRVIKNRFTGETGVA 500
L D+R + I Q +++++ L R+ + E E +I + G G
Sbjct: 367 ADHRP---MLSDLRESGSIEQDADLVMFLYRDEYYNKETEDKNVGECIIAKQRNGPVGTV 423
Query: 501 DSLSYSRHTGRLKSYE 516
L++ + YE
Sbjct: 424 -KLAWLGQYSKFADYE 438
>ref|NP_072756.1| (NC_000908) replicative DNA helicase (dnaB) [Mycoplasma genitalium]
pir||D64210 replicative DNA helicase (dnaB) homolog - Mycoplasma genitalium
gb|AAC71312.1| (U39689) replicative DNA helicase (dnaB) [Mycoplasma genitalium]
Length = 446
Score = 53.9 bits (128), Expect = 4e-006
Identities = 49/236 (20%), Positives = 107/236 (44%), Gaps = 22/236 (9%)
Query: 264 FSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQQ-DKGNVGTLFLEEPVRDTG 322
F L+++T G + GELI A G GKT ++ + S +Q D LF +R+
Sbjct: 178 FLKLDQLTSGWKPGELIVIAARPGRGKTALLINFMASAAKQIDPKTDVVLFFSLEMRN-- 235
Query: 323 LGMMSIQAEKPLHLTDTIYT---KEEFDNAYENTLGTGR------VFLYDSFGSNSVERI 373
I +H + T YT ++ +N +E + + L+D + S +++ I
Sbjct: 236 ---REIYQRHLMHESQTSYTLTNRQRINNVFEELMEASSRIKNLPIKLFD-YSSLTLQEI 291
Query: 374 VSMVRYLARSCDCKFIILDHISIV---VSDHAKDERKALDEIATKLKTLTVELDICLLMV 430
+ + ++++ + + +I+D++ +V +++ ++ + I+ LK E + ++
Sbjct: 292 RNQITEVSKTSNVRLVIIDYLQLVNALKNNYGLTRQQEVTMISQSLKAFAKEFNTPIIAA 351
Query: 431 SHLNRDKNRKPPEEGGTINLQDIRGTAGIGQLSNIIIALERNTQADDELERNTTRV 486
+ L+R R + L D+R + I Q +++++ + R E E T +
Sbjct: 352 AQLSR---RIEERKDSRPILSDLRESGSIEQDADMVLFIHRTNDDKKEQEEENTNL 404
>ref|NP_108071.1| (NC_002678) replicative DNA helicase (dnaB) [Mesorhizobium loti]
dbj|BAB54216.1| (AP003012) replicative DNA helicase; DnaB [Mesorhizobium loti]
Length = 487
Score = 49.3 bits (116), Expect = 1e-004
Identities = 51/225 (22%), Positives = 99/225 (43%), Gaps = 24/225 (10%)
Query: 267 LNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQ--------------QDKGNVGTL 312
L+R GL++ +LI G+GKT + I +++ + + G VG
Sbjct: 196 LDRRMGGLQSSDLIVLAGRPGMGKTSLATNIAFNIAEAYVPAQQADGSFKAANGGVVGFF 255
Query: 313 FLEEPVRDTGLGMMSIQAE-KPLHLTDTIYTKEEFDN--AYENTLGTGRVFLYDSFGSNS 369
LE ++S Q E + ++ +F+ A T+ +F+ D G S
Sbjct: 256 SLEMSSEQLATRIISEQTEISSSKIRRGEISEMDFEKLVACSQTMQKIPLFI-DQTGGIS 314
Query: 370 VERIVSMVRYLARSCDCKFIILDHISIVVSDHAKDER---KALDEIATKLKTLTVELDIC 426
+ ++ + R L R I++D+I ++ A+ + + + EI T LK L EL +
Sbjct: 315 IAQLSARARRLKRQRGLDLIVIDYIQLMQGSSARASQNRVQEITEITTGLKALAKELAVP 374
Query: 427 LLMVSHLNRDKNRKPPEEGGTINLQDIRGTAGIGQLSNIIIALER 471
++ +S L+R + E L D+R + I Q +++++ + R
Sbjct: 375 IIALSQLSRQVESR---EDKRPQLSDLRESGSIEQDADVVMFVYR 416
>ref|NP_439720.1| (NC_000907) replicative DNA helicase (dnaB) [Haemophilus influenzae
Rd]
sp|P45256|DNAB_HAEIN Replicative DNA helicase
pir||D64130 dnaB protein - Haemophilus influenzae (strain Rd KW20)
gb|AAC23217.1| (U32831) replicative DNA helicase (dnaB) [Haemophilus influenzae
Rd]
Length = 504
Score = 48.9 bits (115), Expect = 1e-004
Identities = 53/241 (21%), Positives = 101/241 (40%), Gaps = 9/241 (3%)
Query: 264 FSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQQDKGNVGTLFLEEPVRDTGL 323
F+ L++ T GL+ +LI A +GKT + + + V LE P +
Sbjct: 241 FTDLDKKTAGLQPSDLIIVAARPSMGKTTFAMNLCENAAMASEKPVLVFSLEMPAEQIMM 300
Query: 324 GMMSIQAEKPLHLTDTIYTKEEFD-NAYENTLG----TGRVFLYDSFGSNSVERIVSMVR 378
M++ A T +E + N + +G +F+ DS G + R
Sbjct: 301 RMIASLARVDQTKIRTGQNLDEIEWNKIASVVGMFKQKNNLFIDDSSGLTPTDVRSRARR 360
Query: 379 YLARSCDCKFIILDHISIVVSDHAKDERK-ALDEIATKLKTLTVELDICLLMVSHLNRDK 437
+ I++D++ ++ + D R + EI+ LK L EL + ++ +S LNR
Sbjct: 361 VYRENGGLSMIMVDYLQLMRAPAFSDNRTLEIAEISRSLKALAKELQVPVVALSQLNRTL 420
Query: 438 NRKPPEEGGTINLQDIRGTAGIGQLSNIIIALERNTQADDELERNTTRVRVIKNRFTGET 497
++ + +N D+R + I Q +++I+ + R+ +D E +I + G
Sbjct: 421 EQRGDKR--PVN-SDLRESGSIEQDADLIMFIYRDEVYNDNSEDKGVAEIIIGKQRNGPI 477
Query: 498 G 498
G
Sbjct: 478 G 478
>ref|NP_543071.1| (NC_003356) putative helicase [Bacteriophage P27]
emb|CAC83537.1| (AJ298298) putative helicase [Bacteriophage P27]
Length = 463
Score = 45.8 bits (107), Expect = 0.001
Identities = 53/247 (21%), Positives = 105/247 (42%), Gaps = 10/247 (4%)
Query: 267 LNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQQDKGNVGTLFLEEPVRDTGLGMM 326
L+ T G+ G+L+ A +GKT++ +I+ + +Q G V +E G M+
Sbjct: 184 LDAKTGGMEPGDLVFIAARPSMGKTELALDIIDKVTEQGHG-VLLFTMEMANIQIGERMV 242
Query: 327 SIQAEKPLHLTDTIYTKEEFDNA-YENTLG--TGR-VFLYDSFGSNSVERIVSMVRYLAR 382
S P+ ++ E+ D A + +G TGR +++ D E + ++ +
Sbjct: 243 SAAGGMPVSRLKSVAHFEDEDWARFSQGVGRMTGRNIWMVDQANLTIDEICATTKHHMIK 302
Query: 383 SCDCKFIILDHISIVVSDHAKDERKALDEIATKLKTLTVELDICLLMVSHLNRDKNRKPP 442
+ +++D++ ++ + A+ EI+ LK L L+ +S L+R +P
Sbjct: 303 HPETALVVVDYLGLIKTRTTGRHDLAVGEISKGLKGLAKSGGFPLIALSQLSRGVESRPN 362
Query: 443 EEGGTINLQDIRGTAGIGQLSNIIIALERN-TQADDELERNTTRVRVIKNRFTGETGVAD 501
+ D++ + I ++II+ L R+ D R + + K R G G
Sbjct: 363 KRPMN---SDLKNSGEIEADADIILMLYRDEVYNPDTQARGIAEINITKQR-NGSLGTIY 418
Query: 502 SLSYSRH 508
Y+ H
Sbjct: 419 RRFYNGH 425
>ref|NP_458617.1| (NC_003198) DNA helicase [Salmonella enterica subsp. enterica
serovar Typhi]
emb|CAD09305.1| (AL627282) DNA helicase [Salmonella enterica subsp. enterica
serovar Typhi]
Length = 454
Score = 45.1 bits (105), Expect = 0.002
Identities = 43/215 (20%), Positives = 99/215 (46%), Gaps = 9/215 (4%)
Query: 264 FSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYSLIQQDKGNVGTLF-LEEPVRDTG 322
F+ L+ T G + G+LI A +GKT + + ++ G + +E P +
Sbjct: 191 FAELDEKTCGWQDGDLILLGARPSMGKTAEALKHAVAALEGSPGKTVQFYSIEMPTQQLI 250
Query: 323 LGMMSIQAEKPL-HLTDTIYTKEEF----DNAYENTLGTGRVFLYDSFGSNSVERIVSMV 377
+ ++S+ A P +L + ++ ++ D + T GR+ + D+ +S
Sbjct: 251 MRLLSMLARVPFDNLRKGLLSEGQWSLLADAVAKLTSWEGRLLIDDTSYQTPSTLRISAR 310
Query: 378 RYLARSCDCKFIILDHISIVVSDHAKDERKALDEIATKLKTLTVELDICLLMVSHLNRDK 437
R + + I++D++ ++ ++ + + EI+ LK+L E+ + +S LNR
Sbjct: 311 RNVRKYGQPSLILVDYLQLMSCPGRENRTQEIQEISRSLKSLAKEIRCPVAALSQLNRSV 370
Query: 438 NRKPPEEGGTINLQDIRGTAGIGQLSNIIIALERN 472
++ + + D+R + + Q +++I+ L R+
Sbjct: 371 EQRQDKRP---MVSDLRDSGSLEQDADVIMFLYRD 402
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.316 0.135 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 281,313,283
Number of Sequences: 891141
Number of extensions: 11784619
Number of successful extensions: 28065
Number of sequences better than 5.0e-02: 104
Number of HSP's better than 0.1 without gapping: 14
Number of HSP's successfully gapped in prelim test: 90
Number of HSP's that attempted gapping in prelim test: 27966
Number of HSP's gapped (non-prelim): 115
length of query: 517
length of database: 279,076,897
effective HSP length: 125
effective length of query: 392
effective length of database: 167,684,272
effective search space: 65732234624
effective search space used: 65732234624
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 93 (40.4 bits)
Query= orf24 [4981-5055]
(73 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.315 0.140 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 43,508,978
Number of Sequences: 891141
Number of extensions: 1584736
Number of successful extensions: 3554
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3554
Number of HSP's gapped (non-prelim): 0
length of query: 73
length of database: 279,076,897
effective HSP length: 49
effective length of query: 24
effective length of database: 235,410,988
effective search space: 5649863712
effective search space used: 5649863712
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf25 [5057-5173]
(115 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.321 0.139 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 73,042,536
Number of Sequences: 891141
Number of extensions: 2946156
Number of successful extensions: 6541
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 6541
Number of HSP's gapped (non-prelim): 0
length of query: 115
length of database: 279,076,897
effective HSP length: 91
effective length of query: 24
effective length of database: 197,983,066
effective search space: 4751593584
effective search space used: 4751593584
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 83 (36.6 bits)
Query= orf26 [5167-5748]
(580 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064753.1| (NC_002519) RP DNA pol [Roseophage SIO1] >... 1180 0.0
ref|NP_570330.1| (NC_003390) DNA polymerase [Cyanophage P60... 209 6e-053
ref|NP_052093.1| (NC_001271) DNA polymerase [Bacteriophage ... 206 5e-052
ref|NP_523320.1| (NC_003298) DNA polymerase [Bacteriophage ... 202 1e-050
ref|NP_041982.1| (NC_001604) gene 5, DNA polymerase [Entero... 197 2e-049
pdb|1T7P|A Chain A, T7 Dna Polymerase Complexed To Dna Prim... 196 4e-049
ref|NP_052094.1| (NC_001271) 5B protein [Bacteriophage phiY... 172 7e-042
ref|NP_523322.1| (NC_003298) gene 5B protein [Bacteriophage... 171 3e-041
ref|NP_229419.1| (NC_000853) DNA-directed DNA polymerase I ... 113 6e-024
gb|AAD28505.1|AF121780_1 (AF121780) DNA polymerase I [Rhodo... 110 5e-023
ref|NP_254180.1| (NC_002516) DNA polymerase I [Pseudomonas ... 109 9e-023
gb|AAG43104.1|AF038544_1 (AF038544) DNA polymerase I [Brach... 108 2e-022
gb|AAG43102.1|AF038542_1 (AF038542) DNA polymerase I [Pseud... 107 3e-022
sp|Q9RAA9|DPO1_RICFE DNA polymerase I (POL I) >gi|5912337|e... 104 2e-021
ref|NP_283286.1| (NC_003116) DNA polymerase I [Neisseria me... 103 4e-021
gb|AAG43148.1|AF063186_1 (AF063186) DNA polymerase I [Rhodo... 103 5e-021
ref|NP_274975.1| (NC_003112) DNA polymerase I [Neisseria me... 102 1e-020
sp|O52225|DPO1_THEFI DNA POLYMERASE I, THERMOSTABLE (TFI PO... 101 2e-020
gb|AAF24859.2|AF050678_1 (AF050678) DNA polymerase I [Therm... 99 9e-020
pdb|1KFD| Dna Polymerase I (Klenow Fragment) (E.C.2.7.7.7)... 98 3e-019
ref|NP_418300.1| (NC_000913) DNA polymerase I, 3' --> 5' po... 98 3e-019
ref|NP_290488.1| (NC_002655) DNA polymerase I, 3' --> 5' po... 97 4e-019
emb|CAB56093.1| (AJ238761) DNA polymerase I [Rickettsia rhi... 97 5e-019
ref|NP_458049.1| (NC_003198) DNA polymerase I [Salmonella e... 95 2e-018
ref|NP_268299.1| (NC_002662) DNA polymerase I (EC 2.7.7.7) ... 94 5e-018
ref|NP_229767.1| (NC_002505) DNA polymerase I [Vibrio chole... 91 3e-017
ref|NP_295430.1| (NC_001263) DNA-directed DNA polymerase [D... 91 3e-017
pdb|5KTQ|A Chain A, Large Fragment Of Taq Dna Polymerase Bo... 90 6e-017
sp|P19821|DPO1_THEAQ DNA polymerase I, thermostable (Taq po... 90 6e-017
pdb|4KTQ|A Chain A, Binary Complex Of The Large Fragment Of... 90 6e-017
ref|NP_372214.1| (NC_002758) DNA polymerase I [Staphylococc... 87 6e-016
ref|NP_188690.1| (NM_112946) DNA polymerase, putative [Arab... 45 0.002
>ref|NP_064753.1| (NC_002519) RP DNA pol [Roseophage SIO1]
gb|AAG02598.1|AF189021_17 (AF189021) Roseophage SIO1 complete genome
Length = 580
Score = 1180 bits (3053), Expect = 0.0
Identities = 580/580 (100%), Positives = 580/580 (100%)
Query: 1 MEVVFDIETDALDATVIHVLVAKRVGQKGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIP 60
MEVVFDIETDALDATVIHVLVAKRVGQKGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIP
Sbjct: 1 MEVVFDIETDALDATVIHVLVAKRVGQKGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIP 60
Query: 61 QIKKLWGYGIPLKDVADTLVMSRLLDPTRKGGHSLDALSGNEKIDFHDFSTYTPEMLAYC 120
QIKKLWGYGIPLKDVADTLVMSRLLDPTRKGGHSLDALSGNEKIDFHDFSTYTPEMLAYC
Sbjct: 61 QIKKLWGYGIPLKDVADTLVMSRLLDPTRKGGHSLDALSGNEKIDFHDFSTYTPEMLAYC 120
Query: 121 KQDVAINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLR 180
KQDVAINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLR
Sbjct: 121 KQDVAINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLR 180
Query: 181 ETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCNVQGDYTKIAWEEFNLGS 240
ETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCNVQGDYTKIAWEEFNLGS
Sbjct: 181 ETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCNVQGDYTKIAWEEFNLGS 240
Query: 241 PAQVNKRLDRLGWKPTVKTKSGNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQ 300
PAQVNKRLDRLGWKPTVKTKSGNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQ
Sbjct: 241 PAQVNKRLDRLGWKPTVKTKSGNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQ 300
Query: 301 EWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQ 360
EWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQ
Sbjct: 301 EWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQ 360
Query: 361 AFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVIDGDIHTFNKEALGRYCKDRPTAK 420
AFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVIDGDIHTFNKEALGRYCKDRPTAK
Sbjct: 361 AFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVIDGDIHTFNKEALGRYCKDRPTAK 420
Query: 421 TFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKSQASQAASMGWMKGLD 480
TFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKSQASQAASMGWMKGLD
Sbjct: 421 TFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKSQASQAASMGWMKGLD 480
Query: 481 GRVLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDED 540
GRVLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDED
Sbjct: 481 GRVLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDED 540
Query: 541 QAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
QAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH
Sbjct: 541 QAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
>ref|NP_570330.1| (NC_003390) DNA polymerase [Cyanophage P60]
gb|AAL73268.1| (AF338467) DNA polymerase [Cyanophage P60]
Length = 587
Score = 209 bits (532), Expect = 6e-053
Identities = 182/628 (28%), Positives = 282/628 (43%), Gaps = 89/628 (14%)
Query: 1 MEVVFDIETDAL--DATVIHVLVAKRVG---QKGFYVVRDAETFKR----LAKQVTLWIG 51
M++ FDIETD L D T+IH +VA+ + + F D + K L+K LW G
Sbjct: 1 MKLAFDIETDGLIPDLTIIHCIVARDIDTDEEFRFDGTGDYPSIKEGLELLSKADELW-G 59
Query: 52 HNVIGFDIPQIKKLWGYGIPLKDVADTLVMSRL----------------LDPTRKGGHSL 95
HN++ +D P I+KL P DTL++SRL + G HSL
Sbjct: 60 HNIVNYDYPAIQKLHPDWTPPSCTRDTLILSRLFFTDLLDRDFRSRPALMPGNLYGRHSL 119
Query: 96 DALS---GNEKIDFH-----DFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASIQ 147
+A G+ K +F D+STY+PEML YC +DV ++ + ++ + + S+
Sbjct: 120 EAWGHRLGHHKSEFGKSLEGDWSTYSPEMLEYCARDVEVSVALAKTFVPKIPEY-QWSVD 178
Query: 148 LEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKR 207
EH++ I+ QE+ GF D A+ + E + + +++E + V + KR
Sbjct: 179 TEHEIARIMSWQEQMGFPFDVRAAQALEGKLRLELDTLSDDMRETFHFVDGGV--MTPKR 236
Query: 208 KKDGSIYSNQLLEGCNVQGDYTKIAWEEFNLGSPAQVNKRLDR-LGWKPTVKTKSGNSYK 266
Y + K+ EFN S + + GW+P +T G K
Sbjct: 237 SNKVRHYFEN--------APFCKL--REFNPTSRHHIAWAFEHHRGWEPKERTAGGQP-K 285
Query: 267 ICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQ------EWLQKSQETGRVHGRVILTG 320
I E L I + L ++LE + L Q WL K + GR+H +L
Sbjct: 286 IDDEILREI-----NTKESLAFARILELQKHLGQLSEGKNAWL-KLERKGRLHHSCVLN- 338
Query: 321 AVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQL 380
T R AH PN+A +PS E R FK + L VG+DA+G++L
Sbjct: 339 TNTGRQAHMRPNLAQVPSAH----------------EYRSLFKPSDNHLQVGSDASGLEL 382
Query: 381 RVLAHYMN---DPIYTEQVIDGDIHTFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIAS 437
R L HY++ + E+V++GDIHT E G DR + K Y + G G +
Sbjct: 383 RCLGHYLSRYDGGKFAEEVVNGDIHTALAEIYGT---DRKSGKGVTYCLIYGGGNHKLGL 439
Query: 438 ILGCN----NRQANEAMANFYEAIPSLKKLKSQASQAASMGWMKGLDGRVLRI-GSDHLA 492
G + +R+ E + + L + + A G +KGLDGR +R+ G +H A
Sbjct: 440 TAGASKSSASRKGQEIRGKIMQGLSGFADLNAAIQERAKSGVLKGLDGRPIRLQGKNHAA 499
Query: 493 LSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQA 552
L+ LQ +I +L + ++ GI++ A+VHDE Q V DQA+ ++ A
Sbjct: 500 LNYLLQSAGAIICKLWVIRTHELLQEAGIDYYPLAFVHDEQQLSVRADQAEMAAQLTTLA 559
Query: 553 IKDAGTFFKLNCPMDGEAKIGKNWLETH 580
+KD K C +D E +IG +W + H
Sbjct: 560 MKDVEHQVKFRCALDSEYQIGNSWADCH 587
>ref|NP_052093.1| (NC_001271) DNA polymerase [Bacteriophage phiYeO3-12]
emb|CAB63614.1| (AJ251805) DNA polymerase [Bacteriophage phiYeO3-12]
Length = 704
Score = 206 bits (524), Expect = 5e-052
Identities = 184/688 (26%), Positives = 292/688 (41%), Gaps = 149/688 (21%)
Query: 24 RVGQKGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIPQIKKLW------GYGIPLKDVAD 77
R G G Y+ + + K+ L + HN +D+P ++KL + +P ++ D
Sbjct: 35 RPGDFGAYL----DALEAEVKRGGLIVFHNGHKYDVPALEKLAKLQLNRDFKLPRENCID 90
Query: 78 TLVMSRLLDPTRK-----------------GGHSLDALS--------------------- 99
TLV+SRLL K G H+L+A
Sbjct: 91 TLVLSRLLHANLKDTDMGLLRSGKLPGRRFGSHALEAWGYRLGEMKGEYKDDFKAMLEEQ 150
Query: 100 GNEKIDFHDFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKA--------------- 144
G E +D ++ ++ M+AY QDV + K+ + +L F
Sbjct: 151 GEEYVDGAEWVSFNEPMMAYNVQDVVVTVKLLEKFLTDLHYFPAGMDFTKYDADLFWREA 210
Query: 145 --SIQLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPV-- 200
S+ +EH+ ++ +QE+NGF +T EE+Y + + ++ E P
Sbjct: 211 GESVDIEHRAAWLLAKQERNGFPFNTQAIEELYVELAAKRAELLRKLTETFGSWYQPKGG 270
Query: 201 KEVI--------------IKRKKDGSIYSN-------QLLEGCNVQ-------GDYTKIA 232
KE+ + K G +Y + LE C++ YT +
Sbjct: 271 KELFKHPRTGKPLPSYPRVVYPKQGGVYKKPRNKAQREGLEPCDLDTRDYMEGAPYTPVE 330
Query: 233 WEEFNLGSPAQVNKRLDRLGWKPTVKTKSGNSYKICPENLATIPDTAPE---AVKGLKAW 289
+ F S + K+L GW P T+ G + E L + PE + +K +
Sbjct: 331 FVTFKPSSRDHIQKKLQEAGWVPEKFTEKGAPV-VDDEVLEHVRVDDPEKQACIDLIKEY 389
Query: 290 KVLETRWKLAQE----WLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDG 345
+++ R A E WL+ Q+ G++HG V GAVT RA H PN+A +P V
Sbjct: 390 LMIQKRIGQAAEGDKAWLRYVQDDGKIHGAVNPNGAVTGRATHSFPNLAQVPGV------ 443
Query: 346 ILWKMEGMYGAECRQAFKVPE------GK--LLVGTDAAGIQLRVLAHYM---NDPIYTE 394
YG +CR AF GK + G DA+G++LR LAH+M ++ Y
Sbjct: 444 -----RSPYGEQCRSAFGAEHHLDGITGKPWVQAGIDASGLELRCLAHFMARFDNGEYAN 498
Query: 395 QVIDGDIHTFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFY 454
++++GDIHT N+ A +D AKTFIY FL GAG I I+G + E F
Sbjct: 499 EILNGDIHTKNQNAAELPTRD--NAKTFIYGFLYGAGDEKIGQIVGAGKERGKELKKKFL 556
Query: 455 EAIPSLKKLK-----SQASQAASMG----------WMKGLDGRVLRIGSDHLALSVYLQG 499
E P++ L+ S + +A +G W+KGLDGR + + S H AL+ LQ
Sbjct: 557 ENTPAIAALREAIQQSLVNSSAWIGGEQKVQWKRRWIKGLDGRKVHVRSPHAALNTLLQS 616
Query: 500 GETVIMRLANVFWQRQAKKEGI------NFKQCAWVHDEWQTEV-DEDQAQRLGEIQVQA 552
+I +L + + ++G+ +F Q W+HDE Q E+ A+ + E+ +A
Sbjct: 617 AGALICKLWIIKTEEMLIEKGLKHGWDGDFAQMGWIHDEVQIACRTEEVAKVVVEVAQEA 676
Query: 553 IKDAGTFFKLNCPMDGEAKIGKNWLETH 580
++ G + C +D E KIG NW H
Sbjct: 677 MRWVGEHWNFRCRLDTEGKIGPNWAVCH 704
>ref|NP_523320.1| (NC_003298) DNA polymerase [Bacteriophage T3]
sp|P20311|DPOL_BPT3 DNA POLYMERASE
pir||S07512 DNA-directed DNA polymerase (EC 2.7.7.7) - phage T3
emb|CAA35140.1| (X17255) 5 [Bacteriophage T3]
emb|CAC86283.1| (AJ318471) DNA polymerase [Bacteriophage T3]
Length = 704
Score = 202 bits (513), Expect = 1e-050
Identities = 184/690 (26%), Positives = 288/690 (41%), Gaps = 153/690 (22%)
Query: 24 RVGQKGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIPQIKKLWG------YGIPLKDVAD 77
R G G Y+ + + K+ L + HN +D+P + KL + +P ++ D
Sbjct: 35 RPGDFGAYL----DALEAEVKRGGLIVFHNGHKYDVPALTKLAKLQLNREFHLPRENCID 90
Query: 78 TLVMSRLLDPTRK-----------------GGHSLDALS--------------------- 99
TLV+SRL+ K G H+L+A
Sbjct: 91 TLVLSRLIHSNLKDTDMGLLRSGKLPGKRFGSHALEAWGYRLGEMKGEYKDDFKRMLEEQ 150
Query: 100 GNEKIDFHDFSTYTPEMLAYCKQDVAI---------NEKVYLQLKEELSNFGKAS----- 145
G E +D ++ + EM+ Y QDV + ++K Y + + ++ G +
Sbjct: 151 GEEYVDGMEWWNFNEEMMDYNVQDVVVTKALLEKLLSDKHYFPPEIDFTDVGYTTFWSES 210
Query: 146 ---IQLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLRETNRIEAEIKE----------- 191
+ +EH+ ++ +QE+NGF DT EE+Y + + + E
Sbjct: 211 LEAVDVEHRAAWLLAKQERNGFPFDTKAIEELYVELAARRSELLRNLTETFGSWYQPKGG 270
Query: 192 ---FMVP-----------IAVPVKEVIIKRKKDGSIYSNQLLEGCNVQ-------GDYTK 230
F P I +P I K+ K+ + + E C + YT
Sbjct: 271 TEMFCHPRTGKPLPKYPRIKIPKVGGIFKKPKNKAQREGR--EPCELDTREYVAGAPYTP 328
Query: 231 IAWEEFNLGSPAQVNKRLDRLGWKPTVKTKSGNSYKICPENLATIPDTAPE---AVKGLK 287
+ FN S + K+L GW PT T G + E L + PE A+ +K
Sbjct: 329 VEHVVFNPSSRDHIQKKLQEAGWVPTKFTDKGAPV-VDDEVLEGVRVDDPEKQAAIDLIK 387
Query: 288 AWKVLETRWKLAQE----WLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGK 343
+ +++ R + E WL+ E G++HG V GAVT RA H PN+A IP V
Sbjct: 388 EYLMIQKRIGQSAEGDKAWLRYVAEDGKIHGSVNPNGAVTGRATHAFPNLAQIPGV---- 443
Query: 344 DGILWKMEGMYGAECRQAFKVPE------GK--LLVGTDAAGIQLRVLAHYM---NDPIY 392
YG +CR AF GK + G DA+G++LR LAH+M ++ Y
Sbjct: 444 -------RSPYGEQCRAAFGAEHHLDGITGKPWVQAGIDASGLELRCLAHFMARFDNGEY 496
Query: 393 TEQVIDGDIHTFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMAN 452
++++GDIHT N+ A +D AKTFIY FL GAG I I+G + E
Sbjct: 497 AHEILNGDIHTKNQMAAELPTRD--NAKTFIYGFLYGAGDEKIGQIVGAGKERGKELKKK 554
Query: 453 FYEAIPSLKKLKSQASQAA---------------SMGWMKGLDGRVLRIGSDHLALSVYL 497
F E P++ L+ Q W+KGLDGR + + S H AL+ L
Sbjct: 555 FLENTPAIAALRESIQQTLVESSQWVAGEQQVKWKRRWIKGLDGRKVHVRSPHAALNTLL 614
Query: 498 QGGETVIMRLANVFWQRQAKKEGI------NFKQCAWVHDEWQTEV-DEDQAQRLGEIQV 550
Q +I +L + + ++G+ +F AW+HDE Q E+ A+ + E+
Sbjct: 615 QSAGALICKLWIIKTEEMLVEKGLKHGWDGDFAYMAWIHDEIQVACRTEEIAKTVIEVAQ 674
Query: 551 QAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
+A++ G + C +D E K+G NW E H
Sbjct: 675 EAMRWVGEHWNFRCLLDTEGKMGANWKECH 704
>ref|NP_041982.1| (NC_001604) gene 5, DNA polymerase [Enterobacteria phage T7]
sp|P00581|DPOL_BPT7 DNA POLYMERASE (T7 DNA POLYMERASE)
pir||DJBPT7 DNA-directed DNA polymerase (EC 2.7.7.7) phage chain - phage T7
emb|CAA24412.1| (V01146) gene 5, DNA polymerase [Enterobacteria phage T7]
Length = 704
Score = 197 bits (502), Expect = 2e-049
Identities = 179/664 (26%), Positives = 273/664 (40%), Gaps = 145/664 (21%)
Query: 48 LWIGHNVIGFDIPQIKKLWG------YGIPLKDVADTLVMSRLLDPTRK----------- 90
L + HN +D+P + KL + +P ++ DTLV+SRL+ K
Sbjct: 55 LIVFHNGHKYDVPALTKLAKLQLNREFHLPRENCIDTLVLSRLIHSNLKDTDMGLLRSGK 114
Query: 91 ------GGHSLDALS---------------------GNEKIDFHDFSTYTPEMLAYCKQD 123
G H+L+A G E +D ++ + EM+ Y QD
Sbjct: 115 LPGKRFGSHALEAWGYRLGEMKGEYKDDFKRMLEEQGEEYVDGMEWWNFNEEMMDYNVQD 174
Query: 124 VAI---------NEKVYLQLKEELSNFGKAS--------IQLEHQMQAIVCEQEKNGFML 166
V + ++K Y + + ++ G + + +EH+ ++ +QE+NGF
Sbjct: 175 VVVTKALLEKLLSDKHYFPPEIDFTDVGYTTFWSESLEAVDIEHRAAWLLAKQERNGFPF 234
Query: 167 DTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPV----------------KEVIIKRKKD 210
DT EE+Y + + ++ E P K IK K
Sbjct: 235 DTKAIEELYVELAARRSELLRKLTETFGSWYQPKGGTEMFCHPRTGKPLPKYPRIKTPKV 294
Query: 211 GSIYSN-------QLLEGCNVQ-------GDYTKIAWEEFNLGSPAQVNKRLDRLGWKPT 256
G I+ + E C + YT + FN S + K+L GW PT
Sbjct: 295 GGIFKKPKNKAQREGREPCELDTREYVAGAPYTPVEHVVFNPSSRDHIQKKLQEAGWVPT 354
Query: 257 VKTKSGNSYKICPENLATIPDTAPE---AVKGLKAWKVLETRWKLAQE----WLQKSQET 309
T G + E L + PE A+ +K + +++ R + E WL+ E
Sbjct: 355 KYTDKGAPV-VDDEVLEGVRVDDPEKQAAIDLIKEYLMIQKRIGQSAEGDKAWLRYVAED 413
Query: 310 GRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPE--- 366
G++HG V GAVT RA H PN+A IP V YG +CR AF
Sbjct: 414 GKIHGSVNPNGAVTGRATHAFPNLAQIPGV-----------RSPYGEQCRAAFGAEHHLD 462
Query: 367 ---GK--LLVGTDAAGIQLRVLAHYM---NDPIYTEQVIDGDIHTFNKEALGRYCKDRPT 418
GK + G DA+G++LR LAH+M ++ Y ++++GDIHT K + R
Sbjct: 463 GITGKPWVQAGIDASGLELRCLAHFMARFDNGEYAHEILNGDIHT--KNQIAAELPTRDN 520
Query: 419 AKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKSQASQAA------- 471
AKTFIY FL GAG I I+G + E F E P++ L+ Q
Sbjct: 521 AKTFIYGFLYGAGDEKIGQIVGAGKERGKELKKKFLENTPAIAALRESIQQTLVESSQWV 580
Query: 472 --------SMGWMKGLDGRVLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGI-- 521
W+KGLDGR + + S H AL+ LQ +I +L + + ++G+
Sbjct: 581 AGEQQVKWKRRWIKGLDGRKVHVRSPHAALNTLLQSAGALICKLWIIKTEEMLVEKGLKH 640
Query: 522 ----NFKQCAWVHDEWQTEV-DEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNW 576
+F AWVHDE Q E+ AQ + E +A++ G + C +D E K+G NW
Sbjct: 641 GWDGDFAYMAWVHDEIQVGCRTEEIAQVVIETAQEAMRWVGDHWNFRCLLDTEGKMGPNW 700
Query: 577 LETH 580
H
Sbjct: 701 AICH 704
>pdb|1T7P|A Chain A, T7 Dna Polymerase Complexed To Dna PrimerTEMPLATE,A
Nucleoside Triphosphate, And Its Processivity Factor
Thioredoxin
Length = 698
Score = 196 bits (499), Expect = 4e-049
Identities = 178/658 (27%), Positives = 271/658 (41%), Gaps = 139/658 (21%)
Query: 48 LWIGHNVIGFDIPQIKKLWG------YGIPLKDVADTLVMSRLLDPTRKG---------- 91
L + HN +D+P + KL + +P ++ DTLV+SRL+ K
Sbjct: 55 LIVFHNGHKYDVPALTKLAKLQLNREFHLPRENCIDTLVLSRLIHSNLKDTDMGLLRSGK 114
Query: 92 --------GHSLDALSGNEKIDFH--------------DFSTYTPEMLAYCKQDVAI--- 126
G+ L + G K DF ++ + EM+ Y QDV +
Sbjct: 115 LPGALEAWGYRLGEMKGEYKDDFKRMLEEQGEEYVDGMEWWNFNEEMMDYNVQDVVVTKA 174
Query: 127 ------NEKVYLQLKEELSNFGKAS--------IQLEHQMQAIVCEQEKNGFMLDTDIAE 172
++K Y + + ++ G + + +EH+ ++ +QE+NGF DT E
Sbjct: 175 LLEKLLSDKHYFPPEIDFTDVGYTTFWSESLEAVDIEHRAAWLLAKQERNGFPFDTKAIE 234
Query: 173 EIYTTCLRETNRIEAEIKEFMVPIAVPV----------------KEVIIKRKKDGSIYSN 216
E+Y + + ++ E P K IK K G I+
Sbjct: 235 ELYVELAARRSELLRKLTETFGSWYQPKGGTEMFCHPRTGKPLPKYPRIKTPKVGGIFKK 294
Query: 217 -------QLLEGCNVQ-------GDYTKIAWEEFNLGSPAQVNKRLDRLGWKPTVKTKSG 262
+ E C + YT + FN S + K+L GW PT T G
Sbjct: 295 PKNKAQREGREPCELDTREYVAGAPYTPVEHVVFNPSSRDHIQKKLQEAGWVPTKYTDKG 354
Query: 263 NSYKICPENLATIPDTAPE---AVKGLKAWKVLETRWKLAQE----WLQKSQETGRVHGR 315
+ E L + PE A+ +K + +++ R + E WL+ E G++HG
Sbjct: 355 APV-VDDEVLEGVRVDDPEKQAAIDLIKEYLMIQKRIGQSAEGDKAWLRYVAEDGKIHGS 413
Query: 316 VILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPE------GK- 368
V GAVT RA H PN+A IP V YG +CR AF GK
Sbjct: 414 VNPNGAVTGRATHAFPNLAQIPGV-----------RSPYGEQCRAAFGAEHHLDGITGKP 462
Query: 369 -LLVGTDAAGIQLRVLAHYM---NDPIYTEQVIDGDIHTFNKEALGRYCKDRPTAKTFIY 424
+ G DA+G++LR LAH+M ++ Y ++++GDIHT K + R AKTFIY
Sbjct: 463 WVQAGIDASGLELRCLAHFMARFDNGEYAHEILNGDIHT--KNQIAAELPTRDNAKTFIY 520
Query: 425 AFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKSQASQAA------------- 471
FL GAG I I+G + E F E P++ L+ Q
Sbjct: 521 GFLYGAGDEKIGQIVGAGKERGKELKKKFLENTPAIAALRESIQQTLVESSQWVAGEQQV 580
Query: 472 --SMGWMKGLDGRVLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGI------NF 523
W+KGLDGR + + S H AL+ LQ +I +L + + ++G+ +F
Sbjct: 581 KWKRRWIKGLDGRKVHVRSPHAALNTLLQSAGALICKLWIIKTEEMLVEKGLKHGWDGDF 640
Query: 524 KQCAWVHDEWQTEV-DEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
AWVHDE Q E+ AQ + E +A++ G + C +D E K+G NW H
Sbjct: 641 AYMAWVHDEIQVGCRTEEIAQVVIETAQEAMRWVGDHWNFRCLLDTEGKMGPNWAICH 698
>ref|NP_052094.1| (NC_001271) 5B protein [Bacteriophage phiYeO3-12]
emb|CAB63615.1| (AJ251805) 5B protein [Bacteriophage phiYeO3-12]
Length = 384
Score = 172 bits (437), Expect = 7e-042
Identities = 124/393 (31%), Positives = 187/393 (47%), Gaps = 54/393 (13%)
Query: 228 YTKIAWEEFNLGSPAQVNKRLDRLGWKPTVKTKSGNSYKICPENLATIPDTAPE---AVK 284
YT + + F S + K+L GW P T+ G + E L + PE +
Sbjct: 6 YTPVEFVTFKPSSRDHIQKKLQEAGWVPEKFTEKGAPV-VDDEVLEHVRVDDPEKQACID 64
Query: 285 GLKAWKVLETRWKLAQE----WLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVP 340
+K + +++ R A E WL+ Q+ G++HG V GAVT RA H PN+A +P V
Sbjct: 65 LIKEYLMIQKRIGQAAEGDKAWLRYVQDDGKIHGAVNPNGAVTGRATHSFPNLAQVPGV- 123
Query: 341 HGKDGILWKMEGMYGAECRQAFKVPE------GK--LLVGTDAAGIQLRVLAHYM---ND 389
YG +CR AF GK + G DA+G++LR LAH+M ++
Sbjct: 124 ----------RSPYGEQCRSAFGAEHHLDGITGKPWVQAGIDASGLELRCLAHFMARFDN 173
Query: 390 PIYTEQVIDGDIHTFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEA 449
Y ++++GDIHT N+ A +D AKTFIY FL GAG I I+G + E
Sbjct: 174 GEYANEILNGDIHTKNQNAAELPTRD--NAKTFIYGFLYGAGDEKIGQIVGAGKERGKEL 231
Query: 450 MANFYEAIPSLKKLK-----SQASQAASMG----------WMKGLDGRVLRIGSDHLALS 494
F E P++ L+ S + +A +G W+KGLDGR + + S H AL+
Sbjct: 232 KKKFLENTPAIAALREAIQQSLVNSSAWIGGEQKVQWKRRWIKGLDGRKVHVRSPHAALN 291
Query: 495 VYLQGGETVIMRLANVFWQRQAKKEGI------NFKQCAWVHDEWQTEV-DEDQAQRLGE 547
LQ +I +L + + ++G+ +F Q W+HDE Q E+ A+ + E
Sbjct: 292 TLLQSAGALICKLWIIKTEEMLIEKGLKHGWDGDFAQMGWIHDEVQIACRTEEVAKVVVE 351
Query: 548 IQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
+ +A++ G + C +D E KIG NW H
Sbjct: 352 VAQEAMRWVGEHWNFRCRLDTEGKIGPNWAVCH 384
>ref|NP_523322.1| (NC_003298) gene 5B protein [Bacteriophage T3]
emb|CAA35141.1| (X17255) 5B [Bacteriophage T3]
emb|CAC86285.1| (AJ318471) gene 5B protein [Bacteriophage T3]
Length = 372
Score = 171 bits (432), Expect = 3e-041
Identities = 123/385 (31%), Positives = 180/385 (45%), Gaps = 54/385 (14%)
Query: 236 FNLGSPAQVNKRLDRLGWKPTVKTKSGNSYKICPENLATIPDTAPE---AVKGLKAWKVL 292
FN S + K+L GW PT T G + E L + PE A+ +K + ++
Sbjct: 2 FNPSSRDHIQKKLQEAGWVPTKFTDKGAPV-VDDEVLEGVRVDDPEKQAAIDLIKEYLMI 60
Query: 293 ETRWKLAQE----WLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILW 348
+ R + E WL+ E G++HG V GAVT RA H PN+A IP V
Sbjct: 61 QKRIGQSAEGDKAWLRYVAEDGKIHGSVNPNGAVTGRATHAFPNLAQIPGV--------- 111
Query: 349 KMEGMYGAECRQAFKVPE------GK--LLVGTDAAGIQLRVLAHYM---NDPIYTEQVI 397
YG +CR AF GK + G DA+G++LR LAH+M ++ Y +++
Sbjct: 112 --RSPYGEQCRAAFGAEHHLDGITGKPWVQAGIDASGLELRCLAHFMARFDNGEYAHEIL 169
Query: 398 DGDIHTFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAI 457
+GDIHT N+ A +D AKTFIY FL GAG I I+G + E F E
Sbjct: 170 NGDIHTKNQMAAELPTRD--NAKTFIYGFLYGAGDEKIGQIVGAGKERGKELKKKFLENT 227
Query: 458 PSLKKLKSQASQAA---------------SMGWMKGLDGRVLRIGSDHLALSVYLQGGET 502
P++ L+ Q W+KGLDGR + + S H AL+ LQ
Sbjct: 228 PAIAALRESIQQTLVESSQWVAGEQQVKWKRRWIKGLDGRKVHVRSPHAALNTLLQSAGA 287
Query: 503 VIMRLANVFWQRQAKKEGI------NFKQCAWVHDEWQTEV-DEDQAQRLGEIQVQAIKD 555
+I +L + + ++G+ +F AW+HDE Q E+ A+ + E+ +A++
Sbjct: 288 LICKLWIIKTEEMLVEKGLKHGWDGDFAYMAWIHDEIQVACRTEEIAKTVIEVAQEAMRW 347
Query: 556 AGTFFKLNCPMDGEAKIGKNWLETH 580
G + C +D E K+G NW E H
Sbjct: 348 VGEHWNFRCLLDTEGKMGANWKECH 372
>ref|NP_229419.1| (NC_000853) DNA-directed DNA polymerase I [Thermotoga maritima]
pir||E72232 DNA-directed DNA polymerase I - Thermotoga maritima (strain MSB8)
gb|AAD36686.1|AE001805_11 (AE001805) DNA-directed DNA polymerase I [Thermotoga maritima]
Length = 893
Score = 113 bits (282), Expect = 6e-024
Identities = 142/625 (22%), Positives = 267/625 (42%), Gaps = 109/625 (17%)
Query: 6 DIETDALD---ATVIHVLVAKRVGQKGFYV---------VRDAETFKRLAKQVTLWIGHN 53
D+ET +LD ++ + V+ + ++ +Y+ + + E K+L K++ G
Sbjct: 323 DLETSSLDPFDCDIVGISVSFKP-KEAYYIPLHHRNAQNLDEKEVLKKL-KEILEDPGAK 380
Query: 54 VIGFDIP-QIKKLWGYGI-PLKDVADTLVMSRLLDPTRKGGHSLDALS----GNEKIDFH 107
++G ++ K L G+ P+ DT++ + LL+P K +LD L+ G + +
Sbjct: 381 IVGQNLKFDYKVLMVKGVEPVPPYFDTMIAAYLLEPNEKK-FNLDDLALKFLGYKMTSYQ 439
Query: 108 DFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGFMLD 167
+ +++ + + DV + E+ +N+ CE D
Sbjct: 440 ELMSFSFPLFGFSFADVPV---------EKAANYS--------------CE--------D 468
Query: 168 TDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCNVQ-- 225
DI +Y T + + EA+++ I +P+ V+ + + +G + L+ + +
Sbjct: 469 ADITYRLYKTLSLKLH--EADLENVFYKIEMPLVNVLARMELNGVYVDTEFLKKLSEEYG 526
Query: 226 -------GDYTKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKSGNSYKI---CPENLAT 274
+ +IA E FN+ SP QV++ L ++LG KP KT Y E LA
Sbjct: 527 KKLEELAEEIYRIAGEPFNINSPKQVSRILFEKLGIKPRGKTTKTGDYSTRIEVLEELAG 586
Query: 275 IPDTAPEAVKGLKAWKVLETRWKLAQEWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMA 334
+ P ++ K K+ T + + + +TGR+H TG T R + PN+
Sbjct: 587 EHEIIPLILEYRKIQKLKSTYIDALPKMV--NPKTGRIHASFNQTGTATGRLSSSDPNLQ 644
Query: 335 NIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGK--LLVGTDAAGIQLRVLAHYMNDPIY 392
N+P+ + G E R+A VP+ +V D + I+LR+LAH D
Sbjct: 645 NLPT------------KSEEGKEIRKAI-VPQDPNWWIVSADYSQIELRILAHLSGDENL 691
Query: 393 TEQVIDG-DIHTFNKEALGRYCKDRPT------AKTFIYAFLLGAGTGMIASILGCNNRQ 445
+G D+HT + + T K ++ + G ++ LG ++
Sbjct: 692 LRAFEEGIDVHTLTASRIFNVKPEEVTEEMRRAGKMVNFSIIYGVTPYGLSVRLGVPVKE 751
Query: 446 ANEAMANFYEAIPSLKK-LKSQASQAASMGWMKGLDGRVLRIGS------------DHLA 492
A + + N++ P ++ ++ S+A G+++ L GR I + +A
Sbjct: 752 AEKMIVNYFVLYPKVRDYIQRVVSEAKEKGYVRTLFGRKRDIPQLMARDRNTQAEGERIA 811
Query: 493 LSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQA 552
++ +QG I++LA + R+ K+ + K VHDE EV ++ L V+
Sbjct: 812 INTPIQGTAADIIKLAMIEIDRELKERKMRSKMIIQVHDELVFEVPNEEKDAL----VEL 867
Query: 553 IKDAGT-FFKLNCPMDGEAKIGKNW 576
+KD T KL+ P++ + IGK W
Sbjct: 868 VKDRMTNVVKLSVPLEVDVTIGKTW 892
>gb|AAD28505.1|AF121780_1 (AF121780) DNA polymerase I [Rhodothermus marinus]
Length = 924
Score = 110 bits (274), Expect = 5e-023
Identities = 151/627 (24%), Positives = 256/627 (40%), Gaps = 121/627 (19%)
Query: 9 TDALDATVIHVLVAKRVGQKGFYVVRD------AET-FKRLA---KQVTLWIGHNVIGFD 58
T+A+ A+++ + + GQ G+YV ET +RLA ++ +G N + +D
Sbjct: 364 TEAMWASLVGIAFSWEKGQ-GYYVPTPLPDGTPTETVLERLAPILRRAQRKVGQN-LKYD 421
Query: 59 IPQIKKLWGYGIPLKDVADTLVMSRLLDPTRKGGHSLDAL-----------------SGN 101
+ + + G +P DT+V L+ P H+LD L SG
Sbjct: 422 LVVLAR-HGVQVP-PPYFDTMVAHYLIAPEEP--HNLDVLARQYLRYQMVSITELIGSGR 477
Query: 102 EKIDFHDFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASI--QLEHQMQAIVCEQ 159
++ D S E+ Y +D I ++ L EL G I ++E + ++ +
Sbjct: 478 DQKSMRDVSI--DEVGPYACEDTDIALQLADVLAAELDRHGLRHIAEEMEFPLIEVLADM 535
Query: 160 EKNGFMLDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLL 219
E+ G +D + EI ++EAE+ E V I
Sbjct: 536 ERTGICIDRAVLREI-------GKQLEAELHELEVKIY---------------------- 566
Query: 220 EGCNVQGDYTKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKSG--NSYKICPENLATIP 276
++A EFN+GSP Q+ L +LG KP +T +G ++ + + LAT
Sbjct: 567 ----------EVAGVEFNIGSPQQLADVLFKKLGLKPRARTSTGRPSTKESVLQELAT-- 614
Query: 277 DTAPEAVKGLKA-WKVLETRWKLAQEWLQK--SQETGRVHGRVILTGAVTHRAAHQGPNM 333
+ GL W+ L + L+ ETGR+H T T R + PN+
Sbjct: 615 ---QHPLPGLILDWRHLAKLKSTYVDGLEPLIHPETGRIHTTFNQTVTATGRLSSSNPNL 671
Query: 334 ANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYT 393
NIP + G E R+AF G L+ D I+LR+LA D
Sbjct: 672 QNIP------------VRTEMGREIRRAFVPRPGWKLLSADYVQIELRILAALSGDEALR 719
Query: 394 EQVIDG-DIHTFNKEALGRYCKDRPT------AKTFIYAFLLGAGTGMIASILGCNNRQA 446
++G DIHT + + ++ T AK Y G +A L C+ R+A
Sbjct: 720 RAFLEGQDIHTATAARVFKVPPEQVTPEQRRRAKMVNYGIPYGISAWGLAQRLRCSTREA 779
Query: 447 NEAMANFYEAIPSLKK-LKSQASQAASMGWMKGLDGR------------VLRIGSDHLAL 493
E + + A P + + L +A G+++ L GR R ++ +A+
Sbjct: 780 QELIEEYQRAFPGVTRYLHRVVEEARQKGYVETLLGRRRYVPNINSRNRAERSMAERIAV 839
Query: 494 SVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAI 553
++ +QG + +++LA V + K+EG K VHDE E+ ++ + + ++ Q +
Sbjct: 840 NMPIQGTQADMIKLAMVHIYHRLKREGYRAKMLLQVHDELVFEMPPEEVEPVRQLVEQEM 899
Query: 554 KDAGTFFKLNCPMDGEAKIGKNWLETH 580
K A P++ + +G NWL+ H
Sbjct: 900 KQALPL--EGVPIEVDIGVGDNWLDAH 924
>ref|NP_254180.1| (NC_002516) DNA polymerase I [Pseudomonas aeruginosa]
pir||F82958 DNA polymerase I PA5493 [imported] - Pseudomonas aeruginosa
(strain PAO1)
gb|AAG08878.1|AE004962_2 (AE004962) DNA polymerase I [Pseudomonas aeruginosa]
Length = 913
Score = 109 bits (272), Expect = 9e-023
Identities = 134/550 (24%), Positives = 230/550 (41%), Gaps = 92/550 (16%)
Query: 70 IPLKDVA-DTLVMSRLLDPTRKGGHSLDALS----GNEKIDFHDFSTYTPEMLAYCKQDV 124
I L+ VA DT++ S +LD T H +D+L+ G+ I F D +
Sbjct: 417 IALRGVAYDTMLESYVLDSTATR-HDMDSLALKYLGHSTIRFEDIA-------------- 461
Query: 125 AINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLRETNR 184
GK + QL AI EQ D D+ ++ ++
Sbjct: 462 -----------------GKGAKQLTFDQIAI--EQAGPYAAEDADVTLRLHQALWQKLEA 502
Query: 185 IEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLL---------EGCNVQGDYTKIAWEE 235
I + + + I +P+ V+ + +++G++ LL + ++ +A +E
Sbjct: 503 IPS-LARVLTDIEMPLVPVLARIERNGALVDANLLGIQSRELGEKMVALERQAYDLAGQE 561
Query: 236 FNLGSPAQVNKRL-DRLGWKPTVKTKSG--NSYKICPENLATIPDTAPEAVKGLKAWKVL 292
FNLGSP Q+ L D+LG KT +G ++ + LA P+ + ++ L
Sbjct: 562 FNLGSPKQLGAILYDKLGLPVLSKTATGQPSTAEAVLAELAEQDFELPKVIMQYRSMSKL 621
Query: 293 ETRW--KLAQEWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKM 350
++ + +L + Q + TGR+H A T R + PN+ NIP +
Sbjct: 622 KSTYTDRLPE---QINPRTGRIHTSYHQAVAATGRLSSSDPNLQNIP------------I 666
Query: 351 EGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQV-IDGDIHTFN-KEA 408
G RQAF P+G L+ D + I+LR++AH D + D D+H E
Sbjct: 667 RTAEGRRIRQAFVAPQGYKLLAADYSQIELRIMAHLAKDDGLLDAFRHDLDVHRATAAEV 726
Query: 409 LGRYCKD-----RPTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPS-LKK 462
G +D R +AK + + G +A +G ++A + ++ P L
Sbjct: 727 FGVPLEDVSGDQRRSAKAINFGLIYGMSAFGLAKQIGVERKEAQAYIDRYFARYPGVLAY 786
Query: 463 LKSQASQAASMGWMKGLDGR------------VLRIGSDHLALSVYLQGGETVIMRLANV 510
++ +QAA G+++ L GR +R ++ A++ +QG IM+ A V
Sbjct: 787 MERTRAQAAEQGFVETLFGRRLYLPEIHSKNGAMRKAAERTAINAPMQGTAADIMKRAMV 846
Query: 511 FWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGEA 570
++ G++ + VHDE EV ED +++ E + A T L+ P+ EA
Sbjct: 847 AVDNWLQESGLDARVILQVHDELVLEVREDLVEQVCEGIRPLMSGAAT---LDVPLVVEA 903
Query: 571 KIGKNWLETH 580
+G NW E H
Sbjct: 904 GVGSNWDEAH 913
>gb|AAG43104.1|AF038544_1 (AF038544) DNA polymerase I [Brachyspira hyodysenteriae]
Length = 920
Score = 108 bits (269), Expect = 2e-022
Identities = 152/640 (23%), Positives = 265/640 (40%), Gaps = 133/640 (20%)
Query: 3 VVFDIETDALDA---TVIHVLVAKRVGQKGFYVVRDAET-----------FKRLAKQVTL 48
V D ET LD T+I + A + + FY+ T F L K+
Sbjct: 352 VCIDFETTGLDTFKDTIIGISFAIK-SNEAFYLDLSGRTKIGKDKCMNLVFDTLEKEDIK 410
Query: 49 WIGHNVIGFDIPQIKKLWGYGIPLKDVA-DTLVMSRLLDPTRKGGHSLDALSGNEKIDFH 107
IGHN + ++ +K + G + ++ DT+V + L++PTR G +++D
Sbjct: 411 VIGHN-LKYEYKMMKAI---GKSIGNMYFDTMVAAYLINPTR-GRYNMD----------- 454
Query: 108 DFSTYTPEMLAYCKQDVAINEKVYLQLK-EELSNFGKASIQLEHQMQAIV---CEQEKNG 163
D+AI Y +K +L++ K ++ L+ ++ +V CE
Sbjct: 455 ---------------DLAIAYLSYNTIKYSDLTDNAKKTL-LDAPLKDVVEYACE----- 493
Query: 164 FMLDTDIAEEIYTTC--LRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEG 221
D D+ Y L +T+ +E E I +P+ V+ + DG S++ ++
Sbjct: 494 ---DADVTFRFYECFAPLLKTHNLE----ELFFNIEMPLVSVLADMEFDGVYISSERMKA 546
Query: 222 CNVQGDYTKI-----------AWEEFNLGSPAQVNKRL-DRLGWKPTVKTKSGNSYKICP 269
+ +Y+ + A EEFNL SP Q+ L D++G PT KTK+G S
Sbjct: 547 --LSDEYSSLLEKTKEKIYAEAGEEFNLQSPKQLEYILFDKMGIPPTKKTKTGFS----- 599
Query: 270 ENLATIPDTAPEAVKGLKAWKVLETRWKLAQ---EWLQK-----SQETGRVHGRVILTGA 321
T + E + K + + T K A+ +L +++TGR+H T
Sbjct: 600 ----TDEEVLTELSQKYKIAEYMLTYRKYAKLKNTYLDVFPTLINEKTGRIHASFNQTVT 655
Query: 322 VTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLR 381
T R + PN+ NIP + G G E R F +G LL+ D + I+LR
Sbjct: 656 ATGRLSSSEPNLQNIP------------VRGEEGREIRNTFIPEKGNLLIAADYSQIELR 703
Query: 382 VLAHYMNDPIYTEQVIDG-DIHTFNKEALGRYCKDRPT------AKTFIYAFLLGAGTGM 434
+LAH+ NDP E + DIH + K+ T AK ++ + G
Sbjct: 704 LLAHFSNDPALVEAFKNNDDIHRKTAMKIYSVSKEHVTASMRNIAKIINFSIIYGKTAFG 763
Query: 435 IASILGCNNRQANEAMANFYEAIPSLKKLKSQ-ASQAASMGWMKGLDGRV---------- 483
++ LG ++A + + + +K + + + G+++ + GR+
Sbjct: 764 LSKELGIKKKEAEDFIKGYSSTYSRVKPFCDEVVEEVRNKGFVRTMCGRIRDLSKTINSS 823
Query: 484 ---LRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDED 540
R ++ +AL+ +QG ++++A V ++ K K VHDE EV E
Sbjct: 824 NAMARNEAERMALNTLIQGSAADMIKVAMVAIHKEFKNHLKTAKIVMQVHDELVVEVSEA 883
Query: 541 QAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
+A + I + ++ + K N P+ + G +W E H
Sbjct: 884 EADKAMTIMKEIMEHS---VKANVPIIVDIHKGNSWGEVH 920
>gb|AAG43102.1|AF038542_1 (AF038542) DNA polymerase I [Pseudomonas fluorescens]
Length = 939
Score = 107 bits (267), Expect = 3e-022
Identities = 119/472 (25%), Positives = 196/472 (41%), Gaps = 53/472 (11%)
Query: 142 GKASIQLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVK 201
GK + QL A+ EQ D D+ ++ T + + I + + + I +P+
Sbjct: 488 GKGAKQLTFDQIAL--EQAGPYAAEDADVTLRLHQTLFEKLSAIPS-LASVLTDIEIPLV 544
Query: 202 EVIIKRKKDGSIYSNQLL---------EGCNVQGDYTKIAWEEFNLGSPAQVNKRL-DRL 251
V+ + ++ G+ LL + ++ + +IA EEFNLGSP Q+ L ++L
Sbjct: 545 PVLARIERQGAFVDADLLGIQSIELGNKMVALEREAFEIAGEEFNLGSPKQLGVILYEKL 604
Query: 252 GWKPTVKTKSGNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQEWL--QKSQET 309
G KT G E LA + + K L + + + L Q + T
Sbjct: 605 GLPVLKKTAKGQP-STAEEVLAKLAEDDHRLPKVLMEHRSMSKLKSTYTDRLPEQINPRT 663
Query: 310 GRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKL 369
GR+H A T R + PN+ NIP + G RQAF P+G
Sbjct: 664 GRIHTSYHQAVASTGRLSSSDPNLQNIP------------VRTAEGRRIRQAFVAPKGYK 711
Query: 370 LVGTDAAGIQLRVLAHYMNDPIYTEQVIDG-DIHTFN-----KEALGRYCKD-RPTAKTF 422
L+ D + I+LR++AH D + D+HT K L D R AK
Sbjct: 712 LLAADYSQIELRIMAHLSRDEGLMNAFRNNLDVHTATAAEVFKVELNEVTSDQRRGAKAI 771
Query: 423 IYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKSQA-SQAASMGWMKGLDG 481
+ + G G + +G + + A + ++ P +++ + +QAA G+++ + G
Sbjct: 772 NFGLIYGMGAQKLGKDIGVDTKTAKAYIDTYFARYPGVREYMDRTRAQAADQGFVETIFG 831
Query: 482 RVL------------RIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWV 529
R L R ++ A++ +QG I++ A V G++ K V
Sbjct: 832 RRLYLPDINSNKPQERAAAERTAINAPMQGTAADIIKKAMVAVDNWLCASGLDAKVILQV 891
Query: 530 HDEWQTEVDEDQAQRL-GEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
HDE EV ED ++ EI+V + A KL+ P+ E +G NW E H
Sbjct: 892 HDELVLEVREDLVDQVREEIRVHMSEAA----KLDVPLLVEVGVGNNWDEAH 939
>sp|Q9RAA9|DPO1_RICFE DNA polymerase I (POL I)
emb|CAB56067.1| (AJ238763) DNA polymerase I [Rickettsia felis]
Length = 922
Score = 104 bits (260), Expect = 2e-021
Identities = 101/415 (24%), Positives = 181/415 (43%), Gaps = 45/415 (10%)
Query: 196 IAVPVKEVIIKRKKDG----SIYSNQL-----LEGCNVQGDYTKIAWEEFNLGSPAQVNK 246
I +P+ ++ K +K G + Y NQL E ++ + ++ +FN+GSP Q+ +
Sbjct: 523 IDLPICFILDKMEKIGIKVDANYLNQLSAEFGAEILKLEEEIFALSGTKFNIGSPKQLGE 582
Query: 247 RLDRLGWKPTVKTKS-GNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQEWLQK 305
L P K + +SY E L + + L W+ L + L K
Sbjct: 583 ILFEKMQLPFGKASAKASSYSTGAEILEKLSEHGYNIADLLLRWRQLTKLKNTYTDSLPK 642
Query: 306 SQE--TGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFK 363
+ T RVH + T T R + Q PN+ N+P + G + RQAF
Sbjct: 643 QIDNITHRVHTTFLQTSTTTGRLSSQEPNLQNVP------------IRSSEGNKIRQAFI 690
Query: 364 VPEGKLLVGTDAAGIQLRVLAHYMN-DPIYTEQVIDGDIHT------FNKEALGRYCKDR 416
EG L+ D + I+LR+L+H N D + + DIHT FN + + R
Sbjct: 691 AEEGYKLISADYSQIELRILSHIANIDALKQAFINKDDIHTQTACQIFNLQKHELTSEHR 750
Query: 417 PTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKSQASQ-AASMGW 475
AK + + G +A L +N +A+E + ++ +++ Q A+S G+
Sbjct: 751 RKAKAINFGIIYGISAFGLAKQLNVSNGEASEYIKKYFAEYKGVQEYMEQTKAFASSNGY 810
Query: 476 MKGLDGR----------VLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQ 525
+ GR L+ ++ A++ +QG I+++A + ++ +K + +
Sbjct: 811 VINFFGRKCFVPLIHDKKLKQFAERAAINAPIQGTNADIIKIAMINLDQEIEKNNLKTRL 870
Query: 526 CAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
+HDE EV ED+ + + I ++ I + T ++ P+ E ++G NW+E H
Sbjct: 871 VLQIHDELLFEVPEDEVELVTPI-IKKIMENST--NMDVPIITEIRVGNNWMEIH 922
>ref|NP_283286.1| (NC_003116) DNA polymerase I [Neisseria meningitidis Z2491]
pir||F81963 DNA-directed DNA polymerase (EC 2.7.7.7) I NMA0462 [imported] -
Neisseria meningitidis (group A strain Z2491)
emb|CAB83758.1| (AL162753) DNA polymerase I [Neisseria meningitidis Z2491]
Length = 938
Score = 103 bits (258), Expect = 4e-021
Identities = 149/641 (23%), Positives = 257/641 (39%), Gaps = 125/641 (19%)
Query: 6 DIET---DALDATVIHVLVAKRVGQKGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIPQI 62
D ET DA++A+++ + +A + G+ + V + T + +G + P +
Sbjct: 357 DTETTSLDAMNASLVGISIAFQAGEAVYIPVGHSLTAAPEQLDLQDVLGRLKPHLENPAL 416
Query: 63 KKLW-----------GYGIPLKDVA-DTLVMSRLLDPTRKGGHSLDALS----GNEKIDF 106
KK+ YGI L +A D ++ S +++ GH LD LS G E I +
Sbjct: 417 KKIGQNLKYDQHVFANYGIALNGIAGDAMLASYIIESHL--GHGLDELSERWLGLETITY 474
Query: 107 HDFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGFML 166
+ +++ DVAI Q E + +++LE ++A
Sbjct: 475 ESLCGKGAKQISFA--DVAIG-----QATEYAAQDADFALRLEAHLRA------------ 515
Query: 167 DTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLL------- 219
+++A+ E + +PV +V+ + +++G L
Sbjct: 516 -----------------QMDAKQLEMYEKMELPVAQVLFEMERNGVQIDRAELARQSAEL 558
Query: 220 --EGCNVQGDYTKIAWEEFNLGSPAQVNKRL-DRLG--WKPTVKTKSGNSYKICPENLAT 274
E ++ + A + FNL SP Q+ + L D++G K KT G N A
Sbjct: 559 GAELMKLEQEAYAAAGQPFNLNSPKQLQEILFDKMGIPTKGLKKTAKGG----ISTNEAV 614
Query: 275 IPDTAPE------AVKGLKAWKVLETRWKLAQEWLQKSQETGRVHGRVILTGAVTHRAAH 328
+ AP+ ++ K+ T E + S + GRVH A+T R A
Sbjct: 615 LEQLAPDYPLPKIILQNRSLAKLKSTYTDKLPEMI--SPKDGRVHTTYAQAVAITGRLAS 672
Query: 329 QGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMN 388
PN+ NIP + G R+AF P+G ++V D + I+LR++AH
Sbjct: 673 NNPNLQNIP------------IRTAEGRRVRRAFTAPQGSVIVSADYSQIELRIMAHLSG 720
Query: 389 DPIYTEQVIDG-DIH------TFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGC 441
D +G D+H F + R AKT + + G G +A LG
Sbjct: 721 DKTLIAAFQNGEDVHRRTAAEVFGIAPENVSSEQRRYAKTINFGLIYGMGQYGLAKSLGI 780
Query: 442 NNRQANEAMANFYEAIPSLKK-LKSQASQAASMGWMKGLDGRVL------------RIGS 488
+N A + ++ P + + ++ QAA+ G+++ L GR L R G+
Sbjct: 781 DNISAKNFIDRYFARYPGVAEYMQRTKEQAAAQGFVETLFGRRLYLPDIRNKNTNARAGA 840
Query: 489 DHLALSVYLQGGETVIMRLANVFWQRQAKK-------EGINFKQCAWVHDEWQTEVDEDQ 541
+ A++ +QG + +++ A + R + E + K VHDE EV E +
Sbjct: 841 ERAAINAPMQGTASDLIKRAMIDVSRWLSECEASPWDELLQSKLIMQVHDELVLEVVETE 900
Query: 542 AQRLGE--IQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
+ E Q+ A D G L+ P+ E +G+NW E H
Sbjct: 901 LDFVKEKLPQIMAKVDGGL---LDVPLVAEVGVGENWEEAH 938
>gb|AAG43148.1|AF063186_1 (AF063186) DNA polymerase I [Rhodococcus erythropolis]
Length = 920
Score = 103 bits (257), Expect = 5e-021
Identities = 105/423 (24%), Positives = 179/423 (41%), Gaps = 49/423 (11%)
Query: 188 EIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGC---------NVQGDYTKIAWEEFNL 238
E+ + I VPV V+ + +++G L+ N++ +IA E FN+
Sbjct: 517 ELVNILHNIEVPVARVLTQMEENGIELDLAFLDQLGGEFSNTMQNLENQIMEIAGESFNV 576
Query: 239 GSPAQVNKRL-DRLGWKPTVKTKSGNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWK 297
SP QV + L ++LG K KT +G Y L I + + + L++ +
Sbjct: 577 SSPKQVGEVLFEKLGLKGGKKTTTGQ-YSTSESVLEKIEHPIAQLILEYRGLSKLKSTYT 635
Query: 298 LAQEWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAE 357
Q + +T RVH T R + PN+ NIP + G +
Sbjct: 636 DGL-CKQANPDTHRVHTSYHQALTATGRLSSTDPNLQNIP------------IRAEIGRQ 682
Query: 358 CRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVIDG-DIHTFN-KEALGRYCKD 415
R+AF PEG++L+ D + I+LR++AH D + I G D+H E LG ++
Sbjct: 683 IRKAFVAPEGRVLLAADYSQIELRLMAHLSQDEALLDAFIHGQDVHRRTAAEVLGIPLEE 742
Query: 416 -----RPTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKSQASQA 470
R AK + L G + LG +++ + + ++ P + + Q
Sbjct: 743 VTNDQRRQAKAVNFGLLYGMSEFGLIRQLGFTRQESQDYIKQYFHRYPGIYDYMQRTRQV 802
Query: 471 A-SMGWMKGLDGR------------VLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAK 517
A G+++ L GR ++R ++ A++ LQG I+++A + +
Sbjct: 803 ALEQGFVETLLGRRLYTPDIDARNMMVRKAAERAAINAPLQGSAADIIKMAMIEVDKMLP 862
Query: 518 KEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWL 577
K+ K VHDE EVDED A L + ++ + +++ P+ E GKNW
Sbjct: 863 KD--QAKMLLQVHDELVFEVDEDIADELAPKLAEVMQ---SVLQISVPLVVEVGKGKNWD 917
Query: 578 ETH 580
E H
Sbjct: 918 EAH 920
>ref|NP_274975.1| (NC_003112) DNA polymerase I [Neisseria meningitidis MC58]
pir||G81020 DNA polymerase I NMB1982 [imported] - Neisseria meningitidis (group
B strain MD58)
gb|AAF42310.1| (AE002546) DNA polymerase I [Neisseria meningitidis MC58]
Length = 938
Score = 102 bits (254), Expect = 1e-020
Identities = 149/633 (23%), Positives = 259/633 (40%), Gaps = 109/633 (17%)
Query: 6 DIET---DALDATVIHVLVAKRVGQKGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIPQI 62
D ET DA++A+++ + +A + G+ + V + T + +G P +
Sbjct: 357 DTETTSLDAMNASLVGISIAFQAGEAVYIPVGHSLTAAPEQLDLQDVLGRLKPHLGNPAL 416
Query: 63 KKLW-----------GYGIPLKDVA-DTLVMSRLLDPTRKGGHSLDALS----GNEKIDF 106
KK+ YGI L +A D ++ S +++ GH LD LS G E I +
Sbjct: 417 KKIGQNLKYDQHVFANYGIALNGIAGDAMLASYIIESHL--GHGLDELSERWLGLETITY 474
Query: 107 HDFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGF-M 165
+ + + DVAI Q E + +++LE ++A + E++ +
Sbjct: 475 ESLCGKGAKQIGFA--DVAIG-----QATEYAAQDADFALRLEAHLRAQMDEKQLEMYEK 527
Query: 166 LDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCNVQ 225
++ +A+ ++ E N ++ + E A E++ K + Y+
Sbjct: 528 MELPVAQVLFEM---ERNGVQIDRAELARQSAELGAELM---KLEQEAYA---------- 571
Query: 226 GDYTKIAWEEFNLGSPAQVNKRL-DRLG--WKPTVKTKSGNSYKICPENLATIPDTAPE- 281
A + FNL SP Q+ + L D++G K KT G N A + AP+
Sbjct: 572 -----AAGQPFNLNSPKQLQEILFDKMGIPTKGLKKTAKGG----ISTNEAVLEQLAPDY 622
Query: 282 -----AVKGLKAWKVLETRWKLAQEWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANI 336
++ K+ T E + S + GRVH A+T R A PN+ NI
Sbjct: 623 PLPKIILQNRSLAKLKSTYTDKLPEMI--SPKDGRVHTTYAQAVAITGRLASNNPNLQNI 680
Query: 337 PSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQV 396
P + G + R+AF P+G ++V D + I+LR++AH D
Sbjct: 681 P------------IRTEEGRKVRRAFTAPQGSVIVSADYSQIELRIMAHLSGDKTLIAAF 728
Query: 397 IDG-DIH------TFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEA 449
+G D+H F + R AK+ + + G G +A LG +N A
Sbjct: 729 QNGEDVHRRTAAEVFGTAPENVSSEQRRYAKSINFGLIYGMGQYGLAKSLGIDNLSAKNF 788
Query: 450 MANFYEAIPSLKK-LKSQASQAASMGWMKGLDGRVL------------RIGSDHLALSVY 496
+ ++ P + + ++ QAA+ G+++ L GR L R G++ A++
Sbjct: 789 IDRYFARYPGVAEYMQRTKEQAAAQGYVETLFGRRLYLPDIRNKNANARAGAERAAINAP 848
Query: 497 LQGGETVIMRLANVFWQRQAKK-------EGINFKQCAWVHDEWQTEVDEDQAQRLGE-- 547
+QG + +++ A + R + E + K VHDE EV E + + E
Sbjct: 849 MQGTASDLIKRAMIDVSRWLSECEASPWDELLQSKLIMQVHDELVLEVVETELDFVKEKL 908
Query: 548 IQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
Q+ A D G L+ P+ E +G+NW E H
Sbjct: 909 PQIMAKVDGGL---LDVPLVAEVGVGENWEEAH 938
>sp|O52225|DPO1_THEFI DNA POLYMERASE I, THERMOSTABLE (TFI POLYMERASE 1)
gb|AAC46079.1| (AF030320) thermostable DNA polymerase [Thermus filiformis]
Length = 833
Score = 101 bits (252), Expect = 2e-020
Identities = 107/431 (24%), Positives = 188/431 (42%), Gaps = 56/431 (12%)
Query: 174 IYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCNVQGDYTKIAW 233
+Y R +R+ A ++ V + VP+ E + S + +E ++G+ ++A
Sbjct: 429 LYQEVERPLSRVLAHMEARGVRLDVPLLEAL-------SFELEKEME--RLEGEVFRLAG 479
Query: 234 EEFNLGSPAQVNKRL-DRLGWKPTVKTKSGNSYKICPENLATIPDTAPEAVKGLKAWKVL 292
FNL S Q+ + L D LG P +T+ L + P V+ + ++ L
Sbjct: 480 HPFNLNSRDQLERVLFDELGLTPVGRTEKTGKRSTAQGALEALRGAHP-IVELILQYREL 538
Query: 293 ETRWKLAQEWLQK-----SQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGIL 347
KL +L TGR+H R T T R + PN+ NIP
Sbjct: 539 S---KLKSTYLDPLPRLVHPRTGRLHTRFNQTATATGRLSSSDPNLQNIP---------- 585
Query: 348 WKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVIDG-DIHT--- 403
+ G R+AF EG LL+ D + I+LRVLAH D +G DIHT
Sbjct: 586 --VRTPLGQRIRKAFVAEEGWLLLAADYSQIELRVLAHLSGDENLKRVFREGKDIHTETA 643
Query: 404 ---FNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSL 460
F + K R AKT + L G ++ LG + ++A + ++++ P +
Sbjct: 644 AWMFGLDPALVDPKMRRAAKTVNFGVLYGMSAHRLSQELGIDYKEAEAFIERYFQSFPKV 703
Query: 461 KK-LKSQASQAASMGWMKGLDG------------RVLRIGSDHLALSVYLQGGETVIMRL 507
+ ++ + + G+++ L G R +R ++ +A ++ +QG +M++
Sbjct: 704 RAWIERTLEEGRTRGYVETLFGRRRYVPDLASRVRSVREAAERMAFNMPVQGTAADLMKI 763
Query: 508 ANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTFFKLNCPMD 567
A V + K G + VHDE EV ED+A+ + + +++A + L+ P++
Sbjct: 764 AMVKLFPRLKPLGAHL--LLQVHDELVLEVPEDRAEEAKALVKEVMENA---YPLDVPLE 818
Query: 568 GEAKIGKNWLE 578
E +G++WLE
Sbjct: 819 VEVGVGRDWLE 829
>gb|AAF24859.2|AF050678_1 (AF050678) DNA polymerase I [Thermomicrobium roseum]
Length = 917
Score = 99.4 bits (246), Expect = 9e-020
Identities = 97/373 (26%), Positives = 168/373 (45%), Gaps = 34/373 (9%)
Query: 230 KIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKSGNSYKICPENLATIPDTAP--EAVKGL 286
++A FN+ SP Q++ L D LG +TK+G Y + E L + DT P EA+
Sbjct: 545 ELAGHPFNITSPQQLSTLLFDELGLPRGKRTKTG--YSVSQEVLENLRDTHPIVEAILEY 602
Query: 287 KAWKVLETRWKLAQEWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGI 346
+ L++ + A Q +TGRVH T A T R + PN+ NIP+ G+ G+
Sbjct: 603 RQLLKLKSTYVDALP-RQVHPQTGRVHTIFHQTVAATGRLSSSDPNLQNIPA--RGELGL 659
Query: 347 LWKMEGMYGAECRQAFKVP-EGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVIDG-DIHTF 404
+ + A+ R +++ E LL+ D + I+LR++AH+ DP +G DIH
Sbjct: 660 A--VRRAFIADNRPGYRIADEPILLLSADYSQIELRLMAHFSQDPALLRAFAEGKDIHAA 717
Query: 405 NKEALGRYCKDRPT------AKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIP 458
+ D T AK + + G +A G + + A + +++ P
Sbjct: 718 TASEVFGVPLDAVTPEMRRIAKVVNFGIMYGMQAYGLARDTGMSRQDAQRFIEAYFQRFP 777
Query: 459 SLKK-LKSQASQAASMGWMKGLDGR------------VLRIGSDHLALSVYLQGGETVIM 505
+ + L+ +AA +G+++ L GR R ++ +A+++ LQG IM
Sbjct: 778 GVARYLEETRRRAAELGYVETLFGRRRYLPEITSSNPARRQAAERMAVNMPLQGTAADIM 837
Query: 506 RLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTFFKLNCP 565
+L + R + G+ + VHDE EV E + E+ + + + P
Sbjct: 838 KLVMIAVHRPLGERGLRSRMLLQVHDELVLEVPESELATTTELVTTRM---SRVVEPSVP 894
Query: 566 MDGEAKIGKNWLE 578
++ +AK G NW +
Sbjct: 895 LEVDAKAGPNWAD 907
>pdb|1KFD| Dna Polymerase I (Klenow Fragment) (E.C.2.7.7.7) Complexed With
Dctp
pdb|1DPI| DNA Polymerase I (Klenow Fragment) (E.C.2.7.7.7) - dCMP Complex
Length = 605
Score = 97.8 bits (242), Expect = 3e-019
Identities = 147/630 (23%), Positives = 254/630 (39%), Gaps = 109/630 (17%)
Query: 5 FDIETDALD---ATVIHVLVAKRVGQKGFYVV--------------RDAETFKRLAK-QV 46
FD ETD+LD A ++ + A G + V R E K L + +
Sbjct: 31 FDTETDSLDNISANLVGLSFAIEPGVAAYIPVAHDYLDAPDQISRERALELLKPLLEDEK 90
Query: 47 TLWIGHNVIGFDIPQIKKLWGYGIPLKDVA-DTLVMSRLLDPTRKGGHSLDALSGNEKID 105
L +G N + +D + YGI L+ +A DT++ S +L+ G H +D+L+ E+
Sbjct: 91 ALKVGQN-LKYDRGILAN---YGIELRGIAFDTMLESYILNSVA-GRHDMDSLA--ERWL 143
Query: 106 FHDFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGFM 165
H T+ EE++ GK + I E+
Sbjct: 144 KHKTITF-----------------------EEIAGKGKNQLTFNQ----IALEEAGRYAA 176
Query: 166 LDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEG---- 221
D D+ +++ + + + + F I +P+ V+ + +++G ++L
Sbjct: 177 EDADVTLQLHLKMWPDLQKHKGPLNVFE-NIEMPLVPVLSRIERNGVKIDPKVLHNHSEE 235
Query: 222 -----CNVQGDYTKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKSG---NSYKICPENL 272
++ +IA EEFNL S Q+ L ++ G KP KT G S ++ E
Sbjct: 236 LTLRLAELEKKAHEIAGEEFNLSSTKQLQTILFEKQGIKPLKKTPGGAPSTSEEVLEELA 295
Query: 273 ATIPDTAPEAVKGLKAWKVLETRW--KLAQEWLQKSQETGRVHGRVILTGAVTHRAAHQG 330
P P+ + + L++ + KL L + +TGRVH T R +
Sbjct: 296 LDYP--LPKVILEYRGLAKLKSTYTDKLP---LMINPKTGRVHTSYHQAVTATGRLSSTD 350
Query: 331 PNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDP 390
PN+ NIP + G RQAF PE ++V D + I+LR++AH D
Sbjct: 351 PNLQNIP------------VRNEEGRRIRQAFIAPEDYVIVSADYSQIELRIMAHLSRDK 398
Query: 391 IYTEQVIDG-DIH------TFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNN 443
+G DIH F + R +AK + + G +A L
Sbjct: 399 GLLTAFAEGKDIHRATAAEVFGLPLETVTSEQRRSAKAINFGLIYGMSAFGLARQLNIPR 458
Query: 444 RQANEAMANFYEAIPS-LKKLKSQASQAASMGWMKGLDGRVL------------RIGSDH 490
++A + M ++E P L+ ++ +QA G+++ LDGR L R ++
Sbjct: 459 KEAQKYMDLYFERYPGVLEYMERTRAQAKEQGYVETLDGRRLYLPDIKSSNGARRAAAER 518
Query: 491 LALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQV 550
A++ +QG I++ A + + E + VHDE EV +D + + Q+
Sbjct: 519 AAINAPMQGTAADIIKRAMIAVDAWLQAEQPRVRMIMQVHDELVFEVHKDDVDAVAK-QI 577
Query: 551 QAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
+ + T +L+ P+ E G+NW + H
Sbjct: 578 HQLMENCT--RLDVPLLVEVGSGENWDQAH 605
>ref|NP_418300.1| (NC_000913) DNA polymerase I, 3' --> 5' polymerase, 5' --> 3' and
3' --> 5' exonuclease [Escherichia coli K12]
sp|P00582|DPO1_ECOLI DNA polymerase I (POL I)
pir||DJECI DNA-directed DNA polymerase (EC 2.7.7.7) I - Escherichia coli
emb|CAA23607.1| (V00317) DNA polymerase I [Escherichia coli]
gb|AAA24402.1| (J01663) DNA polymerase I [Escherichia coli]
gb|AAB02998.1| (L19201) DNA polymerase I [Escherichia coli]
gb|AAC76861.1| (AE000461) DNA polymerase I, 3' --> 5' polymerase, 5' --> 3' and 3'
--> 5' exonuclease [Escherichia coli K12]
Length = 928
Score = 97.8 bits (242), Expect = 3e-019
Identities = 147/630 (23%), Positives = 254/630 (39%), Gaps = 109/630 (17%)
Query: 5 FDIETDALD---ATVIHVLVAKRVGQKGFYVV--------------RDAETFKRLAK-QV 46
FD ETD+LD A ++ + A G + V R E K L + +
Sbjct: 354 FDTETDSLDNISANLVGLSFAIEPGVAAYIPVAHDYLDAPDQISRERALELLKPLLEDEK 413
Query: 47 TLWIGHNVIGFDIPQIKKLWGYGIPLKDVA-DTLVMSRLLDPTRKGGHSLDALSGNEKID 105
L +G N + +D + YGI L+ +A DT++ S +L+ G H +D+L+ E+
Sbjct: 414 ALKVGQN-LKYDRGILAN---YGIELRGIAFDTMLESYILNSVA-GRHDMDSLA--ERWL 466
Query: 106 FHDFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGFM 165
H T+ EE++ GK + I E+
Sbjct: 467 KHKTITF-----------------------EEIAGKGKNQLTFNQ----IALEEAGRYAA 499
Query: 166 LDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEG---- 221
D D+ +++ + + + + F I +P+ V+ + +++G ++L
Sbjct: 500 EDADVTLQLHLKMWPDLQKHKGPLNVFE-NIEMPLVPVLSRIERNGVKIDPKVLHNHSEE 558
Query: 222 -----CNVQGDYTKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKSG---NSYKICPENL 272
++ +IA EEFNL S Q+ L ++ G KP KT G S ++ E
Sbjct: 559 LTLRLAELEKKAHEIAGEEFNLSSTKQLQTILFEKQGIKPLKKTPGGAPSTSEEVLEELA 618
Query: 273 ATIPDTAPEAVKGLKAWKVLETRW--KLAQEWLQKSQETGRVHGRVILTGAVTHRAAHQG 330
P P+ + + L++ + KL L + +TGRVH T R +
Sbjct: 619 LDYP--LPKVILEYRGLAKLKSTYTDKLP---LMINPKTGRVHTSYHQAVTATGRLSSTD 673
Query: 331 PNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDP 390
PN+ NIP + G RQAF PE ++V D + I+LR++AH D
Sbjct: 674 PNLQNIP------------VRNEEGRRIRQAFIAPEDYVIVSADYSQIELRIMAHLSRDK 721
Query: 391 IYTEQVIDG-DIH------TFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNN 443
+G DIH F + R +AK + + G +A L
Sbjct: 722 GLLTAFAEGKDIHRATAAEVFGLPLETVTSEQRRSAKAINFGLIYGMSAFGLARQLNIPR 781
Query: 444 RQANEAMANFYEAIPS-LKKLKSQASQAASMGWMKGLDGRVL------------RIGSDH 490
++A + M ++E P L+ ++ +QA G+++ LDGR L R ++
Sbjct: 782 KEAQKYMDLYFERYPGVLEYMERTRAQAKEQGYVETLDGRRLYLPDIKSSNGARRAAAER 841
Query: 491 LALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQV 550
A++ +QG I++ A + + E + VHDE EV +D + + Q+
Sbjct: 842 AAINAPMQGTAADIIKRAMIAVDAWLQAEQPRVRMIMQVHDELVFEVHKDDVDAVAK-QI 900
Query: 551 QAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
+ + T +L+ P+ E G+NW + H
Sbjct: 901 HQLMENCT--RLDVPLLVEVGSGENWDQAH 928
>ref|NP_290488.1| (NC_002655) DNA polymerase I, 3' --> 5' polymerase, 5' --> 3' and
3' --> 5' exonuclease [Escherichia coli O157:H7 EDL933]
ref|NP_312813.1| (NC_002695) DNA polymerase I [Escherichia coli O157:H7]
gb|AAG59052.1|AE005617_1 (AE005617) DNA polymerase I, 3' --> 5' polymerase, 5' --> 3' and 3'
--> 5' exonuclease [Escherichia coli O157:H7 EDL933]
dbj|BAB38209.1| (AP002567) DNA polymerase I [Escherichia coli O157:H7]
Length = 928
Score = 97.4 bits (241), Expect = 4e-019
Identities = 147/630 (23%), Positives = 253/630 (39%), Gaps = 109/630 (17%)
Query: 5 FDIETDALD---ATVIHVLVAKRVGQKGFYVV--------------RDAETFKRLAK-QV 46
FD ETD+LD A ++ + A G + V R E K L + +
Sbjct: 354 FDTETDSLDNISANLVGLSFAIEPGVAAYIPVAHDYLDAPDQISRERALELLKPLLEDEK 413
Query: 47 TLWIGHNVIGFDIPQIKKLWGYGIPLKDVA-DTLVMSRLLDPTRKGGHSLDALSGNEKID 105
L +G N + +D + YGI L+ +A DT++ S +L+ G H +D+L+ E+
Sbjct: 414 ALKVGQN-LKYDRGILAN---YGIELRGIAFDTMLESYILNSVA-GRHDMDSLA--ERWL 466
Query: 106 FHDFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGFM 165
H T+ EE++ GK + I E+
Sbjct: 467 KHKTITF-----------------------EEIAGKGKNQLTFNQ----IALEEAGRYAA 499
Query: 166 LDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEG---- 221
D D+ +++ + + + + F I +P+ V+ + +++G ++L
Sbjct: 500 EDADVTLQLHLKMWPDLQKHKGPLNVFE-NIEMPLVPVLSRIERNGVKIDPKVLHNHSEE 558
Query: 222 -----CNVQGDYTKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKSG---NSYKICPENL 272
++ +IA EEFNL S Q+ L ++ G KP KT G S ++ E
Sbjct: 559 LTLRLAELEKKAHEIAGEEFNLSSTKQLQTILFEKQGIKPLKKTPGGAPSTSEEVLEELA 618
Query: 273 ATIPDTAPEAVKGLKAWKVLETRW--KLAQEWLQKSQETGRVHGRVILTGAVTHRAAHQG 330
P P+ + + L++ + KL L + +TGRVH T R +
Sbjct: 619 LDYP--LPKVILEYRGLAKLKSTYTDKLP---LMINPKTGRVHTSYHQAVTATGRLSSTD 673
Query: 331 PNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDP 390
PN+ NIP + G RQAF PE ++V D + I+LR++AH D
Sbjct: 674 PNLQNIP------------VRNEEGRRIRQAFIAPEDYVIVSADYSQIELRIMAHLSRDK 721
Query: 391 IYTEQVIDG-DIH------TFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNN 443
+G DIH F + R +AK + + G +A L
Sbjct: 722 GLLTAFAEGKDIHRATAAEVFGLPLETVTSEQRRSAKAINFGLIYGMSAFGLARQLNIPR 781
Query: 444 RQANEAMANFYEAIPS-LKKLKSQASQAASMGWMKGLDGRVL------------RIGSDH 490
++A + M ++E P L+ ++ +QA G+++ LDGR L R ++
Sbjct: 782 KEAQKYMDLYFERYPGVLEYMERTRAQAKEQGYVETLDGRRLYLPDIKSSNGARRAAAER 841
Query: 491 LALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQV 550
A++ +QG I++ A + + E + VHDE EV D + + Q+
Sbjct: 842 AAINAPMQGTAADIIKRAMIAVDAWLQAEQPRVRMIMQVHDELVFEVHRDDVDAVAK-QI 900
Query: 551 QAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
+ + T +L+ P+ E G+NW + H
Sbjct: 901 HQLMENCT--RLDVPLLVEVGSGENWDQAH 928
>emb|CAB56093.1| (AJ238761) DNA polymerase I [Rickettsia rhipicephali]
Length = 875
Score = 97.1 bits (240), Expect = 5e-019
Identities = 99/415 (23%), Positives = 176/415 (41%), Gaps = 45/415 (10%)
Query: 196 IAVPVKEVIIKRKKDG----SIYSNQL-----LEGCNVQGDYTKIAWEEFNLGSPAQVNK 246
I +P+ ++ K +K G + Y QL E ++ + ++ +FN+GSP Q+ +
Sbjct: 476 IDLPICFILDKMEKVGITVDANYLKQLSAEFGTEILKLEEEIFALSGTKFNIGSPKQLGE 535
Query: 247 RLDRLGWKPTVKTKS-GNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQEWLQK 305
L P K + +SY E L + + L W+ L + L K
Sbjct: 536 ILFEKMQLPFGKASAKASSYSTGAEILEKLSEHGYNIADLLLRWRQLTKLKNTYTDSLPK 595
Query: 306 SQE--TGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFK 363
+ T RVH + T T R + Q PN+ NIP + G + RQAF
Sbjct: 596 QIDNITHRVHTTFLQTSTTTGRLSSQEPNLQNIP------------IRSSEGNQIRQAFI 643
Query: 364 VPEGKLLVGTDAAGIQLRVLAHYMN-DPIYTEQVIDGDIHT------FNKEALGRYCKDR 416
EG L+ D + I+LR+L+H N D + + DIHT FN + + R
Sbjct: 644 AEEGYKLISADYSQIELRILSHIANIDALKQAFINKDDIHTQTACQIFNLQKHELTSEHR 703
Query: 417 PTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKSQASQ-AASMGW 475
AK + + G +A L N +A+E + ++ +++ Q A+S G+
Sbjct: 704 RKAKAINFGIIYGISAFGLAKQLNATNGEASEYIKKYFAEYKGVQEYMEQTKAFASSNGY 763
Query: 476 MKGLDGR----------VLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQ 525
+ GR L+ ++ A++ +QG I+++A + ++ +K + +
Sbjct: 764 VTNFFGRKCFVPLIHDKKLKQFAERAAINAPIQGTNADIIKIAMINLDQEIEKRKLKTRL 823
Query: 526 CAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH 580
+HDE E D+ + + I ++ I + T ++ P+ E + G NW+E H
Sbjct: 824 VLQIHDELLFEAPIDEVEIITPI-IKKIMENST--NMDVPIITEIRAGNNWMEIH 875
>ref|NP_458049.1| (NC_003198) DNA polymerase I [Salmonella enterica subsp. enterica
serovar Typhi]
emb|CAD03100.1| (AL627280) DNA polymerase I [Salmonella enterica subsp. enterica
serovar Typhi]
Length = 928
Score = 95.1 bits (235), Expect = 2e-018
Identities = 129/561 (22%), Positives = 224/561 (38%), Gaps = 105/561 (18%)
Query: 65 LWGYGIPLKDVA-DTLVMSRLLDPTRKGGHSLDALSG----NEKIDFHDFSTYTPEMLA- 118
L YGI L+ +A DT++ S +L+ G H +D+LS ++ I F D + L
Sbjct: 428 LQNYGIELRGIAFDTMLESYILNSVA-GRHDMDSLSDRWLKHKTITFEDIAGKGKNQLTF 486
Query: 119 ----------YCKQDVAINEKVYLQLKEELSNF-GKASI--QLEHQMQAIVCEQEKNGFM 165
Y +D + +++L++ EL G ++ +E + ++ E+NG
Sbjct: 487 NQIALEEAGRYAAEDADVTLQLHLKMWPELQQHKGPLNVFENIEMPLVPVLSRVERNGVK 546
Query: 166 LDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCNVQ 225
+D + + + I + E + K+ D
Sbjct: 547 IDPAVLH-------KHSEEITLRLAE------------LEKKAHD--------------- 572
Query: 226 GDYTKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKSG---NSYKICPENLATIPDTAPE 281
IA E FNL S Q+ L ++ G KP KT G S ++ E P P+
Sbjct: 573 -----IAGEAFNLSSTKQLQTILFEKQGIKPLKKTPGGAPSTSEEVLEELALDYP--LPK 625
Query: 282 AVKGLKAWKVLETRW--KLAQEWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSV 339
+ + L++ + KL L + +TGRVH T R + PN+ NIP
Sbjct: 626 VILEYRGLAKLKSTYTDKLP---LMINPKTGRVHTSYHQAVTATGRLSSTDPNLQNIP-- 680
Query: 340 PHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVIDG 399
+ G RQAF PE L+V D + I+LR++AH D +G
Sbjct: 681 ----------VRNEEGRRIRQAFIAPEDYLIVSADYSQIELRIMAHLSRDKGLLTAFAEG 730
Query: 400 -DIH------TFNKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMAN 452
DIH F + R +AK + + G ++ L ++A + M
Sbjct: 731 KDIHRATAAEVFGLPLDSVTGEQRRSAKAINFGLIYGMSAFGLSRQLNIPRKEAQKYMDL 790
Query: 453 FYEAIPS-LKKLKSQASQAASMGWMKGLDGRVL------------RIGSDHLALSVYLQG 499
++E P L+ ++ +QA G+++ L+GR L R G++ A++ +QG
Sbjct: 791 YFERYPGVLEYMERTRAQAKEQGYVETLEGRRLYLPDIKSSNAARRAGAERAAINAPMQG 850
Query: 500 GETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTF 559
I++ A + + E + VHDE EV +D + + Q +++
Sbjct: 851 TAADIIKRAMIAVDAWLQAEQPRVRMIMQVHDELVFEVHKDDLDAVAKRIHQLMENC--- 907
Query: 560 FKLNCPMDGEAKIGKNWLETH 580
+++ P+ E G+NW + H
Sbjct: 908 TRIDVPLLVEVGSGENWDQAH 928
>ref|NP_268299.1| (NC_002662) DNA polymerase I (EC 2.7.7.7) [Lactococcus lactis
subsp. lactis]
sp|Q9CDS1|DPO1_LACLA DNA polymerase I (POL I)
gb|AAK06240.1|AE006442_7 (AE006442) DNA polymerase I (EC 2.7.7.7) [Lactococcus lactis subsp.
lactis]
Length = 877
Score = 93.6 bits (231), Expect = 5e-018
Identities = 98/429 (22%), Positives = 181/429 (41%), Gaps = 56/429 (13%)
Query: 185 IEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCNVQGDYT---------KIAWEE 235
+E E + + + +P+ EV+ K + G S LE + + +A EE
Sbjct: 468 LENEQEHLLSDMELPLAEVLAKMEITGIAVSQNTLEEIGAENEEKLASLTREIYDLAGEE 527
Query: 236 FNLGSPAQVNKRL-DRLGWKPTVKTKSGNSYKICPENLATIPDTAPEAVKGLKAWKVLET 294
FN+ SP Q+ L ++L KTK+G Y + L + +P K L+ ++ +
Sbjct: 528 FNINSPKQLGVILFEKLQLPVGKKTKTG--YSTAVDVLEDLAALSPVVAKILEYRQINKV 585
Query: 295 RWKLAQEWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMY 354
+ + + + + G++H R + T R + PN+ NIP ++E
Sbjct: 586 QSTYVKGLIPQIADDGKIHTRYVQDLTQTGRLSSVDPNLQNIPV----------RLEE-- 633
Query: 355 GAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVID-----GDIHTFN---- 405
G + R+AF + LL+ +D + I+LRVLAH D E +ID DIHT
Sbjct: 634 GRKIRKAFVPSKDSLLLSSDYSQIELRVLAHISGD----EHLIDAFKHGADIHTSTAMRV 689
Query: 406 ---KEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKK 462
++A DR AK + + G +A LG + A + ++E P +K
Sbjct: 690 FGIEKAEDVTANDRRNAKAVNFGVVYGISDFGLARNLGITRKDAKNYIETYFERYPGIKT 749
Query: 463 -LKSQASQAASMGWMKGLDGRVLRIG------------SDHLALSVYLQGGETVIMRLAN 509
+++ +A G+++ + R +I ++ A++ +QG I+++A
Sbjct: 750 YMENIVREARDKGFVETMSHRRRKIPDINARNFNVRGFAERTAINSPIQGSAADILKIAM 809
Query: 510 VFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGE 569
+ + K VHDE +V ++ + + Q +++A +L P+ +
Sbjct: 810 INLDKALSARDFKSKLLLQVHDEIILDVPLEELDEIKVLVKQTMEEA---IELAVPLKVD 866
Query: 570 AKIGKNWLE 578
GK W E
Sbjct: 867 ENTGKTWYE 875
>ref|NP_229767.1| (NC_002505) DNA polymerase I [Vibrio cholerae]
pir||B82364 DNA polymerase I VC0108 [imported] - Vibrio cholerae (group O1
strain N16961)
gb|AAF93286.1| (AE004101) DNA polymerase I [Vibrio cholerae]
Length = 934
Score = 91.3 bits (225), Expect = 3e-017
Identities = 109/434 (25%), Positives = 181/434 (41%), Gaps = 62/434 (14%)
Query: 186 EAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCN---------VQGDYTKIAWEEF 236
+A++++ I +P+ V+ + ++ G + + LL + ++ + ++A + F
Sbjct: 524 DAKLEQVYREIEMPLVPVLSRIERTGVMIDDMLLSAQSQEIALRLDQLEQNAYELAGQPF 583
Query: 237 NLGSPAQVNKRLDRLGWKPTV-KTKSGNSYKICPENLATIPDTAPEAVKGLKAWKVL-ET 294
NL SP Q+ L P + KT SG +T + E KVL E
Sbjct: 584 NLSSPKQLQTILFEQMKLPVLQKTPSGTP--------STNEEVLQELALDYPLPKVLIEY 635
Query: 295 RW--KLAQEWLQK-----SQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGIL 347
R KL + K + TGRVH T R + PN+ NIP
Sbjct: 636 RGLAKLKSTYTDKLPKMINPSTGRVHTSYHQAVTATGRLSSTDPNLQNIP---------- 685
Query: 348 WKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVIDG-DIHTFN- 405
+ G RQAF P G ++ D + I+LR++AH D + DG DIH
Sbjct: 686 --VRNEEGRRIRQAFVAPHGWKIMAVDYSQIELRIMAHLSGDQALLDAFRDGKDIHAATA 743
Query: 406 KEALG-----RYCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPS- 459
E +G + R AK + + G +A LG +A E M ++E P
Sbjct: 744 AEIIGVPIDQVSSEQRRRAKAVNFGLIYGMSAFGLAKQLGIPRGEAQEYMDKYFERYPGV 803
Query: 460 LKKLKSQASQAASMGWMKGLDGRVL------------RIGSDHLALSVYLQGGETVIMRL 507
++ ++ S+AA +G+++ + GR L R ++ A++ +QG I++
Sbjct: 804 MQYMEDTRSRAAQLGYVETIFGRRLHLPEITSRNAMRRKAAERAAINAPMQGTAADIIKK 863
Query: 508 ANVFWQRQAKKEGI-NFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTFFKLNCPM 566
A + ++EG K VHDE EV E + E +VQ + ++ +L P+
Sbjct: 864 AMLLVDEWIEREGDGRVKLLMQVHDELVFEVKESSLSEI-ESKVQQLMESAA--ELAVPL 920
Query: 567 DGEAKIGKNWLETH 580
EA G NW + H
Sbjct: 921 VAEAGHGDNWEQAH 934
>ref|NP_295430.1| (NC_001263) DNA-directed DNA polymerase [Deinococcus radiodurans]
sp|P52027|DPO1_DEIRA DNA polymerase I (POL I)
pir||E75363 DNA-directed DNA polymerase - Deinococcus radiodurans (strain R1)
gb|AAF11264.1|AE002012_4 (AE002012) DNA-directed DNA polymerase [Deinococcus radiodurans]
Length = 956
Score = 91.3 bits (225), Expect = 3e-017
Identities = 94/383 (24%), Positives = 163/383 (42%), Gaps = 47/383 (12%)
Query: 223 NVQGDYTKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKSGNSYKICPENLATIPDTAPE 281
+++ + A EEF++ SP Q+ L D+L + KTK L + D P
Sbjct: 594 DLESQIHEYAGEEFHIRSPKQLETVLYDKLELASSKKTKLTGQRSTAVSALEPLRDAHP- 652
Query: 282 AVKGLKAWKVLETRWKLAQEWLQK-----SQETGRVHGRVILTGAVTHRAAHQGPNMANI 336
+ + ++ L+ KL +L + TGR+H T T R + PN+ NI
Sbjct: 653 IIPLVLEFRELD---KLRGTYLDPIPNLVNPHTGRLHTTFAQTAVATGRLSSLNPNLQNI 709
Query: 337 PSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQV 396
P + G E R+ F +G L+ D + I+LR+LAH +DP+ +
Sbjct: 710 P------------IRSELGREIRKGFIAEDGFTLIAADYSQIELRLLAHIADDPLMQQAF 757
Query: 397 IDG-DIHTFN-KEALGR-----YCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEA 449
++G DIH + LG R AKT + L G +++ LG +A
Sbjct: 758 VEGADIHRRTAAQVLGLDEATVDANQRRAAKTVNFGVLYGMSAHRLSNDLGIPYAEAATF 817
Query: 450 MANFYEAIPSLKK-LKSQASQAASMGWMKGLDG------------RVLRIGSDHLALSVY 496
+ ++ P +++ + + G+++ L G RV R + LA ++
Sbjct: 818 IEIYFATYPGIRRYINHTLDFGRTHGYVETLYGRRRYVPGLSSRNRVQREAEERLAYNMP 877
Query: 497 LQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAIKDA 556
+QG IM+LA V Q + + I + VHDE E D+A+++ + + +++
Sbjct: 878 IQGTAADIMKLAMV--QLDPQLDAIGARMLLQVHDELLIEAPLDKAEQVAALTKKVMENV 935
Query: 557 GTFFKLNCPMDGEAKIGKNWLET 579
+L P+ E G NW +T
Sbjct: 936 ---VQLKVPLAVEVGTGPNWFDT 955
>pdb|5KTQ|A Chain A, Large Fragment Of Taq Dna Polymerase Bound To Dctp
pdb|1KTQ| Dna Polymerase
Length = 543
Score = 90.1 bits (222), Expect = 6e-017
Identities = 128/581 (22%), Positives = 224/581 (38%), Gaps = 111/581 (19%)
Query: 28 KGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIPQIKKLWGYGIPLKDVADTLVMSRLLDP 87
+G V R E +K L L ++ D+ + G G+P D D ++++ LLDP
Sbjct: 39 RGGRVHRAPEPYKALRD---LKEARGLLAKDLSVLALREGLGLPPGD--DPMLLAYLLDP 93
Query: 88 TRKGGHSLDALSGNEKIDFHDFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASI- 146
+ + G E +T E + A++E+++ L L G+ +
Sbjct: 94 SNTTPEGVARRYGGE---------WTEE----AGERAALSERLFANLWGRLE--GEERLL 138
Query: 147 ----QLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKE 202
++E + A++ E G LD + E R+EAE+
Sbjct: 139 WLYREVERPLSAVLAHMEATGVRLDVAYLRALSLEVAEEIARLEAEV------------- 185
Query: 203 VIIKRKKDGSIYSNQLLEGCNVQGDYTKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKS 261
++A FNL S Q+ + L D LG KT+
Sbjct: 186 --------------------------FRLAGHPFNLNSRDQLERVLFDELGLPAIGKTEK 219
Query: 262 GNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQEWLQKSQE-----TGRVHGRV 316
L + + P K L+ ++ KL ++ + TGR+H R
Sbjct: 220 TGKRSTSAAVLEALREAHPIVEKILQYRELT----KLKSTYIDPLPDLIHPRTGRLHTRF 275
Query: 317 ILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAA 376
T T R + PN+ NIP + G R+AF EG LLV D +
Sbjct: 276 NQTATATGRLSSSDPNLQNIP------------VRTPLGQRIRRAFIAEEGWLLVALDYS 323
Query: 377 GIQLRVLAHYMNDPIYTEQVIDG-DIHTFNKE---ALGRYCKD---RPTAKTFIYAFLLG 429
I+LRVLAH D +G DIHT + R D R AKT + L G
Sbjct: 324 QIELRVLAHLSGDENLIRVFQEGRDIHTETASWMFGVPREAVDPLMRRAAKTINFGVLYG 383
Query: 430 AGTGMIASILGCNNRQANEAMANFYEAIPSLKK-LKSQASQAASMGWMKGLDGR------ 482
++ L +A + ++++ P ++ ++ + G+++ L GR
Sbjct: 384 MSAHRLSQELAIPYEEAQAFIERYFQSFPKVRAWIEKTLEEGRRRGYVETLFGRRRYVPD 443
Query: 483 ------VLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTE 536
+R ++ +A ++ +QG +M+LA V + + E + + VHDE E
Sbjct: 444 LEARVKSVREAAERMAFNMPVQGTAADLMKLAMV--KLFPRLEEMGARMLLQVHDELVLE 501
Query: 537 VDEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWL 577
+++A+ + + + ++ + L P++ E IG++WL
Sbjct: 502 APKERAEAVARLAKEVMEGV---YPLAVPLEVEVGIGEDWL 539
>sp|P19821|DPO1_THEAQ DNA polymerase I, thermostable (Taq polymerase 1)
pir||A33530 DNA-directed DNA polymerase (EC 2.7.7.7) I - Thermus aquaticus
pdb|1BGX|T Chain T, Taq Polymerase In Complex With Tp7, An Inhibitory Fab
pdb|1CMW|A Chain A, Crystal Structure Of Taq Dna-Polymerase Shows A New
Orientation For The Structure-Specific Nuclease Domain
gb|AAA27507.1| (J04639) DNA polymerase [Thermus aquaticus]
Length = 832
Score = 90.1 bits (222), Expect = 6e-017
Identities = 128/581 (22%), Positives = 224/581 (38%), Gaps = 111/581 (19%)
Query: 28 KGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIPQIKKLWGYGIPLKDVADTLVMSRLLDP 87
+G V R E +K L L ++ D+ + G G+P D D ++++ LLDP
Sbjct: 328 RGGRVHRAPEPYKALRD---LKEARGLLAKDLSVLALREGLGLPPGD--DPMLLAYLLDP 382
Query: 88 TRKGGHSLDALSGNEKIDFHDFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASI- 146
+ + G E +T E + A++E+++ L L G+ +
Sbjct: 383 SNTTPEGVARRYGGE---------WTEE----AGERAALSERLFANLWGRLE--GEERLL 427
Query: 147 ----QLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKE 202
++E + A++ E G LD + E R+EAE+
Sbjct: 428 WLYREVERPLSAVLAHMEATGVRLDVAYLRALSLEVAEEIARLEAEV------------- 474
Query: 203 VIIKRKKDGSIYSNQLLEGCNVQGDYTKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKS 261
++A FNL S Q+ + L D LG KT+
Sbjct: 475 --------------------------FRLAGHPFNLNSRDQLERVLFDELGLPAIGKTEK 508
Query: 262 GNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQEWLQKSQE-----TGRVHGRV 316
L + + P K L+ ++ KL ++ + TGR+H R
Sbjct: 509 TGKRSTSAAVLEALREAHPIVEKILQYRELT----KLKSTYIDPLPDLIHPRTGRLHTRF 564
Query: 317 ILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAA 376
T T R + PN+ NIP + G R+AF EG LLV D +
Sbjct: 565 NQTATATGRLSSSDPNLQNIP------------VRTPLGQRIRRAFIAEEGWLLVALDYS 612
Query: 377 GIQLRVLAHYMNDPIYTEQVIDG-DIHTFNKE---ALGRYCKD---RPTAKTFIYAFLLG 429
I+LRVLAH D +G DIHT + R D R AKT + L G
Sbjct: 613 QIELRVLAHLSGDENLIRVFQEGRDIHTETASWMFGVPREAVDPLMRRAAKTINFGVLYG 672
Query: 430 AGTGMIASILGCNNRQANEAMANFYEAIPSLKK-LKSQASQAASMGWMKGLDGR------ 482
++ L +A + ++++ P ++ ++ + G+++ L GR
Sbjct: 673 MSAHRLSQELAIPYEEAQAFIERYFQSFPKVRAWIEKTLEEGRRRGYVETLFGRRRYVPD 732
Query: 483 ------VLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTE 536
+R ++ +A ++ +QG +M+LA V + + E + + VHDE E
Sbjct: 733 LEARVKSVREAAERMAFNMPVQGTAADLMKLAMV--KLFPRLEEMGARMLLQVHDELVLE 790
Query: 537 VDEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWL 577
+++A+ + + + ++ + L P++ E IG++WL
Sbjct: 791 APKERAEAVARLAKEVMEGV---YPLAVPLEVEVGIGEDWL 828
>pdb|4KTQ|A Chain A, Binary Complex Of The Large Fragment Of Dna Polymerase I
From T. Aquaticus Bound To A PrimerTEMPLATE DNA
Length = 539
Score = 90.1 bits (222), Expect = 6e-017
Identities = 128/581 (22%), Positives = 224/581 (38%), Gaps = 111/581 (19%)
Query: 28 KGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIPQIKKLWGYGIPLKDVADTLVMSRLLDP 87
+G V R E +K L L ++ D+ + G G+P D D ++++ LLDP
Sbjct: 35 RGGRVHRAPEPYKALRD---LKEARGLLAKDLSVLALREGLGLPPGD--DPMLLAYLLDP 89
Query: 88 TRKGGHSLDALSGNEKIDFHDFSTYTPEMLAYCKQDVAINEKVYLQLKEELSNFGKASI- 146
+ + G E +T E + A++E+++ L L G+ +
Sbjct: 90 SNTTPEGVARRYGGE---------WTEE----AGERAALSERLFANLWGRLE--GEERLL 134
Query: 147 ----QLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLRETNRIEAEIKEFMVPIAVPVKE 202
++E + A++ E G LD + E R+EAE+
Sbjct: 135 WLYREVERPLSAVLAHMEATGVRLDVAYLRALSLEVAEEIARLEAEV------------- 181
Query: 203 VIIKRKKDGSIYSNQLLEGCNVQGDYTKIAWEEFNLGSPAQVNKRL-DRLGWKPTVKTKS 261
++A FNL S Q+ + L D LG KT+
Sbjct: 182 --------------------------FRLAGHPFNLNSRDQLERVLFDELGLPAIGKTEK 215
Query: 262 GNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQEWLQKSQE-----TGRVHGRV 316
L + + P K L+ ++ KL ++ + TGR+H R
Sbjct: 216 TGKRSTSAAVLEALREAHPIVEKILQYRELT----KLKSTYIDPLPDLIHPRTGRLHTRF 271
Query: 317 ILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVPEGKLLVGTDAA 376
T T R + PN+ NIP + G R+AF EG LLV D +
Sbjct: 272 NQTATATGRLSSSDPNLQNIP------------VRTPLGQRIRRAFIAEEGWLLVALDYS 319
Query: 377 GIQLRVLAHYMNDPIYTEQVIDG-DIHTFNKE---ALGRYCKD---RPTAKTFIYAFLLG 429
I+LRVLAH D +G DIHT + R D R AKT + L G
Sbjct: 320 QIELRVLAHLSGDENLIRVFQEGRDIHTETASWMFGVPREAVDPLMRRAAKTINFGVLYG 379
Query: 430 AGTGMIASILGCNNRQANEAMANFYEAIPSLKK-LKSQASQAASMGWMKGLDGR------ 482
++ L +A + ++++ P ++ ++ + G+++ L GR
Sbjct: 380 MSAHRLSQELAIPYEEAQAFIERYFQSFPKVRAWIEKTLEEGRRRGYVETLFGRRRYVPD 439
Query: 483 ------VLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTE 536
+R ++ +A ++ +QG +M+LA V + + E + + VHDE E
Sbjct: 440 LEARVKSVREAAERMAFNMPVQGTAADLMKLAMV--KLFPRLEEMGARMLLQVHDELVLE 497
Query: 537 VDEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWL 577
+++A+ + + + ++ + L P++ E IG++WL
Sbjct: 498 APKERAEAVARLAKEVMEGV---YPLAVPLEVEVGIGEDWL 535
>ref|NP_372214.1| (NC_002758) DNA polymerase I [Staphylococcus aureus subsp. aureus
Mu50]
ref|NP_374801.1| (NC_002745) DNA polymerase I [Staphylococcus aureus subsp. aureus
N315]
dbj|BAB42780.1| (AP003134) DNA polymerase I [Staphylococcus aureus subsp. aureus
N315]
dbj|BAB57852.1| (AP003363) DNA polymerase I [Staphylococcus aureus subsp. aureus
Mu50]
Length = 876
Score = 86.7 bits (213), Expect = 6e-016
Identities = 127/547 (23%), Positives = 224/547 (40%), Gaps = 94/547 (17%)
Query: 69 GIPLKDVA-DTLVMSRLLDPTR--KGGHSLDALSGNEKIDFHDFSTY-------TPEMLA 118
GI +++++ D ++ S ++DP+R S+ +L G + D S Y PE
Sbjct: 385 GIDIQNISFDIMLASYIIDPSRTISDVQSVVSLYGQSFVK-DDVSIYGKGKKFKVPEDDV 443
Query: 119 YCKQDVAINEKVYL---QLKEELSNFGKASI--QLEHQMQAIVCEQEKNGFMLDTDIAEE 173
+I + +Y + ++L + + + LE + I+ E E+ G D EE
Sbjct: 444 LNPYVASITDAIYFAKPNMDKQLEEYNQVELLADLELPLAKILSEMEEIGIFTDVHDLEE 503
Query: 174 IYTTCLRETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCNVQGDYTKIAW 233
+ E EI+E + +V+I+ D A
Sbjct: 504 M-----------EKEIQEKL--------DVLIRNIHDA--------------------AG 524
Query: 234 EEFNLGSPAQVNKRLDRLGWKPTVKTKSGNSYKICPENLATIPDTAPEAVKGLKAWKVLE 293
E+FN+ SP Q+ L P +K K+ Y + L + P + + ++ L
Sbjct: 525 EDFNINSPKQLGVVLFETLQLPVIK-KTKTGYSTAVDVLEQLQGEHP-IIDYILEYRQLS 582
Query: 294 TRWKLAQEWLQKS-QETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEG 352
E LQK + R+H R T A T R + PN+ NIP ++E
Sbjct: 583 KLQSTYVEGLQKVISDDQRIHTRFNQTLAQTGRLSSVDPNLQNIPV----------RLEE 632
Query: 353 MYGAECRQAFK-VPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVIDGD-IHT------F 404
G + R+AFK + +++ D + I+LRVLAH D E I+GD IHT F
Sbjct: 633 --GRKIRKAFKPTSKDSVILSADYSQIELRVLAHITQDESMKEAFINGDDIHTATAMKVF 690
Query: 405 NKEALGRYCKDRPTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLK 464
EA R AK + + G ++ LG ++A + ++ + P +K+
Sbjct: 691 GVEADQVDSLMRRQAKAVNFGIVYGISDYGLSQSLGITRKKAKAFIDDYLASFPGVKQYM 750
Query: 465 SQ-ASQAASMGWMKGLDGRV------------LRIGSDHLALSVYLQGGETVIMRLANVF 511
S A ++G+++ L R LR ++ A++ +QG I++LA V
Sbjct: 751 SDIVKDAKALGYVETLLHRRRYIPDITSRNFNLRGFAERTAMNTPIQGSAADIIKLAMVK 810
Query: 512 WQRQAKKEGINFKQCAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTFFKLNCPMDGEAK 571
+ ++ K+ K VHDE EV + + E + +++A +L+ P+ ++
Sbjct: 811 FAQKMKETTYQAKLLLQVHDELIFEVPKSEVDSFSEFVEEIMENA---LQLDVPLKVDSS 867
Query: 572 IGKNWLE 578
G W +
Sbjct: 868 YGATWYD 874
>ref|NP_188690.1| (NM_112946) DNA polymerase, putative [Arabidopsis thaliana]
Length = 1160
Score = 45.1 bits (105), Expect = 0.002
Identities = 63/292 (21%), Positives = 117/292 (39%), Gaps = 36/292 (12%)
Query: 306 SQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQAFKVP 365
S + GRVH + + T R + + PN+ N P++ + I RQAF
Sbjct: 750 SGKDGRVHCSLNIN-TETGRLSARRPNLQNQPALEKDRYKI------------RQAFIAS 796
Query: 366 EGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVI-DGDIHTFNKEALGRYCK--------DR 416
G L+ D ++LR+LAH + E I GD H ++ A+ Y +R
Sbjct: 797 PGNSLIVADYGQLELRILAHLASCESMKEAFIAGGDFH--SRTAMNMYPHIREAVENGER 854
Query: 417 PTAKTFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKS-QASQAASMGW 475
AK ++ G ++ + +A + + +Y ++K + + +A G+
Sbjct: 855 RKAKMLNFSIAYGKTAIGLSRDWKVSREEAQDTVNLWYNDRQEVRKWQELRKKEAIQKGY 914
Query: 476 MKGLDGRV-------LRIGSDHL---ALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQ 525
+ L GR R +H+ A++ +QG + A + + + + +K
Sbjct: 915 VLTLLGRARKFPEYRSRAQKNHIERAAINTPVQGSAADVAMCAMLEISNNQRLKELGWKL 974
Query: 526 CAWVHDEWQTEVDEDQAQRLGEIQVQAIKDAGTFFK-LNCPMDGEAKIGKNW 576
VHDE E + A+ +I V + + L+ + +AK +NW
Sbjct: 975 LLQVHDEVILEGPSESAENAKDIVVNCMSEPFNGKNILSVDLSVDAKCAQNW 1026
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.319 0.136 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 328,945,603
Number of Sequences: 891141
Number of extensions: 14198551
Number of successful extensions: 30063
Number of sequences better than 5.0e-02: 122
Number of HSP's better than 0.1 without gapping: 10
Number of HSP's successfully gapped in prelim test: 112
Number of HSP's that attempted gapping in prelim test: 29647
Number of HSP's gapped (non-prelim): 188
length of query: 580
length of database: 279,076,897
effective HSP length: 126
effective length of query: 454
effective length of database: 166,793,131
effective search space: 75724081474
effective search space used: 75724081474
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 94 (40.8 bits)
Query= orf27 [5753-5946]
(192 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064754.1| (NC_002519) gp16 [Roseophage SIO1] >gi|994... 385 e-106
>ref|NP_064754.1| (NC_002519) gp16 [Roseophage SIO1]
gb|AAG02599.1|AF189021_18 (AF189021) Roseophage SIO1 complete genome
Length = 192
Score = 385 bits (988), Expect = e-106
Identities = 192/192 (100%), Positives = 192/192 (100%)
Query: 1 MYNNQTDQRHKEINMADKKIVLKNVEVSWAKLLEAGLKYKSETEYEFSVAVKANDQLRDL 60
MYNNQTDQRHKEINMADKKIVLKNVEVSWAKLLEAGLKYKSETEYEFSVAVKANDQLRDL
Sbjct: 1 MYNNQTDQRHKEINMADKKIVLKNVEVSWAKLLEAGLKYKSETEYEFSVAVKANDQLRDL 60
Query: 61 MKSFKLNKQFKSKDSTFDGEEFIQLTLDTRTKSGWVRHGEVFDEFGDPTKDLIGNGSKMN 120
MKSFKLNKQFKSKDSTFDGEEFIQLTLDTRTKSGWVRHGEVFDEFGDPTKDLIGNGSKMN
Sbjct: 61 MKSFKLNKQFKSKDSTFDGEEFIQLTLDTRTKSGWVRHGEVFDEFGDPTKDLIGNGSKMN 120
Query: 121 LFVSIGQSSYGNIIKLGHLEDMDMESKEMMFHFGQVMELVPFEAPSAVIKKEKEINESVE 180
LFVSIGQSSYGNIIKLGHLEDMDMESKEMMFHFGQVMELVPFEAPSAVIKKEKEINESVE
Sbjct: 121 LFVSIGQSSYGNIIKLGHLEDMDMESKEMMFHFGQVMELVPFEAPSAVIKKEKEINESVE 180
Query: 181 AMAEDEMEIPFG 192
AMAEDEMEIPFG
Sbjct: 181 AMAEDEMEIPFG 192
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.314 0.132 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 100,502,937
Number of Sequences: 891141
Number of extensions: 4030092
Number of successful extensions: 7732
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 7731
Number of HSP's gapped (non-prelim): 1
length of query: 192
length of database: 279,076,897
effective HSP length: 113
effective length of query: 79
effective length of database: 178,377,964
effective search space: 14091859156
effective search space used: 14091859156
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 87 (38.1 bits)
Query= orf28 [5944-6091]
(146 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.316 0.137 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 87,633,330
Number of Sequences: 891141
Number of extensions: 3735784
Number of successful extensions: 7470
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 7470
Number of HSP's gapped (non-prelim): 0
length of query: 146
length of database: 279,076,897
effective HSP length: 108
effective length of query: 38
effective length of database: 182,833,669
effective search space: 6947679422
effective search space used: 6947679422
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 85 (37.4 bits)
Query= orf29 [6088-6376]
(287 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064755.1| (NC_002519) gp17 [Roseophage SIO1] >gi|994... 583 e-166
gb|AAL02215.1| (AF405604) DNA polymerase [uncultured organism] 129 4e-029
gb|AAL02210.1| (AF405599) DNA polymerase [uncultured organism] 126 2e-028
gb|AAL02212.1| (AF405601) DNA polymerase [uncultured organism] 126 2e-028
gb|AAL02211.1| (AF405600) DNA polymerase [uncultured organism] 126 3e-028
gb|AAL02214.1| (AF405603) DNA polymerase [uncultured organism] 125 6e-028
>ref|NP_064755.1| (NC_002519) gp17 [Roseophage SIO1]
gb|AAG02600.1|AF189021_19 (AF189021) Roseophage SIO1 complete genome
Length = 287
Score = 583 bits (1504), Expect = e-166
Identities = 287/287 (100%), Positives = 287/287 (100%)
Query: 1 MTEETKSIDTLIDDIYSVFTDGYSKSPDNEKLIDAFGEAMKGLMRSRLTPRESSGGTLRL 60
MTEETKSIDTLIDDIYSVFTDGYSKSPDNEKLIDAFGEAMKGLMRSRLTPRESSGGTLRL
Sbjct: 1 MTEETKSIDTLIDDIYSVFTDGYSKSPDNEKLIDAFGEAMKGLMRSRLTPRESSGGTLRL 60
Query: 61 SAIGKPARQLWYDSRGVEKPDFTGDQLLKFFYGDVIEEVLLTLAKLSGHSVTNEQQKVVV 120
SAIGKPARQLWYDSRGVEKPDFTGDQLLKFFYGDVIEEVLLTLAKLSGHSVTNEQQKVVV
Sbjct: 61 SAIGKPARQLWYDSRGVEKPDFTGDQLLKFFYGDVIEEVLLTLAKLSGHSVTNEQQKVVV 120
Query: 121 AGITGHMDAVIDGHVIDVKSASATAFKKFDQGSLVFDDPFGYMHQIAAYSEAVEGNKGSG 180
AGITGHMDAVIDGHVIDVKSASATAFKKFDQGSLVFDDPFGYMHQIAAYSEAVEGNKGSG
Sbjct: 121 AGITGHMDAVIDGHVIDVKSASATAFKKFDQGSLVFDDPFGYMHQIAAYSEAVEGNKGSG 180
Query: 181 FLAMNKVDGKLTLFQPDPDFLPDTQERVDYLKEALASDTPPERCYEEVTETNGNKKLPMG 240
FLAMNKVDGKLTLFQPDPDFLPDTQERVDYLKEALASDTPPERCYEEVTETNGNKKLPMG
Sbjct: 181 FLAMNKVDGKLTLFQPDPDFLPDTQERVDYLKEALASDTPPERCYEEVTETNGNKKLPMG 240
Query: 241 CAFCSFKKECWKDANDGSGLRGFGYSFGTVYLTHVEKAPRVGEVDVE 287
CAFCSFKKECWKDANDGSGLRGFGYSFGTVYLTHVEKAPRVGEVDVE
Sbjct: 241 CAFCSFKKECWKDANDGSGLRGFGYSFGTVYLTHVEKAPRVGEVDVE 287
>gb|AAL02215.1| (AF405604) DNA polymerase [uncultured organism]
Length = 182
Score = 129 bits (323), Expect = 4e-029
Identities = 73/161 (45%), Positives = 102/161 (63%), Gaps = 4/161 (2%)
Query: 37 GEAMKGLMRSRLTPRESSGGT-LRLSAIGKP-ARQLWYDSRGVEKPD-FTGDQLLKFFYG 93
G + MR +L +++ LR+S IGKP +R LWYD G E+ + FT LKF G
Sbjct: 2 GATVLDAMRKQLWVSQAAPNPKLRMSNIGKPCSRALWYDINGDEQAESFTPQTKLKFIVG 61
Query: 94 DVIEEVLLTLAKLSGHSVTNEQQKVVVAGITGHMDAVIDGHVIDVKSASATAFKKFDQGS 153
D++E +L+ LAK +GHSVT Q ++ + I GH+D +IDG ++DVKSASA + KKF G+
Sbjct: 62 DIVEAMLIYLAKEAGHSVTEMQAEIEIDDIKGHIDCMIDGELVDVKSASAFSMKKFKNGT 121
Query: 154 LVFDDPFGYMHQIAAYSEAVEGNKGSGFLAMNKVDGKLTLF 194
L DD FGY+ QI+ Y+ A G K FLA +K DG+L +
Sbjct: 122 LPDDDAFGYISQISGYANAF-GKKSGTFLAFDKSDGELATY 161
>gb|AAL02210.1| (AF405599) DNA polymerase [uncultured organism]
Length = 182
Score = 126 bits (317), Expect = 2e-028
Identities = 72/161 (44%), Positives = 101/161 (62%), Gaps = 4/161 (2%)
Query: 37 GEAMKGLMRSRLTPRESSGGT-LRLSAIGKP-ARQLWYDSRGVEKPD-FTGDQLLKFFYG 93
G + MR +L +++ LR+S IGKP +R LWYD G E+ + FT LKF G
Sbjct: 2 GATVLDAMRKQLWVSQAAPNPKLRMSNIGKPCSRALWYDINGDEQAESFTPQTKLKFIVG 61
Query: 94 DVIEEVLLTLAKLSGHSVTNEQQKVVVAGITGHMDAVIDGHVIDVKSASATAFKKFDQGS 153
D++E +L+ LAK +GHSVT Q ++ + I GH+D +IDG ++DVKSASA + KKF G+
Sbjct: 62 DIVEAMLIYLAKEAGHSVTEMQSEIEIDDIKGHIDCMIDGELVDVKSASAFSMKKFKNGT 121
Query: 154 LVFDDPFGYMHQIAAYSEAVEGNKGSGFLAMNKVDGKLTLF 194
L DD FGY+ QI+ Y+ A G K FLA +K G+L +
Sbjct: 122 LPDDDAFGYISQISGYANAF-GKKSGTFLAFDKSGGELATY 161
>gb|AAL02212.1| (AF405601) DNA polymerase [uncultured organism]
Length = 182
Score = 126 bits (317), Expect = 2e-028
Identities = 72/161 (44%), Positives = 102/161 (62%), Gaps = 4/161 (2%)
Query: 37 GEAMKGLMRSRLTPRESSGGT-LRLSAIGKP-ARQLWYDSRGVEKPD-FTGDQLLKFFYG 93
G + MR +L +++ LR+S IGKP +R LWYD G E+ + FT LKF G
Sbjct: 2 GATVLDAMRKQLWVSQAAPNPKLRMSNIGKPCSRALWYDINGDEQAESFTPQTKLKFIVG 61
Query: 94 DVIEEVLLTLAKLSGHSVTNEQQKVVVAGITGHMDAVIDGHVIDVKSASATAFKKFDQGS 153
D++E +L+ LAK +GHSVT Q ++ + I GH+D +IDG ++DVKSASA + KKF G+
Sbjct: 62 DIVEAMLIYLAKEAGHSVTEMQAEIEIDDIKGHIDCMIDGGLVDVKSASAFSMKKFKNGT 121
Query: 154 LVFDDPFGYMHQIAAYSEAVEGNKGSGFLAMNKVDGKLTLF 194
L+ DD FGY+ QI+ Y+ A G K FLA +K G+L +
Sbjct: 122 LLDDDAFGYISQISGYANAF-GKKSGTFLAFDKSGGELATY 161
>gb|AAL02211.1| (AF405600) DNA polymerase [uncultured organism]
Length = 182
Score = 126 bits (316), Expect = 3e-028
Identities = 72/161 (44%), Positives = 101/161 (62%), Gaps = 4/161 (2%)
Query: 37 GEAMKGLMRSRLTPRESSGGT-LRLSAIGKP-ARQLWYDSRGVEKPD-FTGDQLLKFFYG 93
G + MR +L +++ LR+S IGKP +R LWYD G E+ + FT LKF G
Sbjct: 2 GATVLDAMRKQLWVSQAAPNPKLRMSNIGKPCSRALWYDINGDEQAESFTPQTKLKFIVG 61
Query: 94 DVIEEVLLTLAKLSGHSVTNEQQKVVVAGITGHMDAVIDGHVIDVKSASATAFKKFDQGS 153
D++E +L+ LAK +GHSVT Q ++ + I GH+D +IDG ++DVKSASA + KKF G+
Sbjct: 62 DIVEAMLIYLAKEAGHSVTEMQAEIEIDDIKGHIDCMIDGELVDVKSASAFSMKKFKNGT 121
Query: 154 LVFDDPFGYMHQIAAYSEAVEGNKGSGFLAMNKVDGKLTLF 194
L DD FGY+ QI+ Y+ A G K FLA +K G+L +
Sbjct: 122 LPDDDAFGYISQISGYANAF-GKKSGTFLAFDKSGGELATY 161
>gb|AAL02214.1| (AF405603) DNA polymerase [uncultured organism]
Length = 182
Score = 125 bits (313), Expect = 6e-028
Identities = 71/161 (44%), Positives = 100/161 (62%), Gaps = 4/161 (2%)
Query: 37 GEAMKGLMRSRLTPRESSGGT-LRLSAIGKP-ARQLWYDSRGVEKPD-FTGDQLLKFFYG 93
G + MR +L +++ LR+S IGKP +R LWYD G E+ + FT LKF G
Sbjct: 2 GATVLDAMRKQLWVSQAAPNPKLRMSNIGKPCSRALWYDINGDEQAESFTPQTKLKFIVG 61
Query: 94 DVIEEVLLTLAKLSGHSVTNEQQKVVVAGITGHMDAVIDGHVIDVKSASATAFKKFDQGS 153
D++E +L+ LAK +GHSVT Q ++ + I GH+D +IDG ++DVKS SA + KKF G+
Sbjct: 62 DIVEAMLIYLAKEAGHSVTEMQSEIEIDDIKGHIDCMIDGELVDVKSTSAFSMKKFKNGT 121
Query: 154 LVFDDPFGYMHQIAAYSEAVEGNKGSGFLAMNKVDGKLTLF 194
L DD FGY+ QI+ Y+ A G K FLA +K G+L +
Sbjct: 122 LPDDDAFGYISQISGYANAF-GKKSGTFLAFDKSGGELATY 161
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.316 0.136 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 170,639,081
Number of Sequences: 891141
Number of extensions: 7630392
Number of successful extensions: 16603
Number of sequences better than 5.0e-02: 7
Number of HSP's better than 0.1 without gapping: 7
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 16589
Number of HSP's gapped (non-prelim): 7
length of query: 287
length of database: 279,076,897
effective HSP length: 119
effective length of query: 168
effective length of database: 173,031,118
effective search space: 29069227824
effective search space used: 29069227824
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 90 (39.3 bits)
Query= orf30 [6372-6507]
(134 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064756.1| (NC_002519) RP Endonuclease I [Roseophage ... 275 4e-074
ref|NP_052083.1| (NC_001271) endonuclease [Bacteriophage ph... 128 1e-029
ref|NP_041972.1| (NC_001604) gene 3, endonuclease [Enteroba... 125 5e-029
pdb|1FZR|A Chain A, Crystal Structure Of Bacteriophage T7 E... 124 2e-028
ref|NP_523312.1| (NC_003298) endonuclease [Bacteriophage T3... 118 8e-027
ref|NP_570326.1| (NC_003390) endonuclease A [Cyanophage P60... 64 3e-010
ref|NP_570327.1| (NC_003390) endonuclease B [Cyanophage P60... 37 0.042
>ref|NP_064756.1| (NC_002519) RP Endonuclease I [Roseophage SIO1]
gb|AAG02601.1|AF189021_20 (AF189021) Roseophage SIO1 complete genome
Length = 134
Score = 275 bits (704), Expect = 4e-074
Identities = 134/134 (100%), Positives = 134/134 (100%)
Query: 1 MLNSKSSTRKRALKAGYRSGLEEQTAKDLKKRKVLFTYEETKIKWLDSKVRTYTPDFVLP 60
MLNSKSSTRKRALKAGYRSGLEEQTAKDLKKRKVLFTYEETKIKWLDSKVRTYTPDFVLP
Sbjct: 1 MLNSKSSTRKRALKAGYRSGLEEQTAKDLKKRKVLFTYEETKIKWLDSKVRTYTPDFVLP 60
Query: 61 NGVIIETKGRFVAADRRKHLEIQKQFGTLYDIRFVFTNSKAKLYKGAKSSYADWCNKHGF 120
NGVIIETKGRFVAADRRKHLEIQKQFGTLYDIRFVFTNSKAKLYKGAKSSYADWCNKHGF
Sbjct: 61 NGVIIETKGRFVAADRRKHLEIQKQFGTLYDIRFVFTNSKAKLYKGAKSSYADWCNKHGF 120
Query: 121 LYADKTIPEDWLNE 134
LYADKTIPEDWLNE
Sbjct: 121 LYADKTIPEDWLNE 134
>ref|NP_052083.1| (NC_001271) endonuclease [Bacteriophage phiYeO3-12]
emb|CAB63604.1| (AJ251805) endonuclease [Bacteriophage phiYeO3-12]
Length = 153
Score = 128 bits (321), Expect = 1e-029
Identities = 61/119 (51%), Positives = 88/119 (73%), Gaps = 2/119 (1%)
Query: 17 YRSGLEEQTAKDLKKRKVLFTYEETKIKW-LDSKVRTYTPDFVLPNGVIIETKGRFVAAD 75
+RSGLE++ +K L+ + + F YE +I + + + YTPDF+LPNG+ IETKG + + D
Sbjct: 16 FRSGLEDKVSKQLESKGIKFDYELWRIPYVIPASDHLYTPDFLLPNGIFIETKGLWDSDD 75
Query: 76 RRKHLEIQKQFGTLYDIRFVFTNSKAKLYKGAKSSYADWCNKHGFLYADKTIPEDWLNE 134
R+KHL I++Q L DIR VF++S++KLYKG+ +SYA+WC KHG L+ADK IP +WL E
Sbjct: 76 RKKHLLIREQHPEL-DIRLVFSSSRSKLYKGSPTSYAEWCEKHGILFADKLIPVEWLKE 133
>ref|NP_041972.1| (NC_001604) gene 3, endonuclease [Enterobacteria phage T7]
sp|P00641|ENRN_BPT7 ENDODEOXYRIBONUCLEASE I (ENDONUCLEASE)
pir||NEBP37 endodeoxyribonuclease I (EC 3.1.21.-) - phage T7
emb|CAA24345.1| (V01127) 3 protein [Enterobacteria phage T7]
emb|CAA24402.1| (V01146) gene 3, endonuclease [Enterobacteria phage T7]
Length = 149
Score = 125 bits (315), Expect = 5e-029
Identities = 58/119 (48%), Positives = 87/119 (72%), Gaps = 2/119 (1%)
Query: 17 YRSGLEEQTAKDLKKRKVLFTYEETKIKW-LDSKVRTYTPDFVLPNGVIIETKGRFVAAD 75
+RSGLE++ +K L+ + + F YEE K+ + + + TYTPDF+LPNG+ +ETKG + + D
Sbjct: 15 FRSGLEDKVSKQLESKGIKFEYEEWKVPYVIPASNHTYTPDFLLPNGIFVETKGLWESDD 74
Query: 76 RRKHLEIQKQFGTLYDIRFVFTNSKAKLYKGAKSSYADWCNKHGFLYADKTIPEDWLNE 134
R+KHL I++Q L DIR VF++S+ KLYKG+ +SY ++C KHG +ADK IP +W+ E
Sbjct: 75 RKKHLLIREQHPEL-DIRIVFSSSRTKLYKGSPTSYGEFCEKHGIKFADKLIPAEWIKE 132
>pdb|1FZR|A Chain A, Crystal Structure Of Bacteriophage T7 Endonuclease I
pdb|1FZR|B Chain B, Crystal Structure Of Bacteriophage T7 Endonuclease I
pdb|1FZR|C Chain C, Crystal Structure Of Bacteriophage T7 Endonuclease I
pdb|1FZR|D Chain D, Crystal Structure Of Bacteriophage T7 Endonuclease I
Length = 138
Score = 124 bits (311), Expect = 2e-028
Identities = 57/119 (47%), Positives = 87/119 (72%), Gaps = 2/119 (1%)
Query: 17 YRSGLEEQTAKDLKKRKVLFTYEETKIKW-LDSKVRTYTPDFVLPNGVIIETKGRFVAAD 75
+RSGLE++ +K L+ + + F YEE K+ + + + TYTPDF+LPNG+ ++TKG + + D
Sbjct: 4 FRSGLEDKVSKQLESKGIKFEYEEWKVPYVIPASNHTYTPDFLLPNGIFVKTKGLWESDD 63
Query: 76 RRKHLEIQKQFGTLYDIRFVFTNSKAKLYKGAKSSYADWCNKHGFLYADKTIPEDWLNE 134
R+KHL I++Q L DIR VF++S+ KLYKG+ +SY ++C KHG +ADK IP +W+ E
Sbjct: 64 RKKHLLIREQHPEL-DIRIVFSSSRTKLYKGSPTSYGEFCEKHGIKFADKLIPAEWIKE 121
>ref|NP_523312.1| (NC_003298) endonuclease [Bacteriophage T3]
sp|P20314|ENRN_BPT3 ENDODEOXYRIBONUCLEASE I (ENDONUCLEASE)
pir||S07505 endodeoxyribonuclease I (EC 3.1.21.-) - phage T3
emb|CAA35132.1| (X17255) 3 [Bacteriophage T3]
emb|CAC86275.1| (AJ318471) endonuclease [Bacteriophage T3]
Length = 152
Score = 118 bits (296), Expect = 8e-027
Identities = 57/113 (50%), Positives = 82/113 (72%), Gaps = 2/113 (1%)
Query: 17 YRSGLEEQTAKDLKKRKVLFTYEETKIKW-LDSKVRTYTPDFVLPNGVIIETKGRFVAAD 75
+RSGLE++ +K L+ + + F YE +I + + YTPDF+LPNG+ IETKG + + D
Sbjct: 15 FRSGLEDKVSKQLESKGIKFDYELWRIPYVIPESDHLYTPDFLLPNGIFIETKGLWDSDD 74
Query: 76 RRKHLEIQKQFGTLYDIRFVFTNSKAKLYKGAKSSYADWCNKHGFLYADKTIP 128
R+KHL I++Q L DIR VF++S++KLYKG+ +SY +WC KHG L+ADK IP
Sbjct: 75 RKKHLLIREQHPEL-DIRLVFSSSRSKLYKGSPTSYGEWCEKHGILFADKLIP 126
>ref|NP_570326.1| (NC_003390) endonuclease A [Cyanophage P60]
gb|AAL73264.1| (AF338467) endonuclease A [Cyanophage P60]
Length = 66
Score = 63.5 bits (153), Expect = 3e-010
Identities = 33/62 (53%), Positives = 41/62 (65%), Gaps = 2/62 (3%)
Query: 17 YRSGLEEQTAKDLKKRKVLFTYEETKIKWLDSKVRTYTPDFVLPNGVIIETKGRFVAADR 76
YRSGLEE+ +K L + V + YE K ++ S YTPDF LPNGVI+E KG F +DR
Sbjct: 3 YRSGLEEKVSKWLDRENVPYLYEVQKFDYVTSS--KYTPDFFLPNGVILEVKGFFKPSDR 60
Query: 77 RK 78
RK
Sbjct: 61 RK 62
>ref|NP_570327.1| (NC_003390) endonuclease B [Cyanophage P60]
gb|AAL73265.1| (AF338467) endonuclease B [Cyanophage P60]
Length = 54
Score = 36.6 bits (83), Expect = 0.042
Identities = 18/43 (41%), Positives = 25/43 (57%), Gaps = 2/43 (4%)
Query: 91 DIRFVFTNSKAKLYKGAKSSYADWCNKHGFLYA-DKTIPEDWL 132
D+RFVF L K +K++Y+ W KHGF + IP+ WL
Sbjct: 12 DLRFVFQRDNT-LTKASKTTYSMWAEKHGFPWCIYPNIPQQWL 53
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.318 0.134 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 72,438,750
Number of Sequences: 891141
Number of extensions: 2676979
Number of successful extensions: 6775
Number of sequences better than 5.0e-02: 7
Number of HSP's better than 0.1 without gapping: 6
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 6760
Number of HSP's gapped (non-prelim): 7
length of query: 134
length of database: 279,076,897
effective HSP length: 110
effective length of query: 24
effective length of database: 181,051,387
effective search space: 4345233288
effective search space used: 4345233288
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 83 (36.6 bits)
Query= orf31 [6505-6567]
(61 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.320 0.143 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 30,539,836
Number of Sequences: 891141
Number of extensions: 902909
Number of successful extensions: 2291
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2291
Number of HSP's gapped (non-prelim): 0
length of query: 61
length of database: 279,076,897
effective HSP length: 37
effective length of query: 24
effective length of database: 246,104,680
effective search space: 5906512320
effective search space used: 5906512320
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf32 [6564-6666]
(101 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064757.1| (NC_002519) gp19 [Roseophage SIO1] >gi|994... 152 5e-037
ref|NP_039701.1| (NC_001335) Hypothetical protein [Mycobact... 103 3e-022
ref|NP_212894.1| (NC_001318) B. burgdorferi predicted codin... 83 6e-016
ref|NP_267982.1| (NC_002662) HYPOTHETICAL PROTEIN [Lactococ... 79 7e-015
ref|NP_203440.1| (NC_003085) p26 [Bacteriophage Mx8] >gi|15... 59 1e-008
ref|NP_465673.1| (NC_003210) similar to unknown proteins [L... 55 1e-007
ref|NP_559109.1| (NC_003364) hypothetical protein [Pyrobacu... 37 0.038
>ref|NP_064757.1| (NC_002519) gp19 [Roseophage SIO1]
gb|AAG02602.1|AF189021_21 (AF189021) Roseophage SIO1 complete genome
Length = 77
Score = 152 bits (385), Expect = 5e-037
Identities = 77/77 (100%), Positives = 77/77 (100%)
Query: 25 MGLANEAGEVLGKVKKIIRDGTFNRDDIADELGDVLWYAAALARDLNTDLSAIAQRNLDK 84
MGLANEAGEVLGKVKKIIRDGTFNRDDIADELGDVLWYAAALARDLNTDLSAIAQRNLDK
Sbjct: 1 MGLANEAGEVLGKVKKIIRDGTFNRDDIADELGDVLWYAAALARDLNTDLSAIAQRNLDK 60
Query: 85 LASRKERGTLKGSGDKR 101
LASRKERGTLKGSGDKR
Sbjct: 61 LASRKERGTLKGSGDKR 77
>ref|NP_039701.1| (NC_001335) Hypothetical protein [Mycobacterium phage L5]
sp|Q05247|VG37_BPML5 GENE 37 PROTEIN (GP37)
pir||S30982 gene 37 protein - Mycobacterium phage L5
emb|CAA79413.1| (Z18946) predicted 12.5Kd protein [Mycobacterium phage L5]
Length = 114
Score = 103 bits (257), Expect = 3e-022
Identities = 57/109 (52%), Positives = 70/109 (63%), Gaps = 11/109 (10%)
Query: 4 YQQQAVETAIYPSTAQ------VTYPAMGLANEAGEVLGKVKKIIRD--GTF---NRDDI 52
YQ A +TAIYP ++Y MGL EAGE+ KVKKI+RD G NRDD+
Sbjct: 6 YQIAASDTAIYPGAGDCDSIEGLSYVTMGLVGEAGEIANKVKKILRDKDGVITQENRDDL 65
Query: 53 ADELGDVLWYAAALARDLNTDLSAIAQRNLDKLASRKERGTLKGSGDKR 101
ELGDV+WY A LA L+ LS + QRNL+KL SRK+RG ++GSGD R
Sbjct: 66 QAELGDVMWYVAQLANQLDVWLSTVTQRNLNKLQSRKDRGVIQGSGDNR 114
>ref|NP_212894.1| (NC_001318) B. burgdorferi predicted coding region BB0760 [Borrelia
burgdorferi]
pir||G70194 hypothetical protein BB0760 - Lyme disease spirochete
gb|AAC67101.1| (AE001175) B. burgdorferi predicted coding region BB0760 [Borrelia
burgdorferi]
Length = 124
Score = 82.8 bits (203), Expect = 6e-016
Identities = 45/107 (42%), Positives = 65/107 (60%), Gaps = 6/107 (5%)
Query: 1 MNDYQQQAVETAIYPSTAQ-VTYPAMGLANEAGEVLGKVKKIIRDGTFNRDD-----IAD 54
+N+YQ++A +TA Y + + + +GLA E GEV+ K+KK+ RD + DD I
Sbjct: 18 LNEYQEKAKKTAKYKNKKEELILTTLGLAGETGEVVEKIKKLGRDKNYIIDDEYLISIKK 77
Query: 55 ELGDVLWYAAALARDLNTDLSAIAQRNLDKLASRKERGTLKGSGDKR 101
ELGDVLWY ++L+ +L L +A NL K+ R E GT+ G GD R
Sbjct: 78 ELGDVLWYLSSLSNNLGITLEDVALTNLKKIQKRHENGTINGEGDDR 124
>ref|NP_267982.1| (NC_002662) HYPOTHETICAL PROTEIN [Lactococcus lactis subsp. lactis]
gb|AAK05923.1|AE006412_10 (AE006412) HYPOTHETICAL PROTEIN [Lactococcus lactis subsp. lactis]
Length = 115
Score = 79.3 bits (194), Expect = 7e-015
Identities = 43/82 (52%), Positives = 53/82 (64%), Gaps = 5/82 (6%)
Query: 25 MGLANEAGEVLGKVKKIIRD--GTFNRDD---IADELGDVLWYAAALARDLNTDLSAIAQ 79
+GL+ EAGEV KVKK+IRD G + +D I ELGD+LWY A AR L L IA
Sbjct: 34 LGLSGEAGEVADKVKKVIRDQKGKLSANDKKAIGQELGDILWYVATSARYLGISLEQIAS 93
Query: 80 RNLDKLASRKERGTLKGSGDKR 101
N++KL SR RG + GSGD+R
Sbjct: 94 ENIEKLESRLSRGKISGSGDER 115
>ref|NP_203440.1| (NC_003085) p26 [Bacteriophage Mx8]
gb|AAK94361.1|AF396866_26 (AF396866) p26 [Bacteriophage Mx8]
Length = 124
Score = 58.5 bits (140), Expect = 1e-008
Identities = 36/96 (37%), Positives = 50/96 (51%), Gaps = 7/96 (7%)
Query: 4 YQQQAVETAIYPSTAQ------VTYPAMGLANEAGEVLGKVKKIIRDG-TFNRDDIADEL 56
YQQ A+ TA ++ + +GL EAGEV VKK + G + + + EL
Sbjct: 11 YQQAALRTANIGGPSEPFNHLKLESDCLGLTGEAGEVADHVKKFVGHGHDLDVEKVKKEL 70
Query: 57 GDVLWYAAALARDLNTDLSAIAQRNLDKLASRKERG 92
GDVLWY A +A L+ DL +A N++KL R G
Sbjct: 71 GDVLWYVAVIAARLDLDLGDVAAANVEKLRKRYPDG 106
>ref|NP_465673.1| (NC_003210) similar to unknown proteins [Listeria monocytogenes
EGD-e]
emb|CAD00227.1| (AL591982) similar to unknown proteins [Listeria monocytogenes]
Length = 105
Score = 55.1 bits (131), Expect = 1e-007
Identities = 31/91 (34%), Positives = 49/91 (53%), Gaps = 2/91 (2%)
Query: 3 DYQQQAVETAIYPSTAQVTYPAMGLANEAGEVLGKVKKIIRDG-TFNRDDIADELGDVLW 61
+YQ A TA A Y +G+ EAGEV +KK G +++ + ELGDVLW
Sbjct: 5 EYQILANRTAATHEQALTNY-GLGITGEAGEVADLIKKYAFHGHDLDKESLTKELGDVLW 63
Query: 62 YAAALARDLNTDLSAIAQRNLDKLASRKERG 92
Y + +A+ + + +A+ N++KL R +G
Sbjct: 64 YVSQIAKWADISMETVAELNIEKLKRRYPQG 94
>ref|NP_559109.1| (NC_003364) hypothetical protein [Pyrobaculum aerophilum]
gb|AAL63291.1| (AE009807) hypothetical protein [Pyrobaculum aerophilum]
Length = 112
Score = 37.0 bits (84), Expect = 0.038
Identities = 25/79 (31%), Positives = 35/79 (43%), Gaps = 9/79 (11%)
Query: 19 QVTYPAMGLANEAGEVLGKVKKIIRDGTFNRDDI---------ADELGDVLWYAAALARD 69
++ Y LA E GE VKK++R + R D+ A+EL DV Y +A
Sbjct: 33 RLEYFTNALAGEVGEAANIVKKVVRSVVYGRGDVKLGDVKEALAEELTDVFIYVITMAGL 92
Query: 70 LNTDLSAIAQRNLDKLASR 88
L DL + L+K R
Sbjct: 93 LELDLEREFFKKLEKNKQR 111
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.313 0.131 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 49,912,023
Number of Sequences: 891141
Number of extensions: 1721804
Number of successful extensions: 3061
Number of sequences better than 5.0e-02: 8
Number of HSP's better than 0.1 without gapping: 7
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 3047
Number of HSP's gapped (non-prelim): 8
length of query: 101
length of database: 279,076,897
effective HSP length: 77
effective length of query: 24
effective length of database: 210,459,040
effective search space: 5051016960
effective search space used: 5051016960
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 83 (36.6 bits)
Query= orf33 [6665-6726]
(60 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_532351.1| (NC_003304) conserved hypothetical protein... 42 0.002
ref|NP_539989.1| (NC_003317) Hypothetical Protein [Brucella... 40 0.007
ref|NP_104800.1| (NC_002678) unknown protein [Mesorhizobium... 39 0.015
ref|NP_102735.1| (NC_002678) unknown protein [Mesorhizobium... 38 0.026
ref|NP_385651.1| (NC_003047) CONSERVED HYPOTHETICAL PROTEIN... 38 0.026
>ref|NP_532351.1| (NC_003304) conserved hypothetical protein [Agrobacterium
tumefaciens str. C58 (U. Washington)]
gb|AAL42667.1| (AE009123) conserved hypothetical protein [Agrobacterium
tumefaciens str. C58 (U. Washington)]
Length = 50
Score = 41.6 bits (96), Expect = 0.002
Identities = 17/35 (48%), Positives = 25/35 (70%)
Query: 11 LATWREHRKVIKQLNKFSDRELRDIGITRGDIDRL 45
L WR++R+ + +L + SDREL D+GI R DI R+
Sbjct: 9 LTNWRKYRQTVTELGRMSDRELNDLGIGRQDIRRV 43
>ref|NP_539989.1| (NC_003317) Hypothetical Protein [Brucella melitensis]
gb|AAL52253.1| (AE009547) Hypothetical Protein [Brucella melitensis]
Length = 48
Score = 39.7 bits (91), Expect = 0.007
Identities = 15/29 (51%), Positives = 22/29 (75%)
Query: 14 WREHRKVIKQLNKFSDRELRDIGITRGDI 42
WR +R+ + +LN+ S REL D+GI+R DI
Sbjct: 10 WRRYRETVNELNQLSTRELNDLGISRADI 38
>ref|NP_104800.1| (NC_002678) unknown protein [Mesorhizobium loti]
dbj|BAB50586.1| (AP003002) unknown protein [Mesorhizobium loti]
Length = 59
Score = 38.5 bits (88), Expect = 0.015
Identities = 14/38 (36%), Positives = 25/38 (64%)
Query: 8 INYLATWREHRKVIKQLNKFSDRELRDIGITRGDIDRL 45
I WR + + +++LN+ + R+L D+GI R DI+R+
Sbjct: 17 IRAFRNWRMYNETVRELNRLNARQLSDLGINRADIERI 54
>ref|NP_102735.1| (NC_002678) unknown protein [Mesorhizobium loti]
dbj|BAB48521.1| (AP002996) unknown protein [Mesorhizobium loti]
Length = 69
Score = 37.7 bits (86), Expect = 0.026
Identities = 14/29 (48%), Positives = 22/29 (75%)
Query: 14 WREHRKVIKQLNKFSDRELRDIGITRGDI 42
WR +R + +L++ S+REL D+GI+R DI
Sbjct: 33 WRRYRDTVSELSRLSNRELTDLGISRSDI 61
>ref|NP_385651.1| (NC_003047) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium
meliloti]
emb|CAC46124.1| (AL591787) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium
meliloti]
Length = 47
Score = 37.7 bits (86), Expect = 0.026
Identities = 15/29 (51%), Positives = 21/29 (71%)
Query: 14 WREHRKVIKQLNKFSDRELRDIGITRGDI 42
WR++R+ +L + SDREL D+GI R DI
Sbjct: 10 WRKYRQTCNELGRMSDRELTDLGIGRADI 38
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.323 0.139 5.49e-269
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 36,012,391
Number of Sequences: 891141
Number of extensions: 1070184
Number of successful extensions: 2600
Number of sequences better than 5.0e-02: 5
Number of HSP's better than 0.1 without gapping: 5
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2595
Number of HSP's gapped (non-prelim): 5
length of query: 60
length of database: 279,076,897
effective HSP length: 36
effective length of query: 24
effective length of database: 246,995,821
effective search space: 5927899704
effective search space used: 5927899704
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 84 (37.0 bits)
Query= orf34 [6724-7396]
(671 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064758.1| (NC_002519) RP ribonucleotide reductase [R... 1333 0.0
gb|AAA03078.1| (L20047) ribonucleoside triphosphate reducta... 150 3e-035
ref|NP_046865.1| (NC_001900) putative ribonucleotide reduct... 134 4e-030
ref|NP_039714.1| (NC_001335) Hypothetical protein [Mycobact... 133 7e-030
ref|NP_488075.1| (NC_003272) unknown protein [Nostoc sp. PC... 76 1e-012
ref|NP_570337.1| (NC_003390) ribonucleoside triphosphate re... 71 4e-011
ref|NP_215084.1| (NC_000962) nrdZ [Mycobacterium tuberculos... 52 3e-005
ref|NP_335005.1| (NC_002755) ribonucleoside-diphosphate red... 52 3e-005
ref|NP_070492.1| (NC_000917) ribonucleotide reductase (nrd)... 51 5e-005
ref|NP_110611.1| (NC_002689) Ribonucleotide reductase alpha... 43 0.009
>ref|NP_064758.1| (NC_002519) RP ribonucleotide reductase [Roseophage SIO1]
gb|AAG02603.1|AF189021_22 (AF189021) Roseophage SIO1 complete genome
Length = 671
Score = 1333 bits (3450), Expect = 0.0
Identities = 657/671 (97%), Positives = 657/671 (97%)
Query: 1 MNNYQEVSTRAEVVTRRTYNRPLNDEGTVFETWEQTVDRVIDHQRWLWERQQRAELNEKQ 60
MNNYQEVSTRAEVVTRRTYNRPLNDEGTVFETWEQTVDRVIDHQRWLWERQQRAELNEKQ
Sbjct: 1 MNNYQEVSTRAEVVTRRTYNRPLNDEGTVFETWEQTVDRVIDHQRWLWERQQRAELNEKQ 60
Query: 61 SAELTELRQLMLDRKATTSGRTLWLGGTDVAKKREASQFNCSFGNVETVHDVVDAQHLLL 120
SAELTELRQLMLDRKATTSGRTLWLGGTDVAKKREASQFNCSFGNVETVHDVVDAQHLLL
Sbjct: 61 SAELTELRQLMLDRKATTSGRTLWLGGTDVAKKREASQFNCSFGNVETVHDVVDAQHLLL 120
Query: 121 QGCGVGFYPSVGILSGFTAPVEVQVVRSEKQKSDPKGAEDNVEAFYQEDGKSVWEVVVGD 180
QGCGVGFYPSVGILSGFTAPVEVQVVRSEKQKSDPKGAEDNVEAFYQEDGKSVWEVVVGD
Sbjct: 121 QGCGVGFYPSVGILSGFTAPVEVQVVRSEKQKSDPKGAEDNVEAFYQEDGKSVWEVVVGD 180
Query: 181 SAEAWAKSFGKLVAMKKRVDIIRLNYREIRAAGMRLKGYGWISSGDSDLSVAQTAICGIL 240
SAEAWAKSFGKLVAMKKRVDIIRLNYREIRAAGMRLKGYGWISSGDSDLSVAQTAICGIL
Sbjct: 181 SAEAWAKSFGKLVAMKKRVDIIRLNYREIRAAGMRLKGYGWISSGDSDLSVAQTAICGIL 240
Query: 241 NDRAGQLLRHMDILDLLNHMGTVLSSRRSAEIAVLPVTNPEIDAFISAKKDFWLHGNEHR 300
NDRAGQLLRHMDILDLLNHMGTVLSSRRSAEIAVLPVTNPEIDAFISAKKDFWLHGNEHR
Sbjct: 241 NDRAGQLLRHMDILDLLNHMGTVLSSRRSAEIAVLPVTNPEIDAFISAKKDFWLHGNEHR 300
Query: 301 QQSNNSIMFDTKPTKWELSYIFDKIVEAGGSEPGFINAEAARKRAPWFKGVNPCAEILLG 360
QQSNNSIMFDTKPTKWELSYIFDKIVEAGGSEPGFINAEAARKRAPWFKGVNPCAEILLG
Sbjct: 301 QQSNNSIMFDTKPTKWELSYIFDKIVEAGGSEPGFINAEAARKRAPWFKGVNPCAEILLG 360
Query: 361 NKSFCNLVEVDWGKFLGDHGGLERAVWLAARANYRQTCVNLDDGVLQRSWHELNEFLRLC 420
NKSFCNLVEVDWGKFLGDHGGLERAVWLAARANYRQTCVNLDDGVLQRSWHELNEFLRLC
Sbjct: 361 NKSFCNLVEVDWGKFLGDHGGLERAVWLAARANYRQTCVNLDDGVLQRSWHELNEFLRLC 420
Query: 421 GVGGTGIVKFLDHYTGRNNVASMLQSLRAQAHKGANSMAEEFGTPMPKAVTTVKPSGTLS 480
GVGGTGIVKFLDHYTGRNNVASMLQSLRAQAHKGANSMAEEFGTPMPKAVTTVKPSGTLS
Sbjct: 421 GVGGTGIVKFLDHYTGRNNVASMLQSLRAQAHKGANSMAEEFGTPMPKAVTTVKPSGTLS 480
Query: 481 KIMDTTEGVHKPLGKYIINNITFSKDDPIIASLEAAGYKVFDKPFEPSSKLVAMPVCYDD 540
KIMDTTEGVHKPLGKYIINNITFSKDDPIIASLEAAGYKVFDKPFEPSSKLVAMPVCYDD
Sbjct: 481 KIMDTTEGVHKPLGKYIINNITFSKDDPIIASLEAAGYKVFDKPFEPSSKLVAMPVCYDD 540
Query: 541 VSFDEVETDRGTVHVNLESAVEQLDRYKLLMDNYVDHNCSVTISYDPSEVPAIIDWILTN 600
VSFDEVETDRGTVHVNLESAVEQLDRYKLLMDNYVDHNCSVTISYDPSEVPAIIDWILTN
Sbjct: 541 VSFDEVETDRGTVHVNLESAVEQLDRYKLLMDNYVDHNCSVTISYDPSEVPAIIDWILTN 600
Query: 601 WDTYVGVSFIYRNDPTKTAEDLGYAYLPQEVITEEAYVAYTSQLKPXXXXXXXXXXXXXX 660
WDTYVGVSFIYRNDPTKTAEDLGYAYLPQEVITEEAYVAYTSQLKP
Sbjct: 601 WDTYVGVSFIYRNDPTKTAEDLGYAYLPQEVITEEAYVAYTSQLKPLDLSGLTSDDDLVD 660
Query: 661 EACATGACPIR 671
EACATGACPIR
Sbjct: 661 EACATGACPIR 671
>gb|AAA03078.1| (L20047) ribonucleoside triphosphate reductase [Lactobacillus
leichmannii]
Length = 739
Score = 150 bits (380), Expect = 3e-035
Identities = 198/748 (26%), Positives = 310/748 (40%), Gaps = 126/748 (16%)
Query: 13 VVTRRTYNRPLNDEGTVFETWEQTVDRVIDHQRWLWERQQRA---ELNEKQSAELTELRQ 69
V +RTY R L ++G E W++TV RV++ L R Q + EL + + E L +
Sbjct: 29 VTYKRTYARWLPEKGRS-ENWDETVKRVVEGNINLDPRLQDSPSLELKQSLTEEAERLYK 87
Query: 70 LMLDRKATTSGRTLWLGGTDVAKKREASQFNCSF--------GNVETVHDVVDAQHL--- 118
L+ AT SGR LW+ GTD ++ S NC F G+ + V + Q
Sbjct: 88 LIYGLGATPSGRNLWISGTDYQRRTGDSLNNCWFVAIRPQKYGDSKIVPSYLGKQEKAVS 147
Query: 119 ---------LLQGCGVGFYPSVGILSGFTA---PVEVQVVRSEKQKSDPKGAEDNVEAFY 166
L++G GVGF + +S +++Q+V E +S +
Sbjct: 148 MPFSCLFDELMKGGGVGFSVARSNISQIPRVDFAIDLQLVVDETSESYDASVKVGAVGKN 207
Query: 167 Q--EDGKSVWEVVVGDSAEAWAKSFGKLVAMK------KRVDIIRLNYREIRAAGMRLKG 218
+ +D S++ + D+ E W + L+ + R + L+ +IR G + G
Sbjct: 208 ELVQDADSIY-YRLPDTREGWVLANALLIDLHFAQTNPDRKQKLILDLSDIRPYGAEIHG 266
Query: 219 YGWISSGDSDLSVAQTAICGILNDRAGQLLRHMDILDLLNHMGTVLSS---RRSAEIAVL 275
+G +SG L + +LN++AG L +D D+ N +G + + RRSAE+A+
Sbjct: 267 FGGTASGPMPLISMLLDVNEVLNNKAGGRLTAVDAADICNLIGKAVVAGNVRRSAELAL- 325
Query: 276 PVTNPEIDAFISAKKDF--WLHGNEHRQQSNNSIMFDTKPTKWELSYIFDKIVEAGGSEP 333
+N + D FIS K+D +H HR SNNS+ D+ + ++ I I E G EP
Sbjct: 326 -GSNDDQD-FISMKQDQEKLMH---HRWASNNSVAVDSAFSGYQP--IAAGIRENG--EP 376
Query: 334 GFINAEAARKRAPWF-----------KGVNPCAEILLGNKSFCNLVEVDWGKFLGDHGG- 381
G +N + ++ +G NPC EI L N CNL EV + + G
Sbjct: 377 GIVNLDLSKNYGRIVDGYQAGIDGDVEGTNPCGEISLANGEPCNLFEVF--PLIAEEQGW 434
Query: 382 -LERAVWLAARANYRQTCVNLDDGVLQRSWHELNEFLRLCGVGGTGIVKFLDHYTGRNNV 440
L+ LAAR R T D + + E+ + R G+ +GI +L G V
Sbjct: 435 DLQEVFALAARYAKRVTFSPYDWEISR----EIIQKNRRIGISMSGIQDWLLTRLGNRVV 490
Query: 441 ASMLQSLRAQAH---------KGANSMAEEFGTPMPKAV---------------TTVKPS 476
+ H K A M ++ + KA TTVKPS
Sbjct: 491 TGFKDDFDPETHEAIKVPVYDKRAIKMVDQLYKAVVKADQDYSKTLGCNESIKHTTVKPS 550
Query: 477 GTLSKIMDTTEGVHKPLGKYIINNITFSKDDPIIASLEAAGYKVFDKPFEPSSKLVAMP- 535
GT++K+ +EG+H G Y+I I F DP++ +L+A GY+ + ++ V P
Sbjct: 551 GTVAKLAGASEGMHFHYGAYLIQRIRFQDSDPLLPALKACGYRTEADIYTENTTCVEFPI 610
Query: 536 --VCYDDVSFDEVETDRGTVHVNLESAVEQLDRYKLLMDNYVDHNCSVTISYDPSEVPAI 593
V D+ +F GTV S EQ L + D+ S TI++ SE +
Sbjct: 611 KAVGADNPNF----ASAGTV-----SIAEQFATQAFLQTYWSDNAVSCTITFQDSEGDQV 661
Query: 594 IDWI--------LTNWDTYVGVSFIYRNDPTKTAEDLGYAYLPQEVI--TEEAYVAYTSQ 643
+ T+ Y G S + P + + Y QE+ EE + S
Sbjct: 662 ESLLRQYRFITKSTSLLPYFGGSL--QQAPKEPIDKETYEKRSQEITGNVEEVFSQLNSD 719
Query: 644 LKPXXXXXXXXXXXXXXEACATGACPIR 671
+K C GACPI+
Sbjct: 720 VKDLELVDQTD--------CEGGACPIK 739
>ref|NP_046865.1| (NC_001900) putative ribonucleotide reductase; gp50
[Mycobacteriophage D29]
sp|O64240|VG50_BPMD2 PUTATIVE ADENOSYLCOBALAMIN-DEPENDENT RIBONUCLEOTIDE REDUCTASE
(GP50)
pir||G72805 probable ribonucleotide reductase protein - Mycobacterium phage D29
gb|AAC18490.1| (AF022214) putative ribonucleotide reductase; gp50
[Mycobacteriophage D29]
Length = 693
Score = 134 bits (336), Expect = 4e-030
Identities = 178/724 (24%), Positives = 280/724 (38%), Gaps = 107/724 (14%)
Query: 12 EVVTRRTYNRPLNDEGTVFETWEQTVDRVIDHQRWLWERQQRAELNEKQSAELTELRQLM 71
E+V RTY+R + +GT ETW +TV+RV+ L + + + E +L +LM
Sbjct: 13 ELVYNRTYSR-VKPDGTR-ETWPETVERVVSGNLALVDSRYQLP------GEREDLLRLM 64
Query: 72 LDRKATTSGRTLWLGGTDVAKKREASQFNCSF-GNVETVHDVVDAQHL-LLQGCGVGFYP 129
+ K +GR LW G K FNC G E D + + L++G GVG
Sbjct: 65 REFKILPAGRHLWASGV----KNAQHLFNCWVSGWTEKPSDHFEFTFMRLMEGGGVGANY 120
Query: 130 SVGILSGFT---APVEVQVVRSEKQKSDPKGAE-DNVEAFYQEDGKSVWEVVVGDSAEAW 185
S L+ + +EV +V E AE + + Y D + V+ DS E W
Sbjct: 121 SNRFLADYPHVKQELEVHIVCDEDHDDYADLAEAGQLSSRYDSDWVDAF--VIEDSREGW 178
Query: 186 AKSFGKLVAMKKRVDIIR----LNYREIRAAGMRLKGYGWISSGDSDLSVAQTAICGILN 241
A + L+ R D+ + +RAAG +LK +G +SG L+ T + IL+
Sbjct: 179 AAALVDLIDTHYRDDVAHKERVYDVSRVRAAGRKLKTFGGTASGPVPLAKMLTEVSEILS 238
Query: 242 DRA----------GQLLRHMDILDLLNHMGTVLSS---RRSAEIAVLPVTNPEIDAFISA 288
A G L +D +++ + + + + RRSA +A++ + +++ F +
Sbjct: 239 RCAREGEQYRFEDGGALTGLDAMEIDHAIAQCVVAGGVRRSARMAMMHWADWQVETFTNI 298
Query: 289 KKDFWLHGNEHRQQSNNSIMFDT--------KPTKWELSYIFDKIVEAG--GSEPGFINA 338
K+D H + + + P + + + E EPG ++
Sbjct: 299 KQDSGSHWTTNISVEVDDAFWSLAKAPVDPLNPRSTKAHRVLKALSEGAVRNGEPGMWDS 358
Query: 339 EAARKRAP-WFKGVNPCAEILLGNKSFCNLVEVDWGKFLGDHG-----GLERAVWLAARA 392
+ P NPC EI L CNL ++ F+ D G L RA L R
Sbjct: 359 SLSNVGEPNEVVCTNPCGEITLEPWEPCNLGHINLAAFVTDAGKTDYIDLIRAHRLMTRF 418
Query: 393 NYRQTCVNLDDGVLQRSWHELNEFLRLCGVGGTGIVKFL-------DHYTGRNNVASMLQ 445
R T V E+ + R GVG G+ +L G + L+
Sbjct: 419 LIRATF----SAVADPKSREVLDRNRRIGVGHLGVASYLALTGRRYSQAPGDKRFTAFLR 474
Query: 446 SLRAQAHKGANSMAEEFGTPMPKAVTTVKPSGTLSKIMDTTEGVHKPLGKYIINNITFS- 504
+ A+ + A + E P+P TV P+GT++K+ +EG+H +Y I I FS
Sbjct: 475 EMAAEVDRAAEEFSHELRIPVPVKKRTVAPTGTIAKMPGVSEGIHPIFSRYFIRRIRFSV 534
Query: 505 --KDDPIIASLEAA-GYKVFDKPFEPSSKLVAMPVCYDDVSFDEVETDRGTVHVNLESAV 561
D + AS AA GY V ++ S + + D + V G ++ +
Sbjct: 535 LDNDQFLTASQYAADGYHVEKDQYDKSGNTWVVEIPTKDTLVEAVAARFGRDAEDIVESA 594
Query: 562 EQLDRYKLL------MDNYVDHNCSVTISYDPSEVPAIIDWILTNWDTYVGVSFIYRNDP 615
+L ++LL + D+ S T + DP D Y GV
Sbjct: 595 NELTLHQLLAFQALYQTCWADNAVSFTANVDP--------------DAYEGVD------- 633
Query: 616 TKTAEDL--------GYAYLPQEVITEEAYVAYTSQLKPXXXXXXXXXXXXXXEACATGA 667
A DL G P+E + Y T Q E CA GA
Sbjct: 634 --VAADLQRFSGLIKGSTIFPEESFPQAPYERITKQ--QYEAAAIKAVADGVDEECANGA 689
Query: 668 CPIR 671
CPI+
Sbjct: 690 CPIK 693
>ref|NP_039714.1| (NC_001335) Hypothetical protein [Mycobacterium phage L5]
sp|Q05262|VG50_BPML5 PUTATIVE ADENOSYLCOBALAMIN-DEPENDENT RIBONUCLEOTIDE REDUCTASE
(GP50)
pir||S30995 gene 50 protein - Mycobacterium phage L5
emb|CAA79426.1| (Z18946) predicted 76.3Kd protein [Mycobacterium phage L5]
Length = 682
Score = 133 bits (334), Expect = 7e-030
Identities = 168/709 (23%), Positives = 287/709 (39%), Gaps = 87/709 (12%)
Query: 12 EVVTRRTYNRPLNDEGTVFETWEQTVDRVIDHQRWLWERQQRAELNEKQSAELTELRQLM 71
E+V RTY R D ETW +TV+RV+D L +R+ E E +L +LM
Sbjct: 12 ELVYNRTYARTKPDGSK--ETWPETVERVVDGNLALVDRRHMLE------GEREDLVRLM 63
Query: 72 LDRKATTSGRTLWLGGTDVAKKREASQFNCSF-GNVETVHDVVDAQHL-LLQGCGVGFYP 129
K +GR LW G K FNC G E + D + L++G GVG
Sbjct: 64 TKFKVLPAGRHLWASGV----KNAQHLFNCWVAGWPEKISDHFQFTFMRLMEGGGVGANY 119
Query: 130 SVGILSGFTA---PVEVQVVRSEKQKSDPKGAEDN--VEAFYQEDGKSVWEVVVGDSAEA 184
S L G+ P+ V++V + + D + +D + Y D + + DS E
Sbjct: 120 SNHYLEGYPEVVNPLRVEIV-CDPEHVDYQAMKDVGILSEHYSHDWAGAY--AIEDSREG 176
Query: 185 WAKSFGKLVAMKKRVDIIRL----NYREIRAAGMRLKGYGWISSGDSDLSVAQTAICGIL 240
WA + L+ R + + IR G +LK +G ++SG +V + I
Sbjct: 177 WAAALVDLIDTHYRPGTVHYQRVYDVSRIRPQGAKLKTFGGVASGPLPFAVMLQKVANIF 236
Query: 241 NDRAGQLLRHMDILDLLNHMGTVLSS---RRSAEIAVLPVTNPEIDAFISAKK------- 290
++ AG L + +++ + + + + RRSA ++++ + +ID FI+ K
Sbjct: 237 SEYAGITLDGIAAMEVDHAIAQCVVAGGVRRSARMSMMHWDDYQIDKFINIKASTGEHWT 296
Query: 291 ---------DFWLHGNEHRQQSNNSIMFDTKPTKWELSYIFDKIVEAG--GSEPGFINAE 339
+FW E + ++ + P + + + E EPG ++
Sbjct: 297 TNISVEVDDEFWRLAKEG--YGSFAVAREDMPKQKRAHRVLKALSEGAVRNGEPGMWDSS 354
Query: 340 AARKRAP-WFKGVNPCAEILLGNKSFCNLVEVDWGKFLG-----DHGGLERAVWLAARAN 393
+ P NPC EI L CNL ++ F+ D+ L RA L R
Sbjct: 355 LSNVGEPNRVVCTNPCGEITLEPWEPCNLGHINLAAFVTPAGKTDYLDLIRAHRLMTRFL 414
Query: 394 YRQTCVNLDDGVLQRSWHELNEFLRLCGVGGTGIVKFL-------DHYTGRNNVASMLQS 446
R T + D + E+ + R GVG G+ +L G L+
Sbjct: 415 IRATFSEVADPKSR----EVLDRNRRIGVGHLGVASYLALTGSRYSEAPGDKRFTQFLRE 470
Query: 447 LRAQAHKGANSMAEEFGTPMPKAVTTVKPSGTLSKIMDTTEGVHKPLGKYIINNITF--S 504
+ A+ A A + P+P T+ P+GT++K+ +EG+H +Y I + F +
Sbjct: 471 MAAEVDSEAERFAHDLRIPVPVKKRTIAPTGTVAKMPGVSEGIHPIFSRYFIRRVRFNLN 530
Query: 505 KDDPIIASLEAAGYKVFDKPFEPSSKLVAMPVCYDDV-SFDEVETDRGTVHVNLESAVEQ 563
D + L A GY++ F P++++V++P V + +E+ D V E + Q
Sbjct: 531 SDMAELRRLAAEGYEIEKDLFAPNTEVVSIPTKDTLVQAVEELGWDEEIVESADELTLHQ 590
Query: 564 LDRYKLLMDN-YVDHNCSVTISYDPSEVPAIIDWILTNWDTYVGVSFIYRNDPTKTAEDL 622
L ++ L + D+ S T + P A + + + + G+ + +
Sbjct: 591 LLAFQALYQMCWADNAVSFTANVHPDAYEA--EDVAADLKRFAGLI-----KGSTIFPES 643
Query: 623 GYAYLPQEVITEEAYVAYTSQLKPXXXXXXXXXXXXXXEACATGACPIR 671
+ P E IT++ Y + ++ E CA GACPI+
Sbjct: 644 SFPQAPYERITKQQYESAAAK----------AVEDGVDEECANGACPIK 682
>ref|NP_488075.1| (NC_003272) unknown protein [Nostoc sp. PCC 7120]
dbj|BAB75734.1| (AP003595) ORF_ID:all4035~unknown protein [Nostoc sp. PCC 7120]
Length = 1172
Score = 76.3 bits (186), Expect = 1e-012
Identities = 99/406 (24%), Positives = 157/406 (38%), Gaps = 64/406 (15%)
Query: 280 PEIDAFISAKKDFWLHGNEHRQQSNNSIM---FDTKPTKWELSYIFDKIVEAGGSE---- 332
P ++ + A + + G Q + ++ D PT+ L F K E G ++
Sbjct: 734 PTLEECVDAVRKQYYSGEGAIQWAGEAVARANIDLLPTQ-ALKVEFLKAYEQGTAKDWLQ 792
Query: 333 ---PGFINAEAARKRAPWFKGVNPCAEILLGNKSFCNLVEVDWGKFLGDHGGLERAVWLA 389
P F E + A + +NPC EI+ G+ CNL E+ + + E+
Sbjct: 793 KRHPHFDEKELKHRLARY--ALNPCGEII-GSNFHCNLSEIHLNQ-IDPFNYKEQEEAFT 848
Query: 390 ARANYRQTCVNLDDGVLQRSWHELNEFLRLCGVGGTGIVKFLDHYTG------------- 436
A A T +N + + E + GV TG+ F H G
Sbjct: 849 AGALSVATLLN--HKFPEPRYQYSRELDPIVGVSFTGLFDFFVHAFGAEWLQWWEAGRPA 906
Query: 437 -------RNNVASMLQSLRAQAHKGANSMAEEFGTPMPKAVTTVKPSGTLSKIMDTTEGV 489
+ L S R + + P TTV+PSGT S + + G
Sbjct: 907 TPQGLAFKREEEKYLSSWREIVQRVVWDYCDRHNIKRPNRCTTVQPSGTKSLLTGASPGW 966
Query: 490 HKPLGKYIINNITFSKDDPIIASLEAAGYKV-------------FDKPFEP--SSKLVAM 534
H P + I ITF K+DP+ + GY V + P +P S LV +
Sbjct: 967 HPPKAQRFIRRITFRKNDPVALACIDYGYNVIPSQSDKDEQGNLLNDPLDPRVSEWLVEI 1026
Query: 535 PVCYDDVSFDEVETDRGTVHVNLESAVEQLDRYKLLMDNYVDHNCSVTISYDPSEVPAI- 593
PV VS+ ++ T+ ++ +A+ Q+D Y + YV HN S TI EV +
Sbjct: 1027 PVA---VSWADI-PGADTIDISKFNAIAQMDFYMQVQKFYVTHNTSATIEVREHEVETLG 1082
Query: 594 --IDWILTNWDTYVGVSFIYRNDPTKTAEDLGYAYLPQEVITEEAY 637
I + N + Y+ + + R D +T + LP E I++E Y
Sbjct: 1083 TQIWQAIQNDEGYISAALLARFDDHQT-----FPRLPFEPISKEQY 1123
Score = 62.4 bits (150), Expect = 1e-008
Identities = 61/264 (23%), Positives = 115/264 (43%), Gaps = 24/264 (9%)
Query: 4 YQEVSTRAEVVTRRTYNRPLNDEGTVFETWEQTVDRVIDHQRWLWERQQRAELNEKQSAE 63
+ E + A V RTY+R E + ETW++ DR + + EL + E
Sbjct: 14 FPESAPAANPVFFRTYSR--RTEAGLRETWDEVCDRTL---------KDLIELGKLSPEE 62
Query: 64 LTELRQLMLDRKATTSGRTLWLGGTDVAKKRE--ASQFNCSFGNVETVHDVVDAQHLLLQ 121
+ L + + A SGR LW+GGTD K + + +NC+ N++ +L +
Sbjct: 63 VAILDKTQRNLTALPSGRWLWVGGTDWINKSKNFSGAYNCTSTNLQDWKAFGLMMNLAMM 122
Query: 122 GCGVGFYPSVGILSGFTAPV--EVQVVRSEKQKSDPKGAEDNVEAFYQEDGKSVWEVVVG 179
GCG G ++ P+ + ++ + + PK + E + + VG
Sbjct: 123 GCGTGAVIEPEYINQL-PPIRNHLNIIVKGEIGTTPKHLRREYTQTHIEGNSAT--IYVG 179
Query: 180 DSAEAWAKSFGKLVAM---KKRVDIIR--LNYREIRAAGMRLKGYGWISSGDSDLSVAQT 234
DS E W +S+ L+ + ++ D ++ ++ ++R +G L G+G +++ L
Sbjct: 180 DSREGWVESYQALLELSTDERFTDEVQVFVDVSDVRQSGETLNGFGGVAN-PVKLPGLYQ 238
Query: 235 AICGILNDRAGQLLRHMDILDLLN 258
ILN G+ L ++ L++
Sbjct: 239 RCAAILNKALGRKLNSVECCLLID 262
>ref|NP_570337.1| (NC_003390) ribonucleoside triphosphate reductase B [Cyanophage
P60]
gb|AAL73275.1| (AF338467) ribonucleoside triphosphate reductase B [Cyanophage P60]
Length = 446
Score = 70.9 bits (172), Expect = 4e-011
Identities = 80/322 (24%), Positives = 128/322 (38%), Gaps = 45/322 (13%)
Query: 350 GVNPCAEILLGNKSFCNLVEVDWGKFLGDHGGLE----RAVWLAARA-------NYRQTC 398
G+NPC EIL G CNL EV D + +A LAA A R
Sbjct: 90 GLNPCGEIL-GKDFHCNLSEVHLNTLDPDDTDAQVKAFQAGALAAGALLHHKFTTERYQY 148
Query: 399 VNLDDGVLQRSWHELNEF---------LRLCGVGGTGIVKFLDHYTGRNNVASMLQSLRA 449
D ++ S+ L +F LR +G + +F + G A +L+ +
Sbjct: 149 SREQDPIVGVSFTGLFDFCVNKWGAEWLRWWSLGRPFVPEFHEFVLGE---ARLLEFWKG 205
Query: 450 QAHKGANSMAEEFGTPMPKAVTTVKPSGTLSKIMDTTEGVHKPLGKYIINNITFSKDDPI 509
+ + E+ G MP TTV+P+GT S + + G H P I ITF+++DP+
Sbjct: 206 VVKETLEAYCEKHGLKMPNRYTTVQPAGTKSLLTGASPGWHPPKSARFIRRITFAREDPV 265
Query: 510 IASLEAAGYKV-------------FDKPFEPSSKLVAMPVCYDDVSFDEVETDRGTVHVN 556
+ GY V + P++ K + + + V ++ D+ N
Sbjct: 266 ALACIDYGYTVVPSQSSRDEEGNLLNDPWDERVKEWLVEIPTETVWANKPGCDQ--YEPN 323
Query: 557 LESAVEQLDRYKLLMDNYVDHNCSVTISYDPSEVPAIIDWILTN-WDTYVGVSFIYRNDP 615
S Q D Y + + HN S T+ + +E+ + + I D Y+ + + R D
Sbjct: 324 TFSVKAQFDFYMQVQKFWTTHNTSATLEFRENEIEELSELIYRGCGDGYISAALLARFDA 383
Query: 616 TKTAEDLGYAYLPQEVITEEAY 637
+T + LP E I E Y
Sbjct: 384 NET-----FPRLPFEPIDRETY 400
>ref|NP_215084.1| (NC_000962) nrdZ [Mycobacterium tuberculosis H37Rv]
pir||A70933 probable nrdZ protein - Mycobacterium tuberculosis (strain H37RV)
emb|CAA17441.1| (AL021942) nrdZ [Mycobacterium tuberculosis H37Rv]
Length = 692
Score = 51.6 bits (122), Expect = 3e-005
Identities = 85/367 (23%), Positives = 137/367 (37%), Gaps = 85/367 (23%)
Query: 178 VGDSAEAWAKSFGKLVAMKKRVDIIRLNYREIRAAGMRLKGYGWISSGDSDLSVAQTAIC 237
+ DS ++ + G+ +++ + +R AG R+ G +SG +
Sbjct: 198 IEDSLQSIFATLGQAAELQRAGGGTGYAFSHLRPAGDRVASTGGTASGPVSFLRLYDSAA 257
Query: 238 GILNDRAGQLLRHMDILDLLNHMGTVLSSRRSAEIAVLPVTNPEIDAFISAKK------- 290
G+++ MG RR A +AVL V++P+I F++AK
Sbjct: 258 GVVS------------------MG---GRRRGACMAVLDVSHPDICDFVTAKAESPSELP 296
Query: 291 ---------DFWLHGNE----HRQQSNNSIMFDTKPTKWELSYIFDKIVEAGGSEPGFIN 337
D +L E HR + + + EL K AGG +PG +
Sbjct: 297 HFNLSVGVTDAFLRAVERNGLHRLVNPRTGKIVARMPAAELFDAICKAAHAGG-DPGLVF 355
Query: 338 AEAARKRAPW-----FKGVNPCAEILLGNKSFCNLVEVDWGKFLGDH----GGLERAVWL 388
+ + P + NPC E+ L CNL ++ + L D LE +
Sbjct: 356 LDTINRANPVPGRGRIEATNPCGEVPLLPYESCNLGSINLARMLADGRVDWDRLEEVAGV 415
Query: 389 AARANYRQTCVNLDD--GVLQRSWHELNEFLRLC---GVGGTGIVKFL-------DHYTG 436
A R LDD V + + EL E R G+G G+ + L D
Sbjct: 416 AVRF--------LDDVIDVSRYPFPELGEAARATRKIGLGVMGLAELLAALGIPYDSEEA 467
Query: 437 RNNVASMLQSLRAQAHKGANSMAEEFGT-------------PMPKA-VTTVKPSGTLSKI 482
+++ ++ AH + +AEE G P A VT+V P+GT+S I
Sbjct: 468 VRLATRLMRRIQQAAHTASRRLAEERGAFPAFTDSRFARSGPRRNAQVTSVAPTGTISLI 527
Query: 483 MDTTEGV 489
TT G+
Sbjct: 528 AGTTAGI 534
>ref|NP_335005.1| (NC_002755) ribonucleoside-diphosphate reductase, alpha subunit,
putative [Mycobacterium tuberculosis CDC1551]
gb|AAK44819.1| (AE006957) ribonucleoside-diphosphate reductase, alpha subunit,
putative [Mycobacterium tuberculosis CDC1551]
Length = 735
Score = 51.6 bits (122), Expect = 3e-005
Identities = 85/367 (23%), Positives = 137/367 (37%), Gaps = 85/367 (23%)
Query: 178 VGDSAEAWAKSFGKLVAMKKRVDIIRLNYREIRAAGMRLKGYGWISSGDSDLSVAQTAIC 237
+ DS ++ + G+ +++ + +R AG R+ G +SG +
Sbjct: 241 IEDSLQSIFATLGQAAELQRAGGGTGYAFSHLRPAGDRVASTGGTASGPVSFLRLYDSAA 300
Query: 238 GILNDRAGQLLRHMDILDLLNHMGTVLSSRRSAEIAVLPVTNPEIDAFISAKK------- 290
G+++ MG RR A +AVL V++P+I F++AK
Sbjct: 301 GVVS------------------MG---GRRRGACMAVLDVSHPDICDFVTAKAESPSELP 339
Query: 291 ---------DFWLHGNE----HRQQSNNSIMFDTKPTKWELSYIFDKIVEAGGSEPGFIN 337
D +L E HR + + + EL K AGG +PG +
Sbjct: 340 HFNLSVGVTDAFLRAVERNGLHRLVNPRTGKIVARMPAAELFDAICKAAHAGG-DPGLVF 398
Query: 338 AEAARKRAPW-----FKGVNPCAEILLGNKSFCNLVEVDWGKFLGDH----GGLERAVWL 388
+ + P + NPC E+ L CNL ++ + L D LE +
Sbjct: 399 LDTINRANPVPGRGRIEATNPCGEVPLLPYESCNLGSINLARMLADGRVDWDRLEEVAGV 458
Query: 389 AARANYRQTCVNLDD--GVLQRSWHELNEFLRLC---GVGGTGIVKFL-------DHYTG 436
A R LDD V + + EL E R G+G G+ + L D
Sbjct: 459 AVRF--------LDDVIDVSRYPFPELGEAARATRKIGLGVMGLAELLAALGIPYDSEEA 510
Query: 437 RNNVASMLQSLRAQAHKGANSMAEEFGT-------------PMPKA-VTTVKPSGTLSKI 482
+++ ++ AH + +AEE G P A VT+V P+GT+S I
Sbjct: 511 VRLATRLMRRIQQAAHTASRRLAEERGAFPAFTDSRFARSGPRRNAQVTSVAPTGTISLI 570
Query: 483 MDTTEGV 489
TT G+
Sbjct: 571 AGTTAGI 577
>ref|NP_070492.1| (NC_000917) ribonucleotide reductase (nrd) [Archaeoglobus fulgidus]
pir||G69457 ribonucleoside-diphosphate reductase (EC 1.17.4.1) large chain nrd
- Archaeoglobus fulgidus
gb|AAB89584.1| (AE000988) ribonucleotide reductase (nrd) [Archaeoglobus fulgidus]
Length = 752
Score = 50.8 bits (120), Expect = 5e-005
Identities = 67/277 (24%), Positives = 107/277 (38%), Gaps = 62/277 (22%)
Query: 267 RRSAEIAVLPVTNPEIDAFISAKKDFWLHGNEHRQQSNNSIMFDT--KPTKWEL------ 318
RR A + VL V +P+I+ FI AK + + N + + + K ++L
Sbjct: 159 RRGANMGVLNVHHPDIEEFIKAKWEEGVLRNFNISVAATDAFMEALKKDDDYDLINPRTG 218
Query: 319 --------SYIFDKIVEAG--GSEPGFINAEAARKRAPW-----FKGVNPCAEILLGNKS 363
IF+ IVE EPG I + + P + NPC E L
Sbjct: 219 EVVRRVPARKIFNLIVEGAWRNGEPGMIFIDTINRFNPTPHVGQIEATNPCGEQPLLPYE 278
Query: 364 FCNLVEVDWGKFLG----DHGGLERAVWLAARANYRQTCVNLDDGVLQRSW-----HELN 414
CNL ++ KF+ D GL VW+A R LDD + S+ ++
Sbjct: 279 SCNLGSINLAKFVKDGEIDWDGLREVVWIAVRF--------LDDVIDVNSYPIPEIEQMT 330
Query: 415 EFLRLCGVGGTGIVKFL-------DHYTGRNNVASMLQSLRAQAHKGANSMAEEFGT-PM 466
+ R G+G G L D +++ ++ ++H+ + +AEE G P
Sbjct: 331 KANRKIGLGVMGWADMLFLLNIPYDSDEAIQLAEKVMKFIQEESHRASQQLAEERGVFPN 390
Query: 467 PK--------------AVTTVKPSGTLSKIMDTTEGV 489
K TT+ P+GT+S I + G+
Sbjct: 391 WKGSVWEKKGIKMRNATTTTIAPTGTISIIAGCSSGI 427
>ref|NP_110611.1| (NC_002689) Ribonucleotide reductase alpha subunit [Thermoplasma
volcanium]
Length = 767
Score = 43.1 bits (100), Expect = 0.009
Identities = 60/300 (20%), Positives = 111/300 (37%), Gaps = 75/300 (25%)
Query: 251 MDILDLLNHMGTVLSSRRSAEIAVLPVTNPEIDAFISAKKDFWLHGNEHRQQSNNSIMFD 310
M I D+ + RR A + +L +P+I F+ +K +E++ SN +I
Sbjct: 307 MKIFDVTTDVIKQGGKRRGANMGILNYNHPDIMEFVLSKD------SENKVLSNFNISVG 360
Query: 311 TKPTKWE-----------------------LSYIFDKIVEAGG--SEPGFINAEAARKRA 345
+E ++D I++ ++PG I + ++
Sbjct: 361 VTDEFFEKLDNDDYVDLINPRTKKVTKRIKAKELWDAIIDQAWKTADPGLIFLDEINRKN 420
Query: 346 PW-----FKGVNPCAEILLGNKSFCNLVEVDWGKFLG----DHGGLERAVWLAARANYRQ 396
P + NPC E L CNL ++ K++ D+ L VWLA R
Sbjct: 421 PVKNVGDIESTNPCGEQPLLPYESCNLGSINLSKYVNGKDIDYDRLRETVWLATRF---- 476
Query: 397 TCVNLDDGVLQRSW--HELNEFLRLCGVGGTGIVKFLD----------HYTGRNNVASML 444
LDD + + ++ + R+ G G++ F D + ++
Sbjct: 477 ----LDDVIDANKFPVEQIKKVTRMTRKIGLGVMGFADLLIKLEVPYNSWEALEIAEKVM 532
Query: 445 QSLRAQAHKGANSMAEE-------FGTPMPK--------AVTTVKPSGTLSKIMDTTEGV 489
+ ++HK + +AEE +G+ K TT+ P+GT+S I + +
Sbjct: 533 SFINEESHKASQHLAEERGVFPAWYGSEWEKEGIKMRNSTTTTIAPTGTISIIAGCSSSI 592
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.317 0.134 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 369,650,582
Number of Sequences: 891141
Number of extensions: 15550131
Number of successful extensions: 35724
Number of sequences better than 5.0e-02: 12
Number of HSP's better than 0.1 without gapping: 6
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 35689
Number of HSP's gapped (non-prelim): 14
length of query: 671
length of database: 279,076,897
effective HSP length: 127
effective length of query: 544
effective length of database: 165,901,990
effective search space: 90250682560
effective search space used: 90250682560
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 94 (40.8 bits)
Query= orf35 [7432-7511]
(78 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_049698.1| (NC_000866) thioredoxin [Enterobacteria ph... 39 0.011
pdb|1DE1|A Chain A, Nmr Structures Of Oxidized Bacteriophag... 39 0.011
ref|NP_569495.1| (NC_003385) putative glutaredoxin [Salmone... 38 0.019
ref|NP_233395.1| (NC_002506) glutaredoxin-related protein [... 37 0.041
ref|NP_246286.1| (NC_002663) unknown [Pasteurella multocida... 37 0.041
>ref|NP_049698.1| (NC_000866) thioredoxin [Enterobacteria phage T4]
sp|P00276|THIO_BPT4 Thioredoxin
pir||TXBPT4 thioredoxin [validated] - phage T4
pdb|1AAZ|A Chain A, Glutaredoxin
pdb|1AAZ|B Chain B, Glutaredoxin
gb|AAA32529.1| (M13894) thioredoxin [Enterobacteria phage T4]
gb|AAD42626.1|AF158101_213 (AF158101) thioredoxin [Enterobacteria phage T4]
Length = 87
Score = 38.9 bits (89), Expect = 0.011
Identities = 23/72 (31%), Positives = 37/72 (50%), Gaps = 11/72 (15%)
Query: 12 CKYCTAALGLLESLGHKVEVYDVQ-------DENMSLVVNML----RVTNTVPQLYAPDG 60
C YC A LL E ++ DE ++ ++ L ++ T+PQ++APDG
Sbjct: 14 CVYCDNAKRLLTVKKQPFEFINIMPEKGVFDDEKIAELLTKLGRDTQIGLTMPQVFAPDG 73
Query: 61 QFIGGYTELKEY 72
IGG+ +L+EY
Sbjct: 74 SHIGGFDQLREY 85
>pdb|1DE1|A Chain A, Nmr Structures Of Oxidized Bacteriophage T4 Glutaredoxin
pdb|1DE2|A Chain A, Nmr Structures Of Reduced Bacteriophage T4 Glutaredoxin
Length = 87
Score = 38.9 bits (89), Expect = 0.011
Identities = 23/72 (31%), Positives = 37/72 (50%), Gaps = 11/72 (15%)
Query: 12 CKYCTAALGLLESLGHKVEVYDVQ-------DENMSLVVNML----RVTNTVPQLYAPDG 60
C YC A LL E ++ DE ++ ++ L ++ T+PQ++APDG
Sbjct: 14 CVYCDNAKRLLTVKKQPFEFINIMPEKGVFDDEKIAELLTKLGRDTQIGLTMPQVFAPDG 73
Query: 61 QFIGGYTELKEY 72
IGG+ +L+EY
Sbjct: 74 SHIGGFDQLREY 85
>ref|NP_569495.1| (NC_003385) putative glutaredoxin [Salmonella enterica subsp.
enterica serovar Typhi]
emb|CAD09890.1| (AL513384) putative glutaredoxin [Salmonella enterica subsp.
enterica serovar Typhi]
Length = 81
Score = 38.1 bits (87), Expect = 0.019
Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 12/75 (16%)
Query: 3 KYIMLTQDNCKYCTAALGLLESL-GHKVEVYD--------VQDENMSLVVNMLRVTNTVP 53
K ++ +DNC YC A+ L + L GH Y+ + E +S +V+ + T+P
Sbjct: 2 KVVIYGRDNCSYCKRAVELAKQLRGHGFGDYEYIDIVSAGIDKEKLSYLVS--KPVETIP 59
Query: 54 QLYAPDGQFIGGYTE 68
Q++ +G+ IGGY E
Sbjct: 60 QVFV-NGESIGGYEE 73
>ref|NP_233395.1| (NC_002506) glutaredoxin-related protein [Vibrio cholerae]
pir||H82389 glutaredoxin-related protein VCA1011 [imported] - Vibrio cholerae
(group O1 strain N16961)
gb|AAF96907.1| (AE004427) glutaredoxin-related protein [Vibrio cholerae]
Length = 212
Score = 37.0 bits (84), Expect = 0.041
Identities = 25/84 (29%), Positives = 43/84 (50%), Gaps = 9/84 (10%)
Query: 1 MQKYIMLTQDNCKYCTAALGLLESLGHKVEVYDVQDENMSLVVNMLRVTN--------TV 52
+Q I+ + C YC A LL++ G +DV E+ +L + V TV
Sbjct: 112 LQGNIVFGKATCPYCVKAKQLLDTAGIDYRYHDVVKESAALYRMIPEVKAIIGEKTPVTV 171
Query: 53 PQLYAPDGQFIGGYTELKEYLNDS 76
PQ++ +GQ+IGG L+++L ++
Sbjct: 172 PQIWL-NGQYIGGCDALEKWLQNN 194
>ref|NP_246286.1| (NC_002663) unknown [Pasteurella multocida]
gb|AAK03431.1| (AE006173) unknown [Pasteurella multocida]
Length = 241
Score = 37.0 bits (84), Expect = 0.041
Identities = 19/67 (28%), Positives = 37/67 (54%), Gaps = 1/67 (1%)
Query: 6 MLTQDNCKYCTAALGLLESLGHKVEVYDVQDENMSLVVNMLRVTNTVPQLYAPDGQFIGG 65
++T+ C YC A LL++ + E + + ++ V + +VPQ++ G++IGG
Sbjct: 174 VITKPGCPYCAKAKALLKAKDYTFEEIVLGRDASTISVRAITGKTSVPQVFI-GGKYIGG 232
Query: 66 YTELKEY 72
EL++Y
Sbjct: 233 SEELEKY 239
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.317 0.134 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 41,662,997
Number of Sequences: 891141
Number of extensions: 1269543
Number of successful extensions: 2794
Number of sequences better than 5.0e-02: 5
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 2793
Number of HSP's gapped (non-prelim): 5
length of query: 78
length of database: 279,076,897
effective HSP length: 54
effective length of query: 24
effective length of database: 230,955,283
effective search space: 5542926792
effective search space used: 5542926792
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 84 (37.0 bits)
Query= orf36 [7506-7656]
(149 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.322 0.139 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 79,044,722
Number of Sequences: 891141
Number of extensions: 2890956
Number of successful extensions: 4601
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4601
Number of HSP's gapped (non-prelim): 0
length of query: 149
length of database: 279,076,897
effective HSP length: 109
effective length of query: 40
effective length of database: 181,942,528
effective search space: 7277701120
effective search space used: 7277701120
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 85 (37.4 bits)
Query= orf37 [7758-7891]
(132 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_569498.1| (NC_003385) putative PhoH-like protein [Sa... 95 1e-019
ref|NP_309293.1| (NC_002695) ATP-binding pho regulon compon... 95 1e-019
ref|NP_455620.1| (NC_003198) PhoH protein (phosphate starva... 95 1e-019
ref|NP_287027.1| (NC_002655) PhoB-dependent, ATP-binding ph... 95 1e-019
ref|NP_267240.1| (NC_002662) phosphate starvation inducible... 91 2e-018
ref|NP_405516.1| (NC_003143) putative phosphate starvation-... 91 2e-018
ref|NP_230608.1| (NC_002505) phoH family protein [Vibrio ch... 83 5e-016
ref|NP_353389.1| (NC_003062) AGR_C_625p [Agrobacterium tume... 83 5e-016
ref|NP_390412.1| (NC_000964) phosphate starvation-induced p... 82 1e-015
ref|NP_268762.1| (NC_002737) phosphate starvation-induced p... 81 2e-015
ref|NP_518651.1| (NC_003295) PROBABLE PHOH-LIKE PROTEIN [Ra... 79 8e-015
ref|NP_298194.1| (NC_002488) ATP-binding protein [Xylella f... 79 8e-015
ref|NP_213930.1| (NC_000918) phosphate starvation-inducible... 79 8e-015
ref|NP_540892.1| (NC_003317) PHOH PROTEIN [Brucella meliten... 78 1e-014
emb|CAB66426.1| (AL136534) PhoH-like protein. [Streptomyces... 75 1e-013
>ref|NP_569498.1| (NC_003385) putative PhoH-like protein [Salmonella enterica subsp.
enterica serovar Typhi]
emb|CAD09893.1| (AL513384) putative PhoH-like protein [Salmonella enterica subsp.
enterica serovar Typhi]
Length = 277
Score = 95.1 bits (235), Expect = 1e-019
Identities = 52/122 (42%), Positives = 75/122 (60%), Gaps = 5/122 (4%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKN--GNIEMAPLALMRGRSFDNAFIIVDETQNISTHE 59
P+ PV DVL+K LG +E +K +E+AP A MRGR+F+NA +I+DE QN++ +
Sbjct: 148 PFFRPVYDVLQKRLGGSFLEYCLKPEVAKVEIAPFAYMRGRTFENAVVILDEAQNVTASQ 207
Query: 60 LKMLLTRVGEGTTIVLNGDAQQSDL--KEADGLSKVIHLAKKHQLPVPIIEFGVDDIVRS 117
+KM LTR+GE T+++NGD Q DL GL + K V +IEF +D VRS
Sbjct: 208 MKMFLTRMGENVTVIVNGDVTQCDLPGNVKSGLEDALERFKPSS-HVGLIEFSTEDCVRS 266
Query: 118 DI 119
++
Sbjct: 267 EL 268
>ref|NP_309293.1| (NC_002695) ATP-binding pho regulon component PhoH [Escherichia
coli O157:H7]
ref|NP_415539.1| (NC_000913) PhoB-dependent, ATP-binding pho regulon component; may
be helicase; induced by P starvation [Escherichia coli
K12]
sp|P31544|PHOH_ECOLI PhoH protein (Phosphate starvation-inducible protein psiH)
pir||B47065 phosphate starvation-inducible protein psiH - Escherichia coli
dbj|BAA01230.1| (D10391) PhoH protein [Escherichia coli]
dbj|BAA35801.1| (D90739) PhoH protein (phosphate starvation-inducible protein
PsiH). [Escherichia coli]
gb|AAC74105.1| (AE000204) PhoB-dependent, ATP-binding pho regulon component; may
be helicase; induced by P starvation [Escherichia coli
K12]
dbj|BAB34689.1| (AP002554) ATP-binding pho regulon component PhoH [Escherichia coli
O157:H7]
Length = 354
Score = 94.7 bits (234), Expect = 1e-019
Identities = 50/120 (41%), Positives = 77/120 (63%), Gaps = 5/120 (4%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKN--GNIEMAPLALMRGRSFDNAFIIVDETQNISTHE 59
P+ PV DVL + LG ++ ++ G +E+AP A MRGR+F+NA +I+DE QN++ +
Sbjct: 225 PYFRPVYDVLVRRLGASFMQYCLRPEIGKVEIAPFAYMRGRTFENAVVILDEAQNVTAAQ 284
Query: 60 LKMLLTRVGEGTTIVLNGDAQQSDLKE--ADGLSKVIHLAKKHQLPVPIIEFGVDDIVRS 117
+KM LTR+GE T+++NGD Q DL GLS + ++ ++ V I+ FG +D VRS
Sbjct: 285 MKMFLTRLGENVTVIVNGDITQCDLPRGVCSGLSDALERFEEDEM-VGIVRFGKEDCVRS 343
>ref|NP_455620.1| (NC_003198) PhoH protein (phosphate starvation-inducible protein
PsiH) [Salmonella enterica subsp. enterica serovar
Typhi]
ref|NP_460098.1| (NC_003197) PhoB-dependent, ATP-binding pho regulon component
[Salmonella typhimurium LT2]
gb|AAL20057.1| (AE008749) PhoB-dependent, ATP-binding pho regulon component
[Salmonella typhimurium LT2]
emb|CAD08250.1| (AL627269) PhoH protein (phosphate starvation-inducible protein
PsiH) [Salmonella enterica subsp. enterica serovar
Typhi]
Length = 284
Score = 94.7 bits (234), Expect = 1e-019
Identities = 51/120 (42%), Positives = 76/120 (62%), Gaps = 5/120 (4%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKN--GNIEMAPLALMRGRSFDNAFIIVDETQNISTHE 59
P+ PV DVL K LG ++ ++ G +E+AP A MRGR+F+NA +I+DE QN++ +
Sbjct: 155 PYFRPVYDVLLKRLGASFMQYCLRPEIGKVEIAPFAYMRGRTFENAVVILDEAQNVTAAQ 214
Query: 60 LKMLLTRVGEGTTIVLNGDAQQSDLKEA--DGLSKVIHLAKKHQLPVPIIEFGVDDIVRS 117
+KM LTR+GE T+++NGD Q DL GLS + ++ ++ V I+ F DD VRS
Sbjct: 215 MKMFLTRLGENVTVIVNGDITQCDLPRGVRSGLSDALERFEEDEM-VGIVHFNKDDCVRS 273
>ref|NP_287027.1| (NC_002655) PhoB-dependent, ATP-binding pho regulon component; may
be helicase; induced by P starvation [Escherichia coli
O157:H7 EDL933]
gb|AAG55638.1|AE005302_1 (AE005302) PhoB-dependent, ATP-binding pho regulon component; may
be helicase; induced by P starvation [Escherichia coli
O157:H7 EDL933]
Length = 354
Score = 94.7 bits (234), Expect = 1e-019
Identities = 50/120 (41%), Positives = 77/120 (63%), Gaps = 5/120 (4%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKN--GNIEMAPLALMRGRSFDNAFIIVDETQNISTHE 59
P+ PV DVL + LG ++ ++ G +E+AP A MRGR+F+NA +I+DE QN++ +
Sbjct: 225 PYFRPVYDVLVRRLGASFMQYCLRPEIGKVEIAPFAYMRGRTFENAVVILDEAQNVTAAQ 284
Query: 60 LKMLLTRVGEGTTIVLNGDAQQSDLKE--ADGLSKVIHLAKKHQLPVPIIEFGVDDIVRS 117
+KM LTR+GE T+++NGD Q DL GLS + ++ ++ V I+ FG +D VRS
Sbjct: 285 MKMFLTRLGENVTVIVNGDITQCDLPRGVCSGLSDALERFEEDEM-VGIVRFGKEDCVRS 343
>ref|NP_267240.1| (NC_002662) phosphate starvation inducible protein [Lactococcus
lactis subsp. lactis]
gb|AAK05182.1|AE006341_1 (AE006341) phosphate starvation inducible protein [Lactococcus
lactis subsp. lactis]
Length = 322
Score = 90.5 bits (223), Expect = 2e-018
Identities = 50/132 (37%), Positives = 76/132 (56%), Gaps = 11/132 (8%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHELK 61
P+ PV D L + LGK A E ++ G IE+APLA MRGR+ D+AF+I+DE QN +T ++K
Sbjct: 193 PYLRPVYDALFQILGKTATERMMERGIIEIAPLAYMRGRTLDDAFVILDEAQNTTTMQMK 252
Query: 62 MLLTRVGEGTTIVLNGDAQQSDLKE------ADGLSKVIHLAKKHQLPVPIIEFGVDDIV 115
M LTR+G + +++NGD Q DL D K+ H+ + + F D+V
Sbjct: 253 MFLTRLGFNSKMIVNGDISQIDLPNKVKSGLIDAKDKLSHVKS-----IDFVYFMAKDVV 307
Query: 116 RSDITAMWVRTF 127
R + A ++ +
Sbjct: 308 RHPVVAEIIKAY 319
>ref|NP_405516.1| (NC_003143) putative phosphate starvation-inducible protein
[Yersinia pestis]
pir||T47002 hypothetical protein [imported] - Yersinia pestis
emb|CAA21345.1| (AL031866) ORF22, len: 262 aa, phoH, highly similar to phosphate
starvation-inducible protein phoH, from E. coli
ECOPHOH_2, (354 aa), 91.2% identity in 262 aa overlap.
Fasta scores: opt: 1567, E(): 0. [Yersinia pestis]
emb|CAC90772.1| (AJ414150) putative phosphate starvation-inducible protein
[Yersinia pestis]
Length = 262
Score = 90.5 bits (223), Expect = 2e-018
Identities = 48/120 (40%), Positives = 74/120 (61%), Gaps = 5/120 (4%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKN--GNIEMAPLALMRGRSFDNAFIIVDETQNISTHE 59
P+ PV D+L + LG ++ ++ G +E+AP A MRGR+F+NA +I+DE QN++ +
Sbjct: 133 PYFRPVYDILLRRLGSSFMQYCLRPEIGKVEIAPFAYMRGRTFENAVVILDEAQNVTASQ 192
Query: 60 LKMLLTRVGEGTTIVLNGDAQQSDLKEA--DGLSKVIHLAKKHQLPVPIIEFGVDDIVRS 117
+KM LTR+GE T+++NGD Q DL GLS + + ++ + II F D VRS
Sbjct: 193 MKMFLTRLGENVTVIVNGDITQCDLPRGVKSGLSDALERFTEDEM-IGIIRFDKQDCVRS 251
>ref|NP_230608.1| (NC_002505) phoH family protein [Vibrio cholerae]
pir||D82261 phoH family protein VC0961 [imported] - Vibrio cholerae (group O1
strain N16961)
gb|AAF94123.1| (AE004177) phoH family protein [Vibrio cholerae]
Length = 367
Score = 82.8 bits (203), Expect = 5e-016
Identities = 50/130 (38%), Positives = 72/130 (54%), Gaps = 3/130 (2%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHELK 61
P+ P+ D L + LG VE I+ IE+APLA MRGR+ ++AFII+DE+QN + ++K
Sbjct: 196 PYLRPLYDALFEMLGFERVEKLIERNVIEVAPLAYMRGRTLNDAFIILDESQNTTVEQMK 255
Query: 62 MLLTRVGEGTTIVLNGDAQQSDLKEA--DGLSKVIHLAKKHQLPVPIIEFGVDDIVRSDI 119
M LTR+G + V+ GD Q DL GL I + + + F DD+VR +
Sbjct: 256 MFLTRIGFNSRAVITGDVTQIDLPRGAKSGLRHAIEVLNEVD-EISFNFFQADDVVRHPV 314
Query: 120 TAMWVRTFLK 129
A V + K
Sbjct: 315 VARIVNAYEK 324
>ref|NP_353389.1| (NC_003062) AGR_C_625p [Agrobacterium tumefaciens] [Agrobacterium
tumefaciens str. C58 (Cereon)]
gb|AAK86174.1| (AE007973) AGR_C_625p [Agrobacterium tumefaciens str. C58 (Cereon)]
Length = 358
Score = 82.8 bits (203), Expect = 5e-016
Identities = 48/128 (37%), Positives = 73/128 (56%), Gaps = 3/128 (2%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHELK 61
P+ P+ D L + VE I+ G IE+APLA MRGR+ NA +I+DE QN +T ++K
Sbjct: 217 PYLRPLYDALYDMMPGDKVERAIQAGVIEIAPLAFMRGRTLANAAVILDEAQNTTTMQMK 276
Query: 62 MLLTRVGEGTTIVLNGDAQQSDLKEA--DGLSKVIHLAKKHQLPVPIIEFGVDDIVRSDI 119
M LTR+GE +++ GD Q DL GL + + + + + V ++ F D+VR +
Sbjct: 277 MFLTRLGENGRMIVTGDPSQVDLPRGVKSGLVEALQILPEVE-GVSVVRFKDVDVVRHPM 335
Query: 120 TAMWVRTF 127
A VR +
Sbjct: 336 VARIVRAY 343
>ref|NP_390412.1| (NC_000964) phosphate starvation-induced protein [Bacillus
subtilis]
sp|P46343|PHOL_BACSU PHOH-like protein
pir||E69676 phosphate starvation-induced protein phoH - Bacillus subtilis
dbj|BAA12477.1| (D84432) YqfE [Bacillus subtilis]
emb|CAB14476.1| (Z99117) phosphate starvation-induced protein [Bacillus subtilis]
Length = 319
Score = 81.6 bits (200), Expect = 1e-015
Identities = 47/132 (35%), Positives = 76/132 (56%), Gaps = 5/132 (3%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHELK 61
P+ P+ D L LG E ++ G IE+APLA MRGR+ D+A++I+DE QN + ++K
Sbjct: 189 PYLRPLYDALHDVLGADHTERLMERGIIEIAPLAYMRGRTLDDAYVILDEAQNTTPAQMK 248
Query: 62 MLLTRVGEGTTIVLNGDAQQSDLKEADGLSKVIHLAK---KHQLPVPIIEFGVDDIVRSD 118
M LTR+G + +++ GD Q DL + G+ + +AK K + +IE D+VR
Sbjct: 249 MFLTRLGFSSKMIITGDVSQIDLPK--GVKSGLAVAKEMLKGIDGISMIELDQTDVVRHP 306
Query: 119 ITAMWVRTFLKE 130
+ A + + K+
Sbjct: 307 LVAKIIEAYDKQ 318
>ref|NP_268762.1| (NC_002737) phosphate starvation-induced protein [Streptococcus
pyogenes] [Streptococcus pyogenes M1 GAS]
gb|AAK33483.1| (AE006506) phosphate starvation-induced protein [Streptococcus
pyogenes M1 GAS]
Length = 350
Score = 81.3 bits (199), Expect = 2e-015
Identities = 47/128 (36%), Positives = 73/128 (56%), Gaps = 3/128 (2%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHELK 61
P+ PV D L LGK ++ IE+APLA MRGR+ D+AF+I+DE QN + ++K
Sbjct: 188 PYLRPVYDALYHILGKEQTTRLMERDVIEIAPLAYMRGRTLDDAFVILDEAQNTTIMQMK 247
Query: 62 MLLTRVGEGTTIVLNGDAQQSDLKEADGLSKVIHLAKKHQ--LPVPIIEFGVDDIVRSDI 119
M LTR+G + +++NGD Q DL + S +I +K Q + + F D+VR +
Sbjct: 248 MFLTRLGFNSKMIVNGDTSQIDLPR-NVKSGLIDATQKLQGIKQIDFVYFSAKDVVRHPV 306
Query: 120 TAMWVRTF 127
A ++ +
Sbjct: 307 VADIIKAY 314
>ref|NP_518651.1| (NC_003295) PROBABLE PHOH-LIKE PROTEIN [Ralstonia solanacearum]
emb|CAD14058.1| (AL646059) PROBABLE PHOH-LIKE PROTEIN [Ralstonia solanacearum]
Length = 325
Score = 79.0 bits (193), Expect = 8e-015
Identities = 47/128 (36%), Positives = 71/128 (54%), Gaps = 3/128 (2%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHELK 61
P+ P+ D L LG + + IE+APLA MRGR+ ++AFII+DE QN + ++K
Sbjct: 189 PYLRPLYDALYDLLGFDKTQKMFERQMIEIAPLAYMRGRTLNHAFIILDEAQNTTPEQMK 248
Query: 62 MLLTRVGEGTTIVLNGDAQQSDLK--EADGLSKVIHLAKKHQLPVPIIEFGVDDIVRSDI 119
M LTR+G G+ V+ GD Q DL + GL + H+ + + V + F D+VR +
Sbjct: 249 MFLTRIGFGSKAVITGDTTQIDLPRGQKSGLIEAQHVLRDVR-GVSMTRFSSVDVVRHPL 307
Query: 120 TAMWVRTF 127
A V +
Sbjct: 308 VARIVEAY 315
>ref|NP_298194.1| (NC_002488) ATP-binding protein [Xylella fastidiosa 9a5c]
pir||D82747 ATP-binding protein XF0904 [imported] - Xylella fastidiosa (strain
9a5c)
gb|AAF83714.1|AE003930_4 (AE003930) ATP-binding protein [Xylella fastidiosa 9a5c]
Length = 327
Score = 79.0 bits (193), Expect = 8e-015
Identities = 49/131 (37%), Positives = 73/131 (55%), Gaps = 5/131 (3%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHELK 61
P+ P+ D L + LG V ++ IE+APLA MRGR+ +NAF+I+DE QN + ++K
Sbjct: 188 PYLRPLYDALYEMLGVEKVLKLLEKNIIEIAPLAYMRGRTLNNAFVILDEAQNTTIEQMK 247
Query: 62 MLLTRVGEGTTIVLNGDAQQSDLKE--ADGLSKVIHLAKKHQLP-VPIIEFGVDDIVRSD 118
M LTR+G G+T V+ GD Q D+ + GL I + H + + F D+VR
Sbjct: 248 MFLTRLGFGSTAVVTGDLTQIDIPKHLKSGLDDAIEVL--HDIEGISFTFFKSHDVVRHP 305
Query: 119 ITAMWVRTFLK 129
+ A V + K
Sbjct: 306 LVARIVSAYDK 316
>ref|NP_213930.1| (NC_000918) phosphate starvation-inducible protein [Aquifex
aeolicus]
pir||C70417 phosphate starvation-inducible protein - Aquifex aeolicus
gb|AAC07324.1| (AE000736) phosphate starvation-inducible protein [Aquifex
aeolicus]
Length = 348
Score = 79.0 bits (193), Expect = 8e-015
Identities = 46/128 (35%), Positives = 71/128 (54%), Gaps = 3/128 (2%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHELK 61
P+ P+ D L + ++ IE+APLA MRGR+ ++AFII+DE QN + ++K
Sbjct: 198 PYLTPLYDALYDMVDYDKANYMLERNIIEIAPLAFMRGRTLNDAFIILDEAQNATRDQMK 257
Query: 62 MLLTRVGEGTTIVLNGDAQQSDL--KEADGLSKVIHLAKKHQLPVPIIEFGVDDIVRSDI 119
M LTR+G G+ V+ GD Q DL K+ GL + + + K + + I F D+VR I
Sbjct: 258 MFLTRIGFGSKAVITGDVTQIDLPKKKESGLIEAMEVLKNIE-GIEFIFFKQTDVVRHPI 316
Query: 120 TAMWVRTF 127
A + +
Sbjct: 317 VAKIINAY 324
>ref|NP_540892.1| (NC_003317) PHOH PROTEIN [Brucella melitensis]
gb|AAL53156.1| (AE009631) PHOH PROTEIN [Brucella melitensis]
Length = 315
Score = 78.2 bits (191), Expect = 1e-014
Identities = 46/132 (34%), Positives = 75/132 (55%), Gaps = 3/132 (2%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHELK 61
P+ P+ D L + VE I G IE+APLA MRGR+ ++ +I+DE QN ++ ++K
Sbjct: 178 PYLRPLYDALYDMMPAEKVERAITAGVIEIAPLAFMRGRTLAHSAVILDEAQNTTSMQMK 237
Query: 62 MLLTRVGEGTTIVLNGDAQQSDLK--EADGLSKVIHLAKKHQLPVPIIEFGVDDIVRSDI 119
M LTR+GEG+ +++ GD Q DL + GL + + + + V + F D+VR +
Sbjct: 238 MFLTRLGEGSRMIVTGDPSQIDLPPGQKSGLVEALRVLDDVE-GVIKVRFTEKDVVRHPL 296
Query: 120 TAMWVRTFLKEG 131
A V + ++G
Sbjct: 297 VAAIVGAYDRDG 308
>emb|CAB66426.1| (AL136534) PhoH-like protein. [Streptomyces coelicolor A3(2)]
Length = 359
Score = 74.7 bits (182), Expect = 1e-013
Identities = 38/83 (45%), Positives = 53/83 (63%)
Query: 2 PWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHELK 61
P+ P+ D L L ++ + G IE+APLA MRGR+ ++AFII+DE QN S ++K
Sbjct: 204 PYLRPLYDALHDMLDPDSIPRLMAAGTIEVAPLAYMRGRTLNDAFIILDEAQNTSPEQMK 263
Query: 62 MLLTRVGEGTTIVLNGDAQQSDL 84
M LTR+G + IV+ GD Q DL
Sbjct: 264 MFLTRLGFESKIVITGDVTQVDL 286
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.319 0.138 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 70,382,663
Number of Sequences: 891141
Number of extensions: 2566861
Number of successful extensions: 6625
Number of sequences better than 5.0e-02: 58
Number of HSP's better than 0.1 without gapping: 58
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 6566
Number of HSP's gapped (non-prelim): 58
length of query: 132
length of database: 279,076,897
effective HSP length: 108
effective length of query: 24
effective length of database: 182,833,669
effective search space: 4388008056
effective search space used: 4388008056
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 83 (36.6 bits)
Query= orf38 [7891-7954]
(62 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.325 0.141 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 35,708,679
Number of Sequences: 891141
Number of extensions: 1062175
Number of successful extensions: 2754
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2754
Number of HSP's gapped (non-prelim): 0
length of query: 62
length of database: 279,076,897
effective HSP length: 38
effective length of query: 24
effective length of database: 245,213,539
effective search space: 5885124936
effective search space used: 5885124936
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 84 (37.0 bits)
Query= orf39 [7972-8019]
(46 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.316 0.130 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,773,424
Number of Sequences: 891141
Number of extensions: 241775
Number of successful extensions: 393
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 393
Number of HSP's gapped (non-prelim): 0
length of query: 46
length of database: 279,076,897
effective HSP length: 22
effective length of query: 24
effective length of database: 259,471,795
effective search space: 6227323080
effective search space used: 6227323080
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 84 (37.0 bits)
Query= orf40 [8020-8105]
(84 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.314 0.131 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 53,526,688
Number of Sequences: 891141
Number of extensions: 2061407
Number of successful extensions: 2993
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2993
Number of HSP's gapped (non-prelim): 0
length of query: 84
length of database: 279,076,897
effective HSP length: 60
effective length of query: 24
effective length of database: 225,608,437
effective search space: 5414602488
effective search space used: 5414602488
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf41 [8102-8189]
(86 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.320 0.129 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 33,460,804
Number of Sequences: 891141
Number of extensions: 852140
Number of successful extensions: 3098
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3098
Number of HSP's gapped (non-prelim): 0
length of query: 86
length of database: 279,076,897
effective HSP length: 62
effective length of query: 24
effective length of database: 223,826,155
effective search space: 5371827720
effective search space used: 5371827720
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf42 [8186-8299]
(112 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064759.1| (NC_002519) gp22 [Roseophage SIO1] >gi|994... 187 2e-047
ref|NP_067468.1| (NM_021493) hypothetical protein, MNCb-130... 37 0.027
>ref|NP_064759.1| (NC_002519) gp22 [Roseophage SIO1]
gb|AAG02604.1|AF189021_24 (AF189021) Roseophage SIO1 complete genome
Length = 112
Score = 187 bits (474), Expect = 2e-047
Identities = 96/112 (85%), Positives = 96/112 (85%)
Query: 1 MVRFIVCLSLVLFLGGCLNPMSLLGGGGPNVAANVQAGAENNQTGVQVGDITKAETVYSG 60
MVRFIVCLSLVLFLGGCLNPMSLLGGGGPNVAANVQAGAENNQTGVQVGDITKAETVYSG
Sbjct: 1 MVRFIVCLSLVLFLGGCLNPMSLLGGGGPNVAANVQAGAENNQTGVQVGDITKAETVYSG 60
Query: 61 VAPSGSVGSLNISNQDXXXXXXXXXXXXXXXXSPQEIWRGFLKTITLGKYRG 112
VAPSGSVGSLNISNQD SPQEIWRGFLKTITLGKYRG
Sbjct: 61 VAPSGSVGSLNISNQDIPIWVILLLILGWVLPSPQEIWRGFLKTITLGKYRG 112
>ref|NP_067468.1| (NM_021493) hypothetical protein, MNCb-1301 [Mus musculus]
dbj|BAA95056.1| (AB041572) unnamed protein product [Mus musculus]
Length = 325
Score = 37.4 bits (85), Expect = 0.027
Identities = 18/52 (34%), Positives = 28/52 (53%)
Query: 19 NPMSLLGGGGPNVAANVQAGAENNQTGVQVGDITKAETVYSGVAPSGSVGSL 70
N LGGGGP + V+ ++N +G+ + TKA + PSG +G+L
Sbjct: 257 NEWKELGGGGPQESVGVRPHSDNKDSGLSSLESTKARASSAASLPSGDLGAL 308
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.317 0.137 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 53,918,011
Number of Sequences: 891141
Number of extensions: 1801779
Number of successful extensions: 4361
Number of sequences better than 5.0e-02: 2
Number of HSP's better than 0.1 without gapping: 2
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4358
Number of HSP's gapped (non-prelim): 2
length of query: 112
length of database: 279,076,897
effective HSP length: 88
effective length of query: 24
effective length of database: 200,656,489
effective search space: 4815755736
effective search space used: 4815755736
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 83 (36.6 bits)
Query= orf43 [8299-8363]
(63 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.311 0.131 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 30,390,504
Number of Sequences: 891141
Number of extensions: 879841
Number of successful extensions: 1504
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 1504
Number of HSP's gapped (non-prelim): 0
length of query: 63
length of database: 279,076,897
effective HSP length: 39
effective length of query: 24
effective length of database: 244,322,398
effective search space: 5863737552
effective search space used: 5863737552
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf44 [8356-8557]
(200 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064760.1| (NC_002519) gp23 [Roseophage SIO1] >gi|994... 391 e-108
sp|P71410|HTR2_HALSA Sensory rhodopsin II transducer (HTR-I... 39 0.039
ref|NP_280509.1| (NC_002607) Htr2 transducer; Htr2 [Halobac... 39 0.039
>ref|NP_064760.1| (NC_002519) gp23 [Roseophage SIO1]
gb|AAG02605.1|AF189021_25 (AF189021) Roseophage SIO1 complete genome
Length = 200
Score = 391 bits (1005), Expect = e-108
Identities = 200/200 (100%), Positives = 200/200 (100%)
Query: 1 MASSWNLAAAQTSSAVEDVASSGTSLWQSVVDGAAEALASGKEAIIDVAEDVYDYLPEQE 60
MASSWNLAAAQTSSAVEDVASSGTSLWQSVVDGAAEALASGKEAIIDVAEDVYDYLPEQE
Sbjct: 1 MASSWNLAAAQTSSAVEDVASSGTSLWQSVVDGAAEALASGKEAIIDVAEDVYDYLPEQE 60
Query: 61 DVMDAAATGVEAAGSVLPAIMDTAQAAGEGLMGSGLSEFRFLGSNFFDAGGKFTEKDLNV 120
DVMDAAATGVEAAGSVLPAIMDTAQAAGEGLMGSGLSEFRFLGSNFFDAGGKFTEKDLNV
Sbjct: 61 DVMDAAATGVEAAGSVLPAIMDTAQAAGEGLMGSGLSEFRFLGSNFFDAGGKFTEKDLNV 120
Query: 121 SDISALGRAVKKAKAEGRSNVDYNDFGTAEGEVLKGGILAGVFDPDLRMARTVGGFKFEE 180
SDISALGRAVKKAKAEGRSNVDYNDFGTAEGEVLKGGILAGVFDPDLRMARTVGGFKFEE
Sbjct: 121 SDISALGRAVKKAKAEGRSNVDYNDFGTAEGEVLKGGILAGVFDPDLRMARTVGGFKFEE 180
Query: 181 DAEGNTILRNTYNFNEGPKT 200
DAEGNTILRNTYNFNEGPKT
Sbjct: 181 DAEGNTILRNTYNFNEGPKT 200
>sp|P71410|HTR2_HALSA Sensory rhodopsin II transducer (HTR-II) (Methyl-accepting
phototaxis protein II) (MPP-II)
pir||T44946 transducer protein htrII [validated] - Halobacterium salinarum
gb|AAC44369.1| (U62676) phototaxis transducer II [Halobacterium salinarum]
Length = 765
Score = 38.5 bits (88), Expect = 0.039
Identities = 45/165 (27%), Positives = 69/165 (41%), Gaps = 26/165 (15%)
Query: 13 SSAVEDVASSGTSLWQSVVDGAAEALASGKEAIIDVAEDVYDYLPEQE---DVMDAAATG 69
S++ E+VA + SL AA A+ G++A D E + D + E D MDA +
Sbjct: 510 SASAEEVAETVASL-ADTAGQAASAVDDGRQATEDAVETMDDVADDAEAAADAMDALDSE 568
Query: 70 VEAAGSVLPAIMDTA------------QAAGEGLMGSGLS----EFRFLGSNFFDAGGKF 113
+ G ++ I D A +AA G G G + E + L DA
Sbjct: 569 MADIGEIVDVIADIADQTNMLALNASIEAARTGADGDGFAVVADEVKTLAEESRDAAEDI 628
Query: 114 TEKDL----NVSDISALGRAVKKAKAEGRSNVDYNDFGTAEGEVL 154
+ L VSD++ RA ++GR+ V D TA +V+
Sbjct: 629 ESRLLALQGQVSDVADEMRATSDTVSDGRATV--GDAATALDDVV 671
>ref|NP_280509.1| (NC_002607) Htr2 transducer; Htr2 [Halobacterium sp. NRC-1]
sp|Q9HP81|HTR2_HALN1 Sensory rhodopsin II transducer (HTR-II) (Methyl-accepting
phototaxis protein II) (MPP-II)
gb|AAG19989.1| (AE005080) Htr2 transducer; Htr2 [Halobacterium sp. NRC-1]
Length = 764
Score = 38.5 bits (88), Expect = 0.039
Identities = 45/165 (27%), Positives = 69/165 (41%), Gaps = 26/165 (15%)
Query: 13 SSAVEDVASSGTSLWQSVVDGAAEALASGKEAIIDVAEDVYDYLPEQE---DVMDAAATG 69
S++ E+VA + SL AA A+ G++A D E + D + E D MDA +
Sbjct: 509 SASAEEVAETVASL-ADTAGQAASAVDDGRQATEDAVETMDDVADDAEAAADAMDALDSE 567
Query: 70 VEAAGSVLPAIMDTA------------QAAGEGLMGSGLS----EFRFLGSNFFDAGGKF 113
+ G ++ I D A +AA G G G + E + L DA
Sbjct: 568 MADIGEIVDVIADIADQTNMLALNASIEAARTGADGDGFAVVADEVKTLAEESRDAAEDI 627
Query: 114 TEKDL----NVSDISALGRAVKKAKAEGRSNVDYNDFGTAEGEVL 154
+ L VSD++ RA ++GR+ V D TA +V+
Sbjct: 628 ESRLLALQGQVSDVADEMRATSDTVSDGRATV--GDAATALDDVV 670
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.311 0.130 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 107,791,670
Number of Sequences: 891141
Number of extensions: 4358493
Number of successful extensions: 9223
Number of sequences better than 5.0e-02: 3
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 9220
Number of HSP's gapped (non-prelim): 5
length of query: 200
length of database: 279,076,897
effective HSP length: 114
effective length of query: 86
effective length of database: 177,486,823
effective search space: 15263866778
effective search space used: 15263866778
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 88 (38.5 bits)
Query= orf45 [8557-8818]
(260 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064761.1| (NC_002519) gp24 [Roseophage SIO1] >gi|994... 384 e-106
>ref|NP_064761.1| (NC_002519) gp24 [Roseophage SIO1]
gb|AAG02606.1|AF189021_26 (AF189021) Roseophage SIO1 complete genome
Length = 247
Score = 384 bits (987), Expect = e-106
Identities = 197/247 (79%), Positives = 197/247 (79%)
Query: 14 MKKXXXXXXXXXXXXXXXXPQMEKLTESLNMTSAWNKGQTLSKDMPDYKPELLNLSSYQA 73
MKK PQMEKLTESLNMTSAWNKGQTLSKDMPDYKPELLNLSSYQA
Sbjct: 1 MKKVSISITVYLVVLILLIPQMEKLTESLNMTSAWNKGQTLSKDMPDYKPELLNLSSYQA 60
Query: 74 KASPSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRKVEAQSYDTLYGNFERQDTP 133
KASPS RKVEAQSYDTLYGNFERQDTP
Sbjct: 61 KASPSTGGATYQAGAGTTPTSATSGITALQPTSTTQTTLRKVEAQSYDTLYGNFERQDTP 120
Query: 134 FKDVSVSSMTIGELADFSRASGEYGRYVKPRLPKNTYAYKKGLTSTPMGKYQIVGSTLRD 193
FKDVSVSSMTIGELADFSRASGEYGRYVKPRLPKNTYAYKKGLTSTPMGKYQIVGSTLRD
Sbjct: 121 FKDVSVSSMTIGELADFSRASGEYGRYVKPRLPKNTYAYKKGLTSTPMGKYQIVGSTLRD 180
Query: 194 LTRRMNLPADTVFNKETQDKMFLFLAKEAVGGGKTSSEKRSRLRGIWEGFKHVDNNTLNK 253
LTRRMNLPADTVFNKETQDKMFLFLAKEAVGGGKTSSEKRSRLRGIWEGFKHVDNNTLNK
Sbjct: 181 LTRRMNLPADTVFNKETQDKMFLFLAKEAVGGGKTSSEKRSRLRGIWEGFKHVDNNTLNK 240
Query: 254 VIAEIEQ 260
VIAEIEQ
Sbjct: 241 VIAEIEQ 247
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.312 0.129 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 109,494,739
Number of Sequences: 891141
Number of extensions: 3711035
Number of successful extensions: 5584
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 5582
Number of HSP's gapped (non-prelim): 1
length of query: 260
length of database: 279,076,897
effective HSP length: 118
effective length of query: 142
effective length of database: 173,922,259
effective search space: 24696960778
effective search space used: 24696960778
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 89 (38.9 bits)
Query= orf46 [8867-9453]
(585 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064762.1| (NC_002519) gp25 [Roseophage SIO1] >gi|994... 1030 0.0
>ref|NP_064762.1| (NC_002519) gp25 [Roseophage SIO1]
gb|AAG02607.1|AF189021_27 (AF189021) Roseophage SIO1 complete genome
Length = 564
Score = 1030 bits (2664), Expect = 0.0
Identities = 531/542 (97%), Positives = 531/542 (97%)
Query: 44 KLRAGSGPTQTDRNNAVFVENLRNVNQDLASGMSPDQVASKYGATFANLSLNSEQKAVLT 103
KLRAGSGPTQTDRNNAVFVENLRNVNQDLASGMSPDQVASKYGATFANLSLNSEQKAVLT
Sbjct: 23 KLRAGSGPTQTDRNNAVFVENLRNVNQDLASGMSPDQVASKYGATFANLSLNSEQKAVLT 82
Query: 104 KTIGEDIFYVPKQVETPVDTATEMFNNTDETVRLGLIDIETKKAEAEGETISQDVAAERA 163
KTIGEDIFYVPKQVETPVDTATEMFNNTDETVRLGLIDIETKKAEAEGETISQDVAAERA
Sbjct: 83 KTIGEDIFYVPKQVETPVDTATEMFNNTDETVRLGLIDIETKKAEAEGETISQDVAAERA 142
Query: 164 RNNYAAFQVAANAGILQGNIDFSTGFDQNMKTLDSFIKAVSAGLQVEQRGENLSLETLRQ 223
RNNYAAFQVAANAGILQGNIDFSTGFDQNMKTLDSFIKAVSAGLQVEQRGENLSLETLRQ
Sbjct: 143 RNNYAAFQVAANAGILQGNIDFSTGFDQNMKTLDSFIKAVSAGLQVEQRGENLSLETLRQ 202
Query: 224 LQDGFLLLKSNPSFQKPTGKAALEQWALMETRLQSIEDVFTRLEDYDAKGATAKAKQLMG 283
LQDGFLLLKSNPSFQKPTGKAALEQWALMETRLQSIEDVFTRLEDYDAKGATAKAKQLMG
Sbjct: 203 LQDGFLLLKSNPSFQKPTGKAALEQWALMETRLQSIEDVFTRLEDYDAKGATAKAKQLMG 262
Query: 284 VISLSGKSPLSALAAKDSDFMIKMXXXXXXXXXXXLANMGEIPEVDHSALNFDPVILELM 343
VISLSGKSPLSALAAKDSDFMIKM LANMGEIPEVDHSALNFDPVILELM
Sbjct: 263 VISLSGKSPLSALAAKDSDFMIKMAATIADDITADLANMGEIPEVDHSALNFDPVILELM 322
Query: 344 GVAPSGQQTGDVPEGGFDIPPSPFPTELGEKYEGMKIVEKAKFRAYHRSAITGLEAGSLD 403
GVAPSGQQTGDVPEGGFDIPPSPFPTELGEKYEGMKIVEKAKFRAYHRSAITGLEAGSLD
Sbjct: 323 GVAPSGQQTGDVPEGGFDIPPSPFPTELGEKYEGMKIVEKAKFRAYHRSAITGLEAGSLD 382
Query: 404 SPEAINAYASSVTALSYDLTQNDKPSSQNFDVLFSNRNIARVNALQAQGGEAGRVAGNLR 463
SPEAINAYASSVTALSYDLTQNDKPSSQNFDVLFSNRNIARVNALQAQGGEAGRVAGNLR
Sbjct: 383 SPEAINAYASSVTALSYDLTQNDKPSSQNFDVLFSNRNIARVNALQAQGGEAGRVAGNLR 442
Query: 464 AQMGAALQHNQARYGVLAAGLVSRIPQVKIDETNGKFTLDVKDDPVLQEIAAVVSVYYGG 523
AQMGAALQHNQARYGVLAAGLVSRIPQVKIDETNGKFTLDVKDDPVLQEIAAVVSVYYGG
Sbjct: 443 AQMGAALQHNQARYGVLAAGLVSRIPQVKIDETNGKFTLDVKDDPVLQEIAAVVSVYYGG 502
Query: 524 DFEALWAEGGTAKISLKNRLAREGKITPDSPEFERFDAATKVLESSLWKGLASRYNTVRG 583
DFEALWAEGGTAKISLKNRLAREGKITPDSPEFERFDAATKVLESSLWKGLASRYNTVRG
Sbjct: 503 DFEALWAEGGTAKISLKNRLAREGKITPDSPEFERFDAATKVLESSLWKGLASRYNTVRG 562
Query: 584 IP 585
IP
Sbjct: 563 IP 564
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.313 0.130 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 282,863,272
Number of Sequences: 891141
Number of extensions: 11192938
Number of successful extensions: 23388
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 23387
Number of HSP's gapped (non-prelim): 1
length of query: 585
length of database: 279,076,897
effective HSP length: 126
effective length of query: 459
effective length of database: 166,793,131
effective search space: 76558047129
effective search space used: 76558047129
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 94 (40.8 bits)
Query= orf47 [9454-10939]
(1484 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064763.1| (NC_002519) gp26 [Roseophage SIO1] >gi|994... 2839 0.0
ref|NP_415890.1| (NC_000913) putative membrane protein [Esc... 43 0.023
sp|P76072|STFR_ECOLI SIDE TAIL FIBER PROTEIN HOMOLOG FROM L... 43 0.023
>ref|NP_064763.1| (NC_002519) gp26 [Roseophage SIO1]
gb|AAG02609.1|AF189021_29 (AF189021) Roseophage SIO1 complete genome
Length = 1484
Score = 2839 bits (7360), Expect = 0.0
Identities = 1464/1484 (98%), Positives = 1464/1484 (98%)
Query: 1 MDSIASFXXXXXXXXXXXXQSKSKPIDPNSPLEIETVEAQAYLLDEDPQVISETRANQDF 60
MDSIASF QSKSKPIDPNSPLEIETVEAQAYLLDEDPQVISETRANQDF
Sbjct: 1 MDSIASFEELEGGLEEETTQSKSKPIDPNSPLEIETVEAQAYLLDEDPQVISETRANQDF 60
Query: 61 THEGLAKQYPDLTRYLDTLYEAGVSVEEAGRLAREHVERKAIAVAPQEFIFSSMLMVDDE 120
THEGLAKQYPDLTRYLDTLYEAGVSVEEAGRLAREHVERKAIAVAPQEFIFSSMLMVDDE
Sbjct: 61 THEGLAKQYPDLTRYLDTLYEAGVSVEEAGRLAREHVERKAIAVAPQEFIFSSMLMVDDE 120
Query: 121 TVNPESLRVLTNYEKISSRISKRLEENDPSTFKWLTAGALNTARDFTVGVLEMAIRRDSS 180
TVNPESLRVLTNYEKISSRISKRLEENDPSTFKWLTAGALNTARDFTVGVLEMAIRRDSS
Sbjct: 121 TVNPESLRVLTNYEKISSRISKRLEENDPSTFKWLTAGALNTARDFTVGVLEMAIRRDSS 180
Query: 181 LSQKYADSLFMEEDEFNDFWDKEIADAEAKGLFNIREYESLKELQALVDNFGTDTDAGFN 240
LSQKYADSLFMEEDEFNDFWDKEIADAEAKGLFNIREYESLKELQALVDNFGTDTDAGFN
Sbjct: 181 LSQKYADSLFMEEDEFNDFWDKEIADAEAKGLFNIREYESLKELQALVDNFGTDTDAGFN 240
Query: 241 QLLALADIATLGGTKTVGRLASAGVKKAATGTRSVVSDLLASKTVSEAVTATKGIAEGGK 300
QLLALADIATLGGTKTVGRLASAGVKKAATGTRSVVSDLLASKTVSEAVTATKGIAEGGK
Sbjct: 241 QLLALADIATLGGTKTVGRLASAGVKKAATGTRSVVSDLLASKTVSEAVTATKGIAEGGK 300
Query: 301 ATVKQLNSARPSPAVAYKAGPSTMDPKPSVNTPSAAVVTEGTKRSMLFEKMAEMLSSPFA 360
ATVKQLNSARPSPAVAYKAGPSTMDPKPSVNTPSAAVVTEGTKRSMLFEKMAEMLSSPFA
Sbjct: 301 ATVKQLNSARPSPAVAYKAGPSTMDPKPSVNTPSAAVVTEGTKRSMLFEKMAEMLSSPFA 360
Query: 361 GKTFTTKSLAEATTEVADRLVAQSTNAFVKVSRRRAEGSDNYIYTALLGKADTGKPFTTK 420
GKTFTTKSLAEATTEVADRLVAQSTNAFVKVSRRRAEGSDNYIYTALLGKADTGKPFTTK
Sbjct: 361 GKTFTTKSLAEATTEVADRLVAQSTNAFVKVSRRRAEGSDNYIYTALLGKADTGKPFTTK 420
Query: 421 AAAMKAVKDDPRYKAVRLNSDRTKAFGVDEDKRGWYLEYSERVDTSRLATEIEDVNVEEG 480
AAAMKAVKDDPRYKAVRLNSDRTKAFGVDEDKRGWYLEYSERVDTSRLATEIEDVNVEEG
Sbjct: 421 AAAMKAVKDDPRYKAVRLNSDRTKAFGVDEDKRGWYLEYSERVDTSRLATEIEDVNVEEG 480
Query: 481 FLKRSAASLFSAGQTALGPRLGFMLNAAEGLVARVSKEADVAFKDISKLSKGESEELNKI 540
FLKRSAASLFSAGQTALGPRLGFMLNAAEGLVARVSKEADVAFKDISKLSKGESEELNKI
Sbjct: 481 FLKRSAASLFSAGQTALGPRLGFMLNAAEGLVARVSKEADVAFKDISKLSKGESEELNKI 540
Query: 541 ITSYRDSPLGDTELDLAAQRGAPSSAKFEQDFMAVNGRVPSEQQMKAYRALVDFNNSSWN 600
ITSYRDSPLGDTELDLAAQRGAPSSAKFEQDFMAVNGRVPSEQQMKAYRALVDFNNSSWN
Sbjct: 541 ITSYRDSPLGDTELDLAAQRGAPSSAKFEQDFMAVNGRVPSEQQMKAYRALVDFNNSSWN 600
Query: 601 VKATEILKKVTERGGWTVDVSEGYTTIGVPVTVADDAVVFSRLQGSVRGSAVGDRIVYKL 660
VKATEILKKVTERGGWTVDVSEGYTTIGVPVTVADDAVVFSRLQGSVRGSAVGDRIVYKL
Sbjct: 601 VKATEILKKVTERGGWTVDVSEGYTTIGVPVTVADDAVVFSRLQGSVRGSAVGDRIVYKL 660
Query: 661 DEPFEDANGNFFEYVTDVADARVPQKSDVLGYNFGGSRNNETTNFFVGTLFDVEFAXXXX 720
DEPFEDANGNFFEYVTDVADARVPQKSDVLGYNFGGSRNNETTNFFVGTLFDVEFA
Sbjct: 661 DEPFEDANGNFFEYVTDVADARVPQKSDVLGYNFGGSRNNETTNFFVGTLFDVEFAGGKK 720
Query: 721 XXXXFRSLLGSYSSKEATSAARELNNIQETIGGFLKATGLKSISKLELSGEDLDRVNDII 780
FRSLLGSYSSKEATSAARELNNIQETIGGFLKATGLKSISKLELSGEDLDRVNDII
Sbjct: 721 GKGGFRSLLGSYSSKEATSAARELNNIQETIGGFLKATGLKSISKLELSGEDLDRVNDII 780
Query: 781 SRNNKWNPSVENFDDLKDLSQRHKESFANKFEVKRRDQQVDTEITDGYGLSVGDYQSRRV 840
SRNNKWNPSVENFDDLKDLSQRHKESFANKFEVKRRDQQVDTEITDGYGLSVGDYQSRRV
Sbjct: 781 SRNNKWNPSVENFDDLKDLSQRHKESFANKFEVKRRDQQVDTEITDGYGLSVGDYQSRRV 840
Query: 841 ARKRGDAPLMEYGGGRVSNQDPITNILEQFQSTAYRYTHSKATQAAVNGWVDKARRMGNV 900
ARKRGDAPLMEYGGGRVSNQDPITNILEQFQSTAYRYTHSKATQAAVNGWVDKARRMGNV
Sbjct: 841 ARKRGDAPLMEYGGGRVSNQDPITNILEQFQSTAYRYTHSKATQAAVNGWVDKARRMGNV 900
Query: 901 TFDGPVPSDPNDFLRRAKIKGSEGVDADMAEQQAVIKRRMGLEERTDKENNFFIRMGQHI 960
TFDGPVPSDPNDFLRRAKIKGSEGVDADMAEQQAVIKRRMGLEERTDKENNFFIRMGQHI
Sbjct: 901 TFDGPVPSDPNDFLRRAKIKGSEGVDADMAEQQAVIKRRMGLEERTDKENNFFIRMGQHI 960
Query: 961 YDEGVFMGKGKGIKTNPEDWLDGAAGRVRAMVFHMKMGIFNPDQLVLNASHVAQIMAISP 1020
YDEGVFMGKGKGIKTNPEDWLDGAAGRVRAMVFHMKMGIFNPDQLVLNASHVAQIMAISP
Sbjct: 961 YDEGVFMGKGKGIKTNPEDWLDGAAGRVRAMVFHMKMGIFNPDQLVLNASHVAQIMAISP 1020
Query: 1021 KAGLKATAAVPIIAQLMRKTSKAAAKDIDALYANGFAGMTKEELLATVKYMRESGRDIVG 1080
KAGLKATAAVPIIAQLMRKTSKAAAKDIDALYANGFAGMTKEELLATVKYMRESGRDIVG
Sbjct: 1021 KAGLKATAAVPIIAQLMRKTSKAAAKDIDALYANGFAGMTKEELLATVKYMRESGRDIVG 1080
Query: 1081 TSVLERSGTAFDPKKNAASEFLELGLTPYKMGELFGRIASVATAVVEHNAKKISDDVFSE 1140
TSVLERSGTAFDPKKNAASEFLELGLTPYKMGELFGRIASVATAVVEHNAKKISDDVFSE
Sbjct: 1081 TSVLERSGTAFDPKKNAASEFLELGLTPYKMGELFGRIASVATAVVEHNAKKISDDVFSE 1140
Query: 1141 AGLQYVANREQVLSFRMTSGQKGAYQEGAIMGLATQWMSYTNRFVDNILIGRDLTKAERA 1200
AGLQYVANREQVLSFRMTSGQKGAYQEGAIMGLATQWMSYTNRFVDNILIGRDLTKAERA
Sbjct: 1141 AGLQYVANREQVLSFRMTSGQKGAYQEGAIMGLATQWMSYTNRFVDNILIGRDLTKAERA 1200
Query: 1201 RMVGVNTVLFGTRGMGFSPKMTAALVAFGVDPEDENSTATLNAVKFGLFDFMLSQLVGED 1260
RMVGVNTVLFGTRGMGFSPKMTAALVAFGVDPEDENSTATLNAVKFGLFDFMLSQLVGED
Sbjct: 1201 RMVGVNTVLFGTRGMGFSPKMTAALVAFGVDPEDENSTATLNAVKFGLFDFMLSQLVGED 1260
Query: 1261 VSLGSRIAPMGGIVQQYSELFREDPLYATLGGPSAQIGFDTYKAIKATLETITGGHKQIA 1320
VSLGSRIAPMGGIVQQYSELFREDPLYATLGGPSAQIGFDTYKAIKATLETITGGHKQIA
Sbjct: 1261 VSLGSRIAPMGGIVQQYSELFREDPLYATLGGPSAQIGFDTYKAIKATLETITGGHKQIA 1320
Query: 1321 VEEFKVLMRNVKSVDIYAKVVELIETGEYRSKRRGLAGEFDEVGTGLAASVLGGATPMKV 1380
VEEFKVLMRNVKSVDIYAKVVELIETGEYRSKRRGLAGEFDEVGTGLAASVLGGATPMKV
Sbjct: 1321 VEEFKVLMRNVKSVDIYAKVVELIETGEYRSKRRGLAGEFDEVGTGLAASVLGGATPMKV 1380
Query: 1381 LNHYDAKDISYKEDAKFKDARRRIDTWATKALDLISTGEPDKMKQGRELYNDAMNLIEDG 1440
LNHYDAKDISYKEDAKFKDARRRIDTWATKALDLISTGEPDKMKQGRELYNDAMNLIEDG
Sbjct: 1381 LNHYDAKDISYKEDAKFKDARRRIDTWATKALDLISTGEPDKMKQGRELYNDAMNLIEDG 1440
Query: 1441 GFSEENQTKLYRAVVNLETMTDLVKRATGQSKASQLTAKASQGE 1484
GFSEENQTKLYRAVVNLETMTDLVKRATGQSKASQLTAKASQGE
Sbjct: 1441 GFSEENQTKLYRAVVNLETMTDLVKRATGQSKASQLTAKASQGE 1484
>ref|NP_415890.1| (NC_000913) putative membrane protein [Escherichia coli K12]
pir||G64887 probable tail fiber protein GP37 - Escherichia coli
gb|AAC74454.1| (AE000234) putative membrane protein [Escherichia coli K12]
Length = 1122
Score = 43.1 bits (100), Expect = 0.023
Identities = 61/306 (19%), Positives = 107/306 (34%), Gaps = 26/306 (8%)
Query: 246 ADIATLGGTKTVGRLASAGVKKAATGTRSVV----------------------SDLLASK 283
A + G KT ASA ++ AAT + S+ AS
Sbjct: 198 AAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASS 257
Query: 284 TVSEAVTATKGIAEGGKATVKQLNSARPSPAVAYKAGPSTMDPKPSVNTPSAAVVTEGTK 343
+ S A ++ KA +AR S A ++ + K + + ++A T +
Sbjct: 258 SASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQ 317
Query: 344 RSMLFEKMAEMLSSPFAGKTFTTKSLAEATTEVADRLVAQSTNAFVKVSRRRAEGSDNYI 403
S + S + + T EAT + + A+S +A K S A+ S+
Sbjct: 318 ASASATAAGKSAESAASSASTATTKAGEATEQAS--AAARSASA-AKTSETNAKASETSA 374
Query: 404 YTALLGKADTGKPFTTKAAAMKAVKDDPRYKAVRLNSDRTKAFGVDEDKRGWYLEYSERV 463
++ A + + A++ A KD+ +A S T A + G ++
Sbjct: 375 ESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSK 434
Query: 464 DTSRLATEIEDVNVEEGFLKRSAASLFSAGQTALG-PRLGFMLNAAEGLVARVSKEADVA 522
T+ A + + SA +L A T G +L N+ +A K A
Sbjct: 435 STAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSA 494
Query: 523 FKDISK 528
+ + K
Sbjct: 495 YDNAEK 500
>sp|P76072|STFR_ECOLI SIDE TAIL FIBER PROTEIN HOMOLOG FROM LAMBDOID PROPHAGE RAC
dbj|BAA14966.1| (D90774) Tail fiber protein GP37. [Escherichia coli]
dbj|BAA14975.1| (D90775) Tail fiber protein GP37. [Escherichia coli]
Length = 1120
Score = 43.1 bits (100), Expect = 0.023
Identities = 61/306 (19%), Positives = 107/306 (34%), Gaps = 26/306 (8%)
Query: 246 ADIATLGGTKTVGRLASAGVKKAATGTRSVV----------------------SDLLASK 283
A + G KT ASA ++ AAT + S+ AS
Sbjct: 196 AAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASS 255
Query: 284 TVSEAVTATKGIAEGGKATVKQLNSARPSPAVAYKAGPSTMDPKPSVNTPSAAVVTEGTK 343
+ S A ++ KA +AR S A ++ + K + + ++A T +
Sbjct: 256 SASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQ 315
Query: 344 RSMLFEKMAEMLSSPFAGKTFTTKSLAEATTEVADRLVAQSTNAFVKVSRRRAEGSDNYI 403
S + S + + T EAT + + A+S +A K S A+ S+
Sbjct: 316 ASASATAAGKSAESAASSASTATTKAGEATEQAS--AAARSASA-AKTSETNAKASETSA 372
Query: 404 YTALLGKADTGKPFTTKAAAMKAVKDDPRYKAVRLNSDRTKAFGVDEDKRGWYLEYSERV 463
++ A + + A++ A KD+ +A S T A + G ++
Sbjct: 373 ESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSK 432
Query: 464 DTSRLATEIEDVNVEEGFLKRSAASLFSAGQTALG-PRLGFMLNAAEGLVARVSKEADVA 522
T+ A + + SA +L A T G +L N+ +A K A
Sbjct: 433 STAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSA 492
Query: 523 FKDISK 528
+ + K
Sbjct: 493 YDNAEK 498
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.314 0.131 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 734,400,001
Number of Sequences: 891141
Number of extensions: 30566639
Number of successful extensions: 65712
Number of sequences better than 5.0e-02: 3
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 65711
Number of HSP's gapped (non-prelim): 5
length of query: 1484
length of database: 279,076,897
effective HSP length: 133
effective length of query: 1351
effective length of database: 160,555,144
effective search space: 216909999544
effective search space used: 216909999544
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 98 (42.4 bits)
Query= orf48 [10943-11088]
(144 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064764.1| (NC_002519) gp26.1 [Roseophage SIO1] >gi|9... 171 1e-042
sp|O76743|GLH4_CAEEL ATP-dependent RNA helicase glh-4 (Germ... 37 0.024
ref|NP_491207.1| (NM_058806) helicase [Caenorhabditis elega... 37 0.024
pir||T32759 hypothetical protein T12F5.3 - Caenorhabditis e... 37 0.024
>ref|NP_064764.1| (NC_002519) gp26.1 [Roseophage SIO1]
gb|AAG02608.1|AF189021_28 (AF189021) Roseophage SIO1 complete genome
Length = 144
Score = 171 bits (433), Expect = 1e-042
Identities = 93/144 (64%), Positives = 93/144 (64%)
Query: 1 MVAVTGTVLAIAGTTAGIVGTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXIRSNILXXXX 60
MVAVTGTVLAIAGTTAGIVGTV IRSNIL
Sbjct: 1 MVAVTGTVLAIAGTTAGIVGTVKAGKAARRSARAQQQAQEVQAKRQRRAAIRSNILASAR 60
Query: 61 XXXXXXXXXXXXXXXXXXXIGAGRSQLGAELGFGSQLSGLSANISKFDMQAQTYGDIAKL 120
IGAGRSQLGAELGFGSQLSGLSANISKFDMQAQTYGDIAKL
Sbjct: 61 AKASAQAAGTSQSSGLSGAIGAGRSQLGAELGFGSQLSGLSANISKFDMQAQTYGDIAKL 120
Query: 121 GFSAAGNAPTIGGYAEDLYGFFKK 144
GFSAAGNAPTIGGYAEDLYGFFKK
Sbjct: 121 GFSAAGNAPTIGGYAEDLYGFFKK 144
>sp|O76743|GLH4_CAEEL ATP-dependent RNA helicase glh-4 (Germline helicase-4)
pir||T43326 germline RNA helicase-4 - Caenorhabditis elegans
gb|AAC28387.1| (AF079508) germline RNA helicase-4 [Caenorhabditis elegans]
Length = 1156
Score = 37.4 bits (85), Expect = 0.024
Identities = 17/42 (40%), Positives = 22/42 (51%)
Query: 92 GFGSQLSGLSANISKFDMQAQTYGDIAKLGFSAAGNAPTIGG 133
GFG + +AN + FD Q +G AK GF G AP+ G
Sbjct: 84 GFGGNAAAAAANKTSFDQQGNGFGGAAKHGFGGVGGAPSSFG 125
>ref|NP_491207.1| (NM_058806) helicase [Caenorhabditis elegans]
gb|AAB96745.2| (AF039718) Hypothetical protein T12F5.3 [Caenorhabditis elegans]
Length = 1156
Score = 37.4 bits (85), Expect = 0.024
Identities = 17/42 (40%), Positives = 22/42 (51%)
Query: 92 GFGSQLSGLSANISKFDMQAQTYGDIAKLGFSAAGNAPTIGG 133
GFG + +AN + FD Q +G AK GF G AP+ G
Sbjct: 84 GFGGNAAAAAANKTSFDQQGNGFGGAAKHGFGGVGGAPSSFG 125
>pir||T32759 hypothetical protein T12F5.3 - Caenorhabditis elegans
Length = 1172
Score = 37.4 bits (85), Expect = 0.024
Identities = 17/42 (40%), Positives = 22/42 (51%)
Query: 92 GFGSQLSGLSANISKFDMQAQTYGDIAKLGFSAAGNAPTIGG 133
GFG + +AN + FD Q +G AK GF G AP+ G
Sbjct: 100 GFGGNAAAAAANKTSFDQQGNGFGGAAKHGFGGVGGAPSSFG 141
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.318 0.137 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 44,333,951
Number of Sequences: 891141
Number of extensions: 1142589
Number of successful extensions: 3103
Number of sequences better than 5.0e-02: 4
Number of HSP's better than 0.1 without gapping: 4
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3092
Number of HSP's gapped (non-prelim): 10
length of query: 144
length of database: 279,076,897
effective HSP length: 120
effective length of query: 24
effective length of database: 172,139,977
effective search space: 4131359448
effective search space used: 4131359448
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 83 (36.6 bits)
Query= orf49 [11089-11242]
(152 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064765.1| (NC_002519) gp26.2 [Roseophage SIO1] >gi|9... 299 5e-081
ref|NP_349180.1| (NC_003030) Predicted acetyltransferase [C... 47 7e-005
ref|NP_561792.1| (NC_003366) hypothetical protein [Clostrid... 45 3e-004
ref|NP_288995.1| (NC_002655) Z3699 gene product [Escherichi... 38 0.026
ref|NP_248202.1| (NC_000909) protease synthase and sporulat... 38 0.026
ref|NP_416929.1| (NC_000913) orf, hypothetical protein [Esc... 38 0.034
>ref|NP_064765.1| (NC_002519) gp26.2 [Roseophage SIO1]
gb|AAG02610.1|AF189021_30 (AF189021) Roseophage SIO1 complete genome
Length = 153
Score = 299 bits (766), Expect = 5e-081
Identities = 152/152 (100%), Positives = 152/152 (100%)
Query: 1 VLKTTATKIRTATPDDVFDILILAKEFSKEAPQSHKWNKEKTEQFILSALQNTNMTIFVI 60
VLKTTATKIRTATPDDVFDILILAKEFSKEAPQSHKWNKEKTEQFILSALQNTNMTIFVI
Sbjct: 2 VLKTTATKIRTATPDDVFDILILAKEFSKEAPQSHKWNKEKTEQFILSALQNTNMTIFVI 61
Query: 61 DVDGEIEGALVGLLSELYMSYTVQATELAWFVSKDYRGKPASLKLIKAFEKWAKESGAKQ 120
DVDGEIEGALVGLLSELYMSYTVQATELAWFVSKDYRGKPASLKLIKAFEKWAKESGAKQ
Sbjct: 62 DVDGEIEGALVGLLSELYMSYTVQATELAWFVSKDYRGKPASLKLIKAFEKWAKESGAKQ 121
Query: 121 IGMGDIEGISSLEKLYNRLGYERAETVYLKEL 152
IGMGDIEGISSLEKLYNRLGYERAETVYLKEL
Sbjct: 122 IGMGDIEGISSLEKLYNRLGYERAETVYLKEL 153
>ref|NP_349180.1| (NC_003030) Predicted acetyltransferase [Clostridium
acetobutylicum]
gb|AAK80520.1|AE007755_3 (AE007755) Predicted acetyltransferase [Clostridium acetobutylicum]
Length = 167
Score = 46.6 bits (109), Expect = 7e-005
Identities = 39/147 (26%), Positives = 71/147 (47%), Gaps = 15/147 (10%)
Query: 2 LKTTATKIRTATPDDVFDILILAKEFSKEAP------QSHKWNKEKTEQFILSALQNTNM 55
+++ + IR A D + I+ L K+ KE+P + + E +++I +NT +
Sbjct: 1 MQSESLVIRRAEEGDAYQIIRLVKQVIKESPFFERTPEEFNFTVEDEQEYI----KNTEL 56
Query: 56 TIFVIDVDGEIEGALVGLLSELYMSYTVQATELAWF-VSKDYRGKPASLKLIKAFEKWAK 114
I V++VDG+I G+ S L M ++ T L + K+Y GK +IK +WA+
Sbjct: 57 FI-VVEVDGKIVGSATLKRSSLSM---LRHTALFGITILKEYSGKGIGSLIIKRVFEWAE 112
Query: 115 ESGAKQIGMGDIEGISSLEKLYNRLGY 141
E+ ++I + LY + G+
Sbjct: 113 ENAIEKIDLEVFHDNFKAISLYKKFGF 139
>ref|NP_561792.1| (NC_003366) hypothetical protein [Clostridium perfringens]
dbj|BAB80582.1| (AP003188) hypothetical protein [Clostridium perfringens]
Length = 147
Score = 44.7 bits (104), Expect = 3e-004
Identities = 35/121 (28%), Positives = 54/121 (43%), Gaps = 4/121 (3%)
Query: 22 ILAKEFSKEAPQSHKWNKEKTEQFILSALQNTNMTIFVIDVDGEIEGALVGLLSELYMSY 81
+ K FSKE P + W+ E ++ + + V+ D +IEG ++G L + Y
Sbjct: 18 LFVKTFSKE-PWNEPWDFENAKKRLNDVVLTPGFRGAVLRNDEKIEGVILGNLEQWYDGE 76
Query: 82 TVQATELAWFVSKDYRGKPASLKLIKAFEKWAKESGAKQIGMGDIEGISSLEKLYNRLGY 141
E +FV +GK KL+ A E E I ++G S+ E YN+ GY
Sbjct: 77 HFCVKE--FFVDSSSQGKGTGKKLLNALEDILMEKEVGVIHFWTMKG-STAEAFYNKRGY 133
Query: 142 E 142
E
Sbjct: 134 E 134
>ref|NP_288995.1| (NC_002655) Z3699 gene product [Escherichia coli O157:H7 EDL933]
ref|NP_311332.1| (NC_002695) hypothetical protein [Escherichia coli O157:H7]
gb|AAG57552.1|AE005473_7 (AE005473) Z3699 gene product [Escherichia coli O157:H7 EDL933]
dbj|BAB36728.1| (AP002561) hypothetical protein [Escherichia coli O157:H7]
Length = 178
Score = 38.1 bits (87), Expect = 0.026
Identities = 34/146 (23%), Positives = 64/146 (43%), Gaps = 12/146 (8%)
Query: 4 TTATKIRTATPDDVFDILILAKEFSKEAPQSHKWNKEKTEQFILSALQNTNMTIFVIDVD 63
T +IR +D +++ L + P WN + + + N ++++F++
Sbjct: 35 TNTMEIRVFRQEDFEEVITLWERCDLLRP----WNDPEMD---IERKMNHDVSLFLV--- 84
Query: 64 GEIEGALVGLLSELYMSYTVQATELAWFVSKDYRGKPASLKLIKAFEKWAKESGAKQIGM 123
E+ G +VG + Y + A L V ++RG+ + L+ EK G +I +
Sbjct: 85 AEVNGEVVGTVMGGYDGHRGSAYYLG--VHPEFRGRGIANALLNRLEKKLIARGCPKIQI 142
Query: 124 GDIEGISSLEKLYNRLGYERAETVYL 149
E + +Y RLGYE A+ + L
Sbjct: 143 NVPEDNDMVLGMYERLGYEHADVLSL 168
>ref|NP_248202.1| (NC_000909) protease synthase and sporulation negative regulator
Pai1, putative [Methanococcus jannaschii]
sp|Q58604|YC07_METJA Hypothetical protein MJ1207
pir||F64450 hypothetical protein MJ1207 - Methanococcus jannaschii
gb|AAB99211.1| (U67562) protease synthase and sporulation negative regulator Pai1,
putative [Methanococcus jannaschii]
Length = 226
Score = 38.1 bits (87), Expect = 0.026
Identities = 35/143 (24%), Positives = 64/143 (44%), Gaps = 7/143 (4%)
Query: 9 IRTATPDDVFDILILAKEFSKEAPQSHKWNKEKTEQFILSALQNTNMTIFVIDVDGEIEG 68
IR T DD+ D L L + + + + K+ + ++ FV +V+G+ G
Sbjct: 77 IRNVTKDDLDDFLELYFKAYRGFDKYYYKKKKWARWYFKWLMKRDEDGFFVCEVNGKPVG 136
Query: 69 ALV---GLLSELYMSYTVQATELAWFVSKDYRGKPASLKLIKAFEKWAKESGAKQIGMG- 124
+ +S + + E+ FV D+RG+ LI ++AK+ G + + +
Sbjct: 137 FVACDCNWISNIEKREVAEIHEI--FVDPDFRGRGIGTALINKAIEYAKKRGRRIVELWV 194
Query: 125 DIEGISSLEKLYNRLGYERAETV 147
+E ++E Y RLG+E E V
Sbjct: 195 GVENKGAIE-FYKRLGFEEKEVV 216
>ref|NP_416929.1| (NC_000913) orf, hypothetical protein [Escherichia coli K12]
pir||A65018 hypothetical protein b2434 - Escherichia coli (strain K-12)
gb|AAC75487.1| (AE000330) orf, hypothetical protein [Escherichia coli K12]
Length = 178
Score = 37.7 bits (86), Expect = 0.034
Identities = 34/146 (23%), Positives = 64/146 (43%), Gaps = 12/146 (8%)
Query: 4 TTATKIRTATPDDVFDILILAKEFSKEAPQSHKWNKEKTEQFILSALQNTNMTIFVIDVD 63
T +IR +D +++ L + P WN + + + N ++++F++
Sbjct: 35 TNTMEIRVFRQEDFEEVITLWERCDLLRP----WNDPEMD---IERKMNHDVSLFLV--- 84
Query: 64 GEIEGALVGLLSELYMSYTVQATELAWFVSKDYRGKPASLKLIKAFEKWAKESGAKQIGM 123
E+ G +VG + Y + A L V ++RG+ + L+ EK G +I +
Sbjct: 85 AEVNGDVVGTVMGGYDGHRGSAYYLG--VHPEFRGRGIANALLNRLEKKLIARGCPKIQI 142
Query: 124 GDIEGISSLEKLYNRLGYERAETVYL 149
E + +Y RLGYE A+ + L
Sbjct: 143 NVPEDNDMVLGMYERLGYEHADVLSL 168
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.315 0.132 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 77,103,875
Number of Sequences: 891141
Number of extensions: 2877335
Number of successful extensions: 5494
Number of sequences better than 5.0e-02: 6
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 5493
Number of HSP's gapped (non-prelim): 6
length of query: 152
length of database: 279,076,897
effective HSP length: 109
effective length of query: 43
effective length of database: 181,942,528
effective search space: 7823528704
effective search space used: 7823528704
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 85 (37.4 bits)
Query= orf50 [11234-11950]
(715 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064766.1| (NC_002519) gp27 [Roseophage SIO1] >gi|994... 1397 0.0
>ref|NP_064766.1| (NC_002519) gp27 [Roseophage SIO1]
gb|AAG02611.1|AF189021_31 (AF189021) Roseophage SIO1 complete genome
Length = 715
Score = 1397 bits (3615), Expect = 0.0
Identities = 695/715 (97%), Positives = 695/715 (97%)
Query: 1 MPQSLTQRTVNTFIKGLITEASELTFPENASVDELNCSLGRDGTRRRRKAVTLEDNHVLS 60
MPQSLTQRTVNTFIKGLITEASELTFPENASVDELNCSLGRDGTRRRRKAVTLEDNHVLS
Sbjct: 1 MPQSLTQRTVNTFIKGLITEASELTFPENASVDELNCSLGRDGTRRRRKAVTLEDNHVLS 60
Query: 61 DVVVPEGALVQTLDWYNVAGQVNLEFLVVQVNNILYFYEKSTDPLSANKYSGSVDLNTHS 120
DVVVPEGALVQTLDWYNVAGQVNLEFLVVQVNNILYFYEKSTDPLSANKYSGSVDLNTHS
Sbjct: 61 DVVVPEGALVQTLDWYNVAGQVNLEFLVVQVNNILYFYEKSTDPLSANKYSGSVDLNTHS 120
Query: 121 ASNNLSPSEERVQVTSLNGYLIVASPAINTFYLGFNTSTEAFTATSISFKERDFEWQGSD 180
ASNNLSPSEERVQVTSLNGYLIVASPAINTFYLGFNTSTEAFTATSISFKERDFEWQGSD
Sbjct: 121 ASNNLSPSEERVQVTSLNGYLIVASPAINTFYLGFNTSTEAFTATSISFKERDFEWQGSD 180
Query: 181 VDVTSLYFGEGTSVSNQRIYDTYNVGWVGPKGSAALNTYGSYIVYPALTHPWYSGKDANG 240
VDVTSLYFGEGTSVSNQRIYDTYNVGWVGPKGSAALNTYGSYIVYPALTHPWYSGKDANG
Sbjct: 181 VDVTSLYFGEGTSVSNQRIYDTYNVGWVGPKGSAALNTYGSYIVYPALTHPWYSGKDANG 240
Query: 241 AFNKADWLEIYTGSSLASNGHYVLDVFNKARTGLTTEVETGRFRSVAAYAGRVFYAGIDS 300
AFNKADWLEIYTGSSLASNGHYVLDVFNKARTGLTTEVETGRFRSVAAYAGRVFYAGIDS
Sbjct: 241 AFNKADWLEIYTGSSLASNGHYVLDVFNKARTGLTTEVETGRFRSVAAYAGRVFYAGIDS 300
Query: 301 AKNGGKVYFSRLTERMSDVGNCYQVNDPTSEVLSDLLDTDGGVVRIPDAHNIRKLHVLGA 360
AKNGGKVYFSRLTERMSDVGNCYQVNDPTSEVLSDLLDTDGGVVRIPDAHNIRKLHVLGA
Sbjct: 301 AKNGGKVYFSRLTERMSDVGNCYQVNDPTSEVLSDLLDTDGGVVRIPDAHNIRKLHVLGA 360
Query: 361 SLLVFAENGVWAVAGVDNVFRATEYAITRISDVGLSNENSFVVADGIPIWWGKTGIYAVQ 420
SLLVFAENGVWAVAGVDNVFRATEYAITRISDVGLSNENSFVVADGIPIWWGKTGIYAVQ
Sbjct: 361 SLLVFAENGVWAVAGVDNVFRATEYAITRISDVGLSNENSFVVADGIPIWWGKTGIYAVQ 420
Query: 421 QSENLNTPTAQNLSLSTIQTLWNNISNAKKAQVTVEYDKINQRVFWFYPDNDESVDYKYN 480
QSENLNTPTAQNLSLSTIQTLWNNISNAKKAQVTVEYDKINQRVFWFYPDNDESVDYKYN
Sbjct: 421 QSENLNTPTAQNLSLSTIQTLWNNISNAKKAQVTVEYDKINQRVFWFYPDNDESVDYKYN 480
Query: 481 NILVMDLALQAFYPWRVEDEAXXXXXXXXXXXXXXXXXXXXETQVVNGADVVVNGSDNVV 540
NILVMDLALQAFYPWRVEDEA ETQVVNGADVVVNGSDNVV
Sbjct: 481 NILVMDLALQAFYPWRVEDEASSTSYIIGTSYYGGLGSTSTETQVVNGADVVVNGSDNVV 540
Query: 541 ATLYRDYLEGDSEIKLLVRDGTTGKMTFATFRGDTYLDWGSADYKSFAEAGYDFMGDITT 600
ATLYRDYLEGDSEIKLLVRDGTTGKMTFATFRGDTYLDWGSADYKSFAEAGYDFMGDITT
Sbjct: 541 ATLYRDYLEGDSEIKLLVRDGTTGKMTFATFRGDTYLDWGSADYKSFAEAGYDFMGDITT 600
Query: 601 FKNAPYVTTYMRVTEDGYVASGAGYEFINPSSCLMSVSWNLSKSGSTPREIYKLKDVPVV 660
FKNAPYVTTYMRVTEDGYVASGAGYEFINPSSCLMSVSWNLSKSGSTPREIYKLKDVPVV
Sbjct: 601 FKNAPYVTTYMRVTEDGYVASGAGYEFINPSSCLMSVSWNLSKSGSTPREIYKLKDVPVV 660
Query: 661 NPNDLSSINYPTDTVVTKSKVRGRGRSMKFRFESVAGKDFHLVGYEVIGAKNNSY 715
NPNDLSSINYPTDTVVTKSKVRGRGRSMKFRFESVAGKDFHLVGYEVIGAKNNSY
Sbjct: 661 NPNDLSSINYPTDTVVTKSKVRGRGRSMKFRFESVAGKDFHLVGYEVIGAKNNSY 715
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.315 0.133 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 386,746,719
Number of Sequences: 891141
Number of extensions: 16467563
Number of successful extensions: 38280
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 38277
Number of HSP's gapped (non-prelim): 1
length of query: 715
length of database: 279,076,897
effective HSP length: 127
effective length of query: 588
effective length of database: 165,901,990
effective search space: 97550370120
effective search space used: 97550370120
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 95 (41.2 bits)
Query= orf51 [11939-12045]
(105 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.305 0.126 0.000
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 50,970,698
Number of Sequences: 891141
Number of extensions: 1523416
Number of successful extensions: 1934
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 1934
Number of HSP's gapped (non-prelim): 0
length of query: 105
length of database: 279,076,897
effective HSP length: 81
effective length of query: 24
effective length of database: 206,894,476
effective search space: 4965467424
effective search space used: 4965467424
T: 11
A: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.9 bits)
S2: 83 (36.6 bits)
Query= orf52 [12041-12291]
(249 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064767.1| (NC_002519) gp28 [Roseophage SIO1] >gi|994... 410 e-114
>ref|NP_064767.1| (NC_002519) gp28 [Roseophage SIO1]
gb|AAG02612.1|AF189021_32 (AF189021) Roseophage SIO1 complete genome
Length = 206
Score = 410 bits (1054), Expect = e-114
Identities = 206/206 (100%), Positives = 206/206 (100%)
Query: 44 MIATRFIPEHSQTLKLTSFSSSARPTHFSFPTRVKNIEFLDYNVSKAVGGVEYRRLKYLS 103
MIATRFIPEHSQTLKLTSFSSSARPTHFSFPTRVKNIEFLDYNVSKAVGGVEYRRLKYLS
Sbjct: 1 MIATRFIPEHSQTLKLTSFSSSARPTHFSFPTRVKNIEFLDYNVSKAVGGVEYRRLKYLS 60
Query: 104 PDEFFGLSDGRDSLASNVKQVADVGSDSILLIRNDAMPMYYTSFDDDTVVLDSYDASVDA 163
PDEFFGLSDGRDSLASNVKQVADVGSDSILLIRNDAMPMYYTSFDDDTVVLDSYDASVDA
Sbjct: 61 PDEFFGLSDGRDSLASNVKQVADVGSDSILLIRNDAMPMYYTSFDDDTVVLDSYDASVDA 120
Query: 164 ILTSAKTRAYGVKYPTFDSFSDTFVPDIDDTMFPFLLAEAKSTAMSLFKSGADPKIEQTA 223
ILTSAKTRAYGVKYPTFDSFSDTFVPDIDDTMFPFLLAEAKSTAMSLFKSGADPKIEQTA
Sbjct: 121 ILTSAKTRAYGVKYPTFDSFSDTFVPDIDDTMFPFLLAEAKSTAMSLFKSGADPKIEQTA 180
Query: 224 RRQKVYVQNDMHKVNTGRAKNNYGRN 249
RRQKVYVQNDMHKVNTGRAKNNYGRN
Sbjct: 181 RRQKVYVQNDMHKVNTGRAKNNYGRN 206
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.316 0.131 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 120,916,297
Number of Sequences: 891141
Number of extensions: 4511863
Number of successful extensions: 11138
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 11137
Number of HSP's gapped (non-prelim): 1
length of query: 249
length of database: 279,076,897
effective HSP length: 117
effective length of query: 132
effective length of database: 174,813,400
effective search space: 23075368800
effective search space used: 23075368800
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 89 (38.9 bits)
Query= orf53 [12291-12482]
(190 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064768.1| (NC_002519) gp29 [Roseophage SIO1] >gi|994... 379 e-104
>ref|NP_064768.1| (NC_002519) gp29 [Roseophage SIO1]
gb|AAG02613.1|AF189021_33 (AF189021) Roseophage SIO1 complete genome
Length = 190
Score = 379 bits (972), Expect = e-104
Identities = 190/190 (100%), Positives = 190/190 (100%)
Query: 1 MANVEHSSLTGSALHEPKGTATANSGEVYVANGSGSGTWQPIHRHLGAATAFSSTSPYAY 60
MANVEHSSLTGSALHEPKGTATANSGEVYVANGSGSGTWQPIHRHLGAATAFSSTSPYAY
Sbjct: 1 MANVEHSSLTGSALHEPKGTATANSGEVYVANGSGSGTWQPIHRHLGAATAFSSTSPYAY 60
Query: 61 TIDTDTVEKFLSFPVSSSHVEGFTVLTSPNLRFRYDDPTEITSLINVTMSSSQAGGVGHA 120
TIDTDTVEKFLSFPVSSSHVEGFTVLTSPNLRFRYDDPTEITSLINVTMSSSQAGGVGHA
Sbjct: 61 TIDTDTVEKFLSFPVSSSHVEGFTVLTSPNLRFRYDDPTEITSLINVTMSSSQAGGVGHA 120
Query: 121 VQWALFKNGVEIVGSRAIRTISSGSWGSISVTGVTTLNQNDYIEIKTKADADNVDVNYAN 180
VQWALFKNGVEIVGSRAIRTISSGSWGSISVTGVTTLNQNDYIEIKTKADADNVDVNYAN
Sbjct: 121 VQWALFKNGVEIVGSRAIRTISSGSWGSISVTGVTTLNQNDYIEIKTKADADNVDVNYAN 180
Query: 181 IYVSIIGMSA 190
IYVSIIGMSA
Sbjct: 181 IYVSIIGMSA 190
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.312 0.127 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 102,500,124
Number of Sequences: 891141
Number of extensions: 4058849
Number of successful extensions: 13891
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 13890
Number of HSP's gapped (non-prelim): 1
length of query: 190
length of database: 279,076,897
effective HSP length: 113
effective length of query: 77
effective length of database: 178,377,964
effective search space: 13735103228
effective search space used: 13735103228
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 87 (38.1 bits)
Query= orf54 [12521-12835]
(313 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.316 0.132 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 164,565,025
Number of Sequences: 891141
Number of extensions: 6576886
Number of successful extensions: 16257
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 16257
Number of HSP's gapped (non-prelim): 0
length of query: 313
length of database: 279,076,897
effective HSP length: 120
effective length of query: 193
effective length of database: 172,139,977
effective search space: 33223015561
effective search space used: 33223015561
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 90 (39.3 bits)
Query= orf55 [12839-13095]
(255 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064769.1| (NC_002519) gp30 [Roseophage SIO1] >gi|994... 490 e-138
ref|NP_173855.1| (NM_102291) unknown protein [Arabidopsis t... 43 0.003
gb|AAF97958.1|AC000103_8 (AC000103) F21J9.12 [Arabidopsis t... 43 0.003
pir||T35661 probable chromosome associated protein - Strept... 42 0.006
pir||I51116 NF-180 - sea lamprey >gi|632549|gb|AAA80106.1| ... 42 0.006
gb|AAD42939.2|AF086813_1 (AF086813) protective antigen [Str... 42 0.007
gb|AAB71984.1| (U73162) M-protein [Streptococcus equi] 42 0.007
gb|AAF73810.1|AF154038_1 (AF154038) surface protein PspC [S... 41 0.010
ref|NP_346601.1| (NC_003028) choline binding protein A [Str... 39 0.047
dbj|BAB29420.1| (AK014542) data source:MGD, source key:MGI:... 39 0.047
sp|Q9JKY5|HIPR_MOUSE Huntingtin interacting protein 1 relat... 39 0.047
>ref|NP_064769.1| (NC_002519) gp30 [Roseophage SIO1]
gb|AAG02614.1|AF189021_35 (AF189021) Roseophage SIO1 complete genome
Length = 255
Score = 490 bits (1261), Expect = e-138
Identities = 255/255 (100%), Positives = 255/255 (100%)
Query: 1 MSVFSEEQVTPATQSEQVSSFEEPTSPSVVNELVGEGRKFNDVEALPKGKLEADRFIEQM 60
MSVFSEEQVTPATQSEQVSSFEEPTSPSVVNELVGEGRKFNDVEALPKGKLEADRFIEQM
Sbjct: 1 MSVFSEEQVTPATQSEQVSSFEEPTSPSVVNELVGEGRKFNDVEALPKGKLEADRFIEQM 60
Query: 61 KQENAALKADLEKQAYRLGVTEHLKETASASTAELSDPNNNIGGTADVANTKPSSSEADI 120
KQENAALKADLEKQAYRLGVTEHLKETASASTAELSDPNNNIGGTADVANTKPSSSEADI
Sbjct: 61 KQENAALKADLEKQAYRLGVTEHLKETASASTAELSDPNNNIGGTADVANTKPSSSEADI 120
Query: 121 ESLVEQTLRKREQESVAKSNIALVESEREKAYGTEAAATVQQKASELGLPMAELQSMAAK 180
ESLVEQTLRKREQESVAKSNIALVESEREKAYGTEAAATVQQKASELGLPMAELQSMAAK
Sbjct: 121 ESLVEQTLRKREQESVAKSNIALVESEREKAYGTEAAATVQQKASELGLPMAELQSMAAK 180
Query: 181 SPAAFMQLMGKPAPRSNPLVQGSIRTEGSTMQASSERDFGYYQRLRKENSSLYYKPSTQR 240
SPAAFMQLMGKPAPRSNPLVQGSIRTEGSTMQASSERDFGYYQRLRKENSSLYYKPSTQR
Sbjct: 181 SPAAFMQLMGKPAPRSNPLVQGSIRTEGSTMQASSERDFGYYQRLRKENSSLYYKPSTQR 240
Query: 241 AMMADADRLGDNFYR 255
AMMADADRLGDNFYR
Sbjct: 241 AMMADADRLGDNFYR 255
>ref|NP_173855.1| (NM_102291) unknown protein [Arabidopsis thaliana]
Length = 1730
Score = 43.1 bits (100), Expect = 0.003
Identities = 31/128 (24%), Positives = 60/128 (46%), Gaps = 7/128 (5%)
Query: 43 VEALPKGKLEADRFIEQMKQENAALKADLEKQAYRLGVTEHLKETASASTAELSDPNNNI 102
++ + ++E IE++K E AL + L + L + E TA + + L++ N N+
Sbjct: 845 IQEVQLARVEEQEEIEKVKSEVDALTSKLAETQTALKLVEDALSTAEDNISRLTEENRNV 904
Query: 103 GGTADVANTKPSSSEADIESLVEQTLRKREQESV-------AKSNIALVESEREKAYGTE 155
+ A + + AD S+ + +S A+ NI+ + SE+E+A G
Sbjct: 905 QAAKENAELELQKAVADASSVASELDEVLATKSTLEAALMQAERNISDIISEKEEAQGRT 964
Query: 156 AAATVQQK 163
A A ++Q+
Sbjct: 965 ATAEMEQE 972
>gb|AAF97958.1|AC000103_8 (AC000103) F21J9.12 [Arabidopsis thaliana]
Length = 1864
Score = 43.1 bits (100), Expect = 0.003
Identities = 31/128 (24%), Positives = 60/128 (46%), Gaps = 7/128 (5%)
Query: 43 VEALPKGKLEADRFIEQMKQENAALKADLEKQAYRLGVTEHLKETASASTAELSDPNNNI 102
++ + ++E IE++K E AL + L + L + E TA + + L++ N N+
Sbjct: 875 IQEVQLARVEEQEEIEKVKSEVDALTSKLAETQTALKLVEDALSTAEDNISRLTEENRNV 934
Query: 103 GGTADVANTKPSSSEADIESLVEQTLRKREQESV-------AKSNIALVESEREKAYGTE 155
+ A + + AD S+ + +S A+ NI+ + SE+E+A G
Sbjct: 935 QAAKENAELELQKAVADASSVASELDEVLATKSTLEAALMQAERNISDIISEKEEAQGRT 994
Query: 156 AAATVQQK 163
A A ++Q+
Sbjct: 995 ATAEMEQE 1002
>pir||T35661 probable chromosome associated protein - Streptomyces coelicolor
emb|CAA22420.1| (AL034447) putative chromosome associated protein [Streptomyces
coelicolor A3(2)]
Length = 1186
Score = 42.0 bits (97), Expect = 0.006
Identities = 41/179 (22%), Positives = 78/179 (42%), Gaps = 7/179 (3%)
Query: 32 ELVGEGRKFNDVEALPKGKLEADRFIEQMKQENAALKADLEKQAYRLGVTEHLKETASAS 91
EL E + + EA + LEA E ++ AA +ADLE++ L + E + A+ +
Sbjct: 329 ELEREAARVREQEAELEAALEA---AEHALEDTAAHRADLERE---LAMEERRLKDAARA 382
Query: 92 TAELSDPNNNIGGTADVANTKPSSSEADIESLVEQTLRKREQESVAKSNIALVESEREKA 151
A+ + +GG A ++ ++++A+IE L + ++ + A+ + +E +
Sbjct: 383 IADRRENLARLGGQVGAARSRAAAAQAEIERLAQARDESGQRAAAAQEEYEALRAEVDGL 442
Query: 152 YGTEAAATVQQKASELGLPMAELQSMAAKSPA-AFMQLMGKPAPRSNPLVQGSIRTEGS 209
+ + A+ L AE AA+ A A + R L G R +G+
Sbjct: 443 DAGDQELAERHDAARRALTEAEAALSAAREAATAAERQRAATQARHEALALGLRRKDGT 501
>pir||I51116 NF-180 - sea lamprey
gb|AAA80106.1| (U19361) NF-180 [Petromyzon marinus]
Length = 1110
Score = 42.0 bits (97), Expect = 0.006
Identities = 43/176 (24%), Positives = 78/176 (43%), Gaps = 12/176 (6%)
Query: 6 EEQVTPATQSEQVSSFEEPTSPSVVNELVGEGRKFNDVEALPKGKLEADRFIEQM----- 60
EE + A +E + EE + + E E + +VEA K ++EA+ +E+
Sbjct: 578 EEAIEKAEAAEAKAEVEEEEAEAEEEE--EEEAEEEEVEAETKEEVEAEAEVEEEGEAAE 635
Query: 61 --KQENAALKADLEKQAYRLGVTEHLKETASASTAELSDPNNNIGGTADV-ANTKPSSSE 117
+E A + ++ + + E +E A A+ AE + G DV A +K E
Sbjct: 636 EEAEEEEAEEEEVTSKKAKTQEAEVEEEEAEAAEAEAEEEAEEEAGEEDVEAESKEEEEE 695
Query: 118 ADIESLVEQTLRKREQESVAKSNIALVESEREKAYGTEAAATVQQKASELGLPMAE 173
E+ E+ + E+E V + + ++E +A E AA +++A+E AE
Sbjct: 696 DSKEADAEED--EAEEEEVKEEEVTKSDAEEAEAEAEEEAAKSEEEAAEEAKDEAE 749
Score = 39.3 bits (90), Expect = 0.036
Identities = 50/219 (22%), Positives = 87/219 (38%), Gaps = 11/219 (5%)
Query: 5 SEEQVTPATQSEQVSSFEEPTSPSVVNELVGEGRKFNDVEALPKGKLEADRFIEQMKQEN 64
+EE+ A E+V + E E E + + E K + +E+ + E
Sbjct: 608 AEEEEVEAETKEEVEAEAEVEEEGEAAEEEAEEEEAEEEEVTSKKAKTQEAEVEEEEAEA 667
Query: 65 AALKAD--LEKQAYRLGVTEHLKETASASTAELSDPNNNIGGTADVAN---TKPSSSEAD 119
A +A+ E++A V KE + E +D + +V TK + EA+
Sbjct: 668 AEAEAEEEAEEEAGEEDVEAESKEEEEEDSKE-ADAEEDEAEEEEVKEEEVTKSDAEEAE 726
Query: 120 IESLVEQTLRKREQESVAKSNIALVESEREKAYGTEAAA--TVQQKASELGLPMAELQSM 177
E+ E + E AK E+E E TEAA ++AS+ P E++
Sbjct: 727 AEAEEEAAKSEEEAAEEAKDEAEEEEAEEEAVEETEAATEEAEAKEASDDEKPEEEVKE- 785
Query: 178 AAKSPAAFMQLMGKPAPRSNPLVQGSIRTEGSTMQASSE 216
+++P A + P P++ P + + E T + E
Sbjct: 786 -SEAPVA-PEAKKAPEPKAAPKKKAPAKVESPTSEPEDE 822
>gb|AAD42939.2|AF086813_1 (AF086813) protective antigen [Streptococcus pyogenes]
Length = 570
Score = 41.6 bits (96), Expect = 0.007
Identities = 36/145 (24%), Positives = 66/145 (44%), Gaps = 16/145 (11%)
Query: 38 RKFNDVEALPKGKLEADRFIEQMKQENAALKADLEKQAYRLGVTEHLKETASASTAELSD 97
+K +++ L K +AD + A+L+ +LEK L V+E L E+ AEL
Sbjct: 338 QKLAEIDQLTADKAKADAELAAANDTIASLQTELEKVKTELAVSERLIESGKREIAELEK 397
Query: 98 PNNNIGGTADVANTKPSSSEADIESLVEQTLRKREQESVAKSNIALVESERE--KAYGTE 155
D ++ + S+A++ L +Q++ + + +A +E E E KA +
Sbjct: 398 QK-------DASDKALAESQANVAEL-------EKQKAASDAKVAELEKEVEAAKAEVAD 443
Query: 156 AAATVQQKASELGLPMAELQSMAAK 180
A + +K EL E +++ AK
Sbjct: 444 LKAQLAKKEEELEAVKKEKEALEAK 468
>gb|AAB71984.1| (U73162) M-protein [Streptococcus equi]
Length = 534
Score = 41.6 bits (96), Expect = 0.007
Identities = 36/145 (24%), Positives = 67/145 (45%), Gaps = 16/145 (11%)
Query: 38 RKFNDVEALPKGKLEADRFIEQMKQENAALKADLEKQAYRLGVTEHLKETASASTAELSD 97
+K +++ L K +AD + A+L+ +LEK L V+E L E+ AEL
Sbjct: 302 QKLAEIDQLTADKAKADAELAAANDTIASLQTELEKAKTELAVSERLIESGKREIAELQK 361
Query: 98 PNNNIGGTADVANTKPSSSEADIESLVEQTLRKREQESVAKSNIALVESERE--KAYGTE 155
+ +D A + ++ A++E +Q++ + + +A +E E E KA +
Sbjct: 362 QKD----ASDKALVESQANVAELE----------KQKAASDAKVAELEKEVEAAKAEVAD 407
Query: 156 AAATVQQKASELGLPMAELQSMAAK 180
A + +K EL E +++ AK
Sbjct: 408 LKAQLAKKEEELEAVKKEKEALEAK 432
>gb|AAF73810.1|AF154038_1 (AF154038) surface protein PspC [Streptococcus pneumoniae]
Length = 660
Score = 41.2 bits (95), Expect = 0.010
Identities = 59/251 (23%), Positives = 103/251 (40%), Gaps = 35/251 (13%)
Query: 15 SEQVSSFEEPTSPSVVNELVGEGRKFNDV---------EALPKGKLEADRFIEQM--KQE 63
SEQ S + + V+ EL +K ND E LP L+ ++ + + K E
Sbjct: 253 SEQDKS-KRRANREVLGELATPDKKENDAKSSDSSVGEETLPSPSLKPEKKVAEAEKKVE 311
Query: 64 NAALKADLEKQAYRLGVTEHLKETASASTAELSDPNNNIGGTADVANTKPSSSEADIESL 123
A KA+ +K+ R T + T EL ++A + +A++E +
Sbjct: 312 EAKKKAEDQKEEDRRNYP-----TNTYKTLEL-----------EIAESDVEVKKAELELV 355
Query: 124 VEQTLRKREQESVAKSNIALVESEREKAYGTEAAATVQQKASELGLPMAELQSMAAKSPA 183
E+ R +E V ++ A VES++ +A E T ++KA E A + + PA
Sbjct: 356 KEEAKEPRNEEKVKQAK-AEVESKKAEATRLEKIKTDRKKAEEEAKRKAAEEDKVKEKPA 414
Query: 184 AFMQLMGKPAPRSNPLVQGSIRTEGSTMQ--ASSERDFGYYQRLRKENSSLYYK--PSTQ 239
Q PAP++ P + E + A + + Y +R +E + L + P T+
Sbjct: 415 E--QPQPAPAPKAAPAPKPENPAEQPKAEKPADQQAEEDYARRSEEEYNRLTQQQPPKTE 472
Query: 240 RAMMADADRLG 250
+ + G
Sbjct: 473 KPAQPSTPKTG 483
>ref|NP_346601.1| (NC_003028) choline binding protein A [Streptococcus pneumoniae
TIGR4]
gb|AAK76241.1| (AE007507) choline binding protein A [Streptococcus pneumoniae
TIGR4]
Length = 693
Score = 38.9 bits (89), Expect = 0.047
Identities = 56/264 (21%), Positives = 103/264 (38%), Gaps = 37/264 (14%)
Query: 5 SEEQVTPATQSEQVSSFEEPTSPSVVNELVGEGRKFNDV---------EALPKGKLEADR 55
+EE+ +++ + V EL +K ND E LP L+ ++
Sbjct: 272 AEEEAKRRADAKEQGKPKGRAKRGVPGELATPDKKENDAKSSDSSVGEETLPSPSLKPEK 331
Query: 56 FIEQM--KQENAALKADLEKQAYRLGVTEHLKETASASTAELSDPNNNIGGTADVANTKP 113
+ + K E A KA+ +K+ R T + T EL ++A +
Sbjct: 332 KVAEAEKKVEEAKKKAEDQKEEDRRNYP-----TNTYKTLEL-----------EIAESDV 375
Query: 114 SSSEADIESLVEQTLRKREQESVAKSNIALVESEREKAYGTEAAATVQQKASELGLPMAE 173
+A++E + E+ R +E V ++ A VES++ +A E T ++KA E A
Sbjct: 376 EVKKAELELVKEEAKEPRNEEKVKQAK-AEVESKKAEATRLEKIKTDRKKAEEEAKRKAA 434
Query: 174 LQSMAAKSPAAFMQLMGKPAPRSN-----PLVQGSIRTEGSTMQASSERDFGYYQRLRKE 228
+ + PA Q PAP++ P + + A + + Y +R +E
Sbjct: 435 EEDKVKEKPAE--QPQPAPAPKAEKPAPAPKPENPAEQPKAEKPADQQAEEDYARRSEEE 492
Query: 229 NSSLYYK--PSTQRAMMADADRLG 250
+ L + P T++ + G
Sbjct: 493 YNRLTQQQPPKTEKPAQPSTPKTG 516
>dbj|BAB29420.1| (AK014542) data source:MGD, source key:MGI:1352504,
evidence:ISS~huntingtin interacting protein 1
related~putative [Mus musculus]
Length = 1068
Score = 38.9 bits (89), Expect = 0.047
Identities = 52/187 (27%), Positives = 74/187 (38%), Gaps = 20/187 (10%)
Query: 46 LPKGKLEADRFIEQMKQENAALKADLEKQAYR----LGVTEHLK-ETASASTAELSDPNN 100
L K K+EA R+I Q+K + L+A+LE+Q + L E L+ E A +L N
Sbjct: 370 LEKIKMEAQRYISQLKGQVNGLEAELEEQRKQKQKALVDNEQLRHELAQLKALQLEGARN 429
Query: 101 NIGGTADVANTKPSSSEADIESLVEQTLR--------KREQESVAKSNIALVESEREKAY 152
G + A K S++EA L E+ R+ AK +S+ E A
Sbjct: 430 Q--GLREEAERKASATEARYSKLKEKHSELINTHAELLRKNADTAKQLTVTQQSQEEVAR 487
Query: 153 GTEAAA-TVQQKASELGLPMAELQSMAAKSPAAFMQLMGKPAPRSNPLVQGSIRTEGSTM 211
E A ++Q E + M E K G+ A L RTE S
Sbjct: 488 VKEQLAFQMEQAKRESEMKMEEQSDQLEKLKRELAARAGELARAQEAL----SRTEQSGS 543
Query: 212 QASSERD 218
+ SS D
Sbjct: 544 ELSSRLD 550
Score = 38.9 bits (89), Expect = 0.047
Identities = 42/166 (25%), Positives = 68/166 (40%), Gaps = 13/166 (7%)
Query: 12 ATQSEQVSSFEEPTSPSVVNELVGEGRKFNDVEALPKGKLEADRF--IEQMKQENAALKA 69
A + E+ + E +S E V F+ P G ++ DR IE +K+E L+A
Sbjct: 309 APEEEEPENLIEISSAPPAGEPVVVADLFDQTFGPPNGSMKDDRDLQIENLKREVETLRA 368
Query: 70 DLEKQAYRLGVTEHLKETASASTAELSDPNNNIGGTADVANTKPSSSEADIESLVEQTLR 129
+LEK +K A ++L N + + + + D E L + +
Sbjct: 369 ELEK----------IKMEAQRYISQLKGQVNGLEAELEEQRKQKQKALVDNEQLRHELAQ 418
Query: 130 KREQESVAKSNIALVESEREKAYGTEAA-ATVQQKASELGLPMAEL 174
+ + N L E KA TEA + +++K SEL AEL
Sbjct: 419 LKALQLEGARNQGLREEAERKASATEARYSKLKEKHSELINTHAEL 464
>sp|Q9JKY5|HIPR_MOUSE Huntingtin interacting protein 1 related (Hip1-related)
gb|AAF34662.1|AF221713_1 (AF221713) huntingtin interacting protein 1 related [Mus musculus]
Length = 1068
Score = 38.9 bits (89), Expect = 0.047
Identities = 52/187 (27%), Positives = 74/187 (38%), Gaps = 20/187 (10%)
Query: 46 LPKGKLEADRFIEQMKQENAALKADLEKQAYR----LGVTEHLK-ETASASTAELSDPNN 100
L K K+EA R+I Q+K + L+A+LE+Q + L E L+ E A +L N
Sbjct: 370 LEKIKMEAQRYISQLKGQVNGLEAELEEQRKQKQKALVDNEQLRHELAQLKALQLEGARN 429
Query: 101 NIGGTADVANTKPSSSEADIESLVEQTLR--------KREQESVAKSNIALVESEREKAY 152
G + A K S++EA L E+ R+ AK +S+ E A
Sbjct: 430 Q--GLREEAERKASATEARYSKLKEKHSELINTHAELLRKNADTAKQLTVTQQSQEEVAR 487
Query: 153 GTEAAA-TVQQKASELGLPMAELQSMAAKSPAAFMQLMGKPAPRSNPLVQGSIRTEGSTM 211
E A ++Q E + M E K G+ A L RTE S
Sbjct: 488 VKEQLAFQMEQAKRESEMKMEEQSDQLEKLKRELAARAGELARAQEAL----SRTEQSGS 543
Query: 212 QASSERD 218
+ SS D
Sbjct: 544 ELSSRLD 550
Score = 38.9 bits (89), Expect = 0.047
Identities = 42/166 (25%), Positives = 68/166 (40%), Gaps = 13/166 (7%)
Query: 12 ATQSEQVSSFEEPTSPSVVNELVGEGRKFNDVEALPKGKLEADRF--IEQMKQENAALKA 69
A + E+ + E +S E V F+ P G ++ DR IE +K+E L+A
Sbjct: 309 APEEEEPENLIEISSAPPAGEPVVVADLFDQTFGPPNGSMKDDRDLQIENLKREVETLRA 368
Query: 70 DLEKQAYRLGVTEHLKETASASTAELSDPNNNIGGTADVANTKPSSSEADIESLVEQTLR 129
+LEK +K A ++L N + + + + D E L + +
Sbjct: 369 ELEK----------IKMEAQRYISQLKGQVNGLEAELEEQRKQKQKALVDNEQLRHELAQ 418
Query: 130 KREQESVAKSNIALVESEREKAYGTEAA-ATVQQKASELGLPMAEL 174
+ + N L E KA TEA + +++K SEL AEL
Sbjct: 419 LKALQLEGARNQGLREEAERKASATEARYSKLKEKHSELINTHAEL 464
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.305 0.121 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 120,858,427
Number of Sequences: 891141
Number of extensions: 4509671
Number of successful extensions: 12991
Number of sequences better than 5.0e-02: 20
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 19
Number of HSP's that attempted gapping in prelim test: 12973
Number of HSP's gapped (non-prelim): 37
length of query: 255
length of database: 279,076,897
effective HSP length: 117
effective length of query: 138
effective length of database: 174,813,400
effective search space: 24124249200
effective search space used: 24124249200
T: 11
A: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (22.0 bits)
S2: 89 (38.9 bits)
Query= orf56 [13096-13177]
(80 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.311 0.126 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 42,978,439
Number of Sequences: 891141
Number of extensions: 1466131
Number of successful extensions: 4074
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4074
Number of HSP's gapped (non-prelim): 0
length of query: 80
length of database: 279,076,897
effective HSP length: 56
effective length of query: 24
effective length of database: 229,173,001
effective search space: 5500152024
effective search space used: 5500152024
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf57 [13175-13275]
(99 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
891,141 sequences; 279,076,897 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_064741.1| (NC_002519) gp5 [Roseophage SIO1] >gi|9944... 123 4e-028
>ref|NP_064741.1| (NC_002519) gp5 [Roseophage SIO1]
gb|AAG02587.1|AF189021_5 (AF189021) Roseophage SIO1 complete genome
Length = 271
Score = 123 bits (308), Expect = 4e-028
Identities = 62/62 (100%), Positives = 62/62 (100%)
Query: 1 MGARHFAERAQRVQNLTTMYQIKASDPTVAAHLSGKEFARLLADELGEPALFKENVSVSE 60
MGARHFAERAQRVQNLTTMYQIKASDPTVAAHLSGKEFARLLADELGEPALFKENVSVSE
Sbjct: 178 MGARHFAERAQRVQNLTTMYQIKASDPTVAAHLSGKEFARLLADELGEPALFKENVSVSE 237
Query: 61 QM 62
QM
Sbjct: 238 QM 239
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Mar 5, 2002 3:28 AM
Number of letters in database: 279,076,897
Number of sequences in database: 891,141
Lambda K H
0.313 0.125 5.49e-269
Gapped
Lambda K H
0.267 0.0410 5.49e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 31,193,457
Number of Sequences: 891141
Number of extensions: 829244
Number of successful extensions: 1660
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 1659
Number of HSP's gapped (non-prelim): 1
length of query: 99
length of database: 279,076,897
effective HSP length: 75
effective length of query: 24
effective length of database: 212,241,322
effective search space: 5093791728
effective search space used: 5093791728
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 83 (36.6 bits)