Heuristic Approach for Gene Prediction in Prokaryotes (Reload this page)
Reference:Besemer J. and Borodovsky M., Heuristic approach to deriving models for gene finding, NAR, 1999, Vol. 27, No. 19, pp. 3911-3920.
[ Download PDF ]

The models used by GeneMark.hmm 2.0 and GeneMark 2.4 are derived from parameters measured from the input sequences and knowledge gained through the study of various bacterial genomes.These models have been shown to accurately predict genes in bacterial, viral and plasmid DNA sequences. Please note that email is the only way to receive output for sequences larger than 1 MB.

UPDATE (June 1, 2001): Web site has been redesigned and moved a to new, more powerful server
Listing of previous updates


Gene Prediction Results

Information on input sequence

Sequence title: Thu Mar 28 11:07:19 EST 2002
Length:         47533 bp
G+C percentage: 59.38 %

Parse predicted by GeneMark.hmm 2.0

GeneMark.hmm PROKARYOTIC (Version 2.1)
Sequence file name: sequence,	RBS: N
Model file name: heuristic_no_rbs.mat
Model organism: Heuristic_model
Thu Mar 28 11:07:30 2002

Predicted genes
   Gene    Strand    LeftEnd    RightEnd       Gene     Class
    #                                         Length
    1        +          <3          71           69        1
    2        +          55        2076         2022        1
    3        +        2117        2395          279        1
    4        +        2461        4098         1638        1
    5        +        4082        5269         1188        1
    6        +        5285        5692          408        1
    7        +        5706        6800         1095        1
    8        +        7188        7280           93        1
    9        +        7280        7666          387        1
   10        +        7659        8348          690        1
   11        +        8361        9863         1503        1
   12        +       10049       10258          210        1
   13        +       10343       10744          402        1
   14        +       10840       10944          105        1
   15        +       10993       12804         1812        1
   16        +       12804       14057         1254        1
   17        +       14050       15303         1254        1
   18        +       15282       15926          645        1
   19        +       15939       16346          408        1
   20        +       16413       17015          603        1
   21        +       17025       18212         1188        1
   22        +       18205       18927          723        1
   23        +       18940       20274         1335        1
   24        +       20277       21626         1350        1
   25        +       21789       22040          252        1
   26        +       22042       22638          597        1
   27        +       22757       22990          234        1
   28        +       22995       25514         2520        1
   29        +       25519       26655         1137        1
   30        +       26869       27348          480        1
   31        +       27357       27623          267        1
   32        -       27642       27953          312        1
   33        -       27943       28167          225        1
   34        -       28139       28699          561        1
   35        -       28692       28871          180        1
   36        -       28864       29418          555        1
   37        -       29424       29909          486        1
   38        -       29968       32028         2061        1
   39        -       32088       32813          726        1
   40        -       32884       33951         1068        1
   41        -       33963       35090         1128        1
   42        -       35094       35945          852        1
   43        +       36032       36235          204        1
   44        +       36228       37877         1650        1
   45        +       37877       38383          507        1
   46        +       38383       38892          510        1
   47        +       38938       39273          336        1
   48        +       39261       39446          186        1
   49        +       39632       39790          159        1
   50        +       39810       40142          333        1
   51        +       40139       40591          453        1
   52        +       40595       40804          210        1
   53        +       40808       40996          189        1
   54        +       41053       41232          180        1
   55        -       41229       41495          267        1
   56        -       41492       41800          309        1
   57        -       41934       42347          414        1
   58        -       42344       42646          303        1
   59        -       42643       43014          372        1
   60        -       43011       45386         2376        1
   61        +       45428       45601          174        1
   62        +       45920       46672          753        1
   63        +       46891      >47532          642        1

Listing of GeneMark Predictions

                              GENEMARK PREDICTIONS

Sequence: Thu Mar 28 11:07:19 EST 2002
Sequence file: gm_sequence
Sequence length: 47533
GC Content:  59.38%
Window length: 96
Window step: 12
Threshold value: 0.500
---
Matrix: Heuristic model
Matrix author: MB/JDB
Matrix order: 2

List of Open reading frames predicted as CDSs, shown with alternate starts
(regions from start to stop codon w/ coding function >0.50)

Left      Right     DNA         Coding Avg   Start
end       end       Strand      Frame  Prob  Prob
--------  --------  ----------  -----  ----  ----

      55      2076  direct      fr 1   0.77  ....  
     196      2076  direct      fr 1   0.79  0.38  
     241      2076  direct      fr 1   0.80  0.23  
     271      2076  direct      fr 1   0.80  0.54  
     280      2076  direct      fr 1   0.80  0.71  
     316      2076  direct      fr 1   0.79  0.34  

    2117      2395  direct      fr 2   0.64  0.62  
    2210      2395  direct      fr 2   0.68  0.01  

    2377      4098  direct      fr 1   0.83  0.46  
    2461      4098  direct      fr 1   0.85  0.19  
    2470      4098  direct      fr 1   0.85  0.10  
    2494      4098  direct      fr 1   0.86  0.26  
    2500      4098  direct      fr 1   0.86  0.20  
    2707      4098  direct      fr 1   0.90  0.52  
    2725      4098  direct      fr 1   0.90  0.63  

    4082      5269  direct      fr 2   0.81  0.96  
    4106      5269  direct      fr 2   0.83  0.97  
    4118      5269  direct      fr 2   0.83  0.05  
    4163      5269  direct      fr 2   0.83  0.02  
    4220      5269  direct      fr 2   0.82  0.39  

    5285      5692  direct      fr 2   0.69  0.75  
    5315      5692  direct      fr 2   0.68  0.91  
    5333      5692  direct      fr 2   0.67  0.16  

    5706      6800  direct      fr 3   0.83  0.65  
    5733      6800  direct      fr 3   0.84  0.47  
    5766      6800  direct      fr 3   0.84  0.51  
    5781      6800  direct      fr 3   0.84  0.46  

    7149      7280  direct      fr 3   0.67  0.75  
    7173      7280  direct      fr 3   0.67  0.25  
    7188      7280  direct      fr 3   0.64  0.82  

    7280      7666  direct      fr 2   0.54  0.99  
    7364      7666  direct      fr 2   0.51  0.04  
    7511      7666  direct      fr 2   0.51  0.23  
    7577      7666  direct      fr 2   0.55  0.58  
    7583      7666  direct      fr 2   0.55  0.41  

    7659      8348  direct      fr 3   0.77  0.35  
    7722      8348  direct      fr 3   0.80  0.37  
    7758      8348  direct      fr 3   0.79  0.04  
    7878      8348  direct      fr 3   0.75  0.25  

    8361      9863  direct      fr 3   0.69  0.14  
    8649      9863  direct      fr 3   0.73  0.37  
    8670      9863  direct      fr 3   0.73  0.16  
    8724      9863  direct      fr 3   0.72  0.05  

   10100     10258  direct      fr 2   0.67  0.05  
   10118     10258  direct      fr 2   0.62  0.05  

   10301     10744  direct      fr 2   0.76  0.08  
   10343     10744  direct      fr 2   0.79  0.91  
   10400     10744  direct      fr 2   0.76  0.14  
   10412     10744  direct      fr 2   0.75  0.05  

   10780     10944  direct      fr 1   0.58  0.24  
   10864     10944  direct      fr 1   0.51  0.45  

   10948     12804  direct      fr 1   0.87  0.49  
   10993     12804  direct      fr 1   0.89  0.05  
   11059     12804  direct      fr 1   0.91  0.46  
   11143     12804  direct      fr 1   0.90  0.27  

   12804     14057  direct      fr 3   0.84  0.03  
   12882     14057  direct      fr 3   0.88  0.62  
   12945     14057  direct      fr 3   0.88  0.05  
   12981     14057  direct      fr 3   0.87  0.85  

   14050     15303  direct      fr 1   0.71  0.63  
   14107     15303  direct      fr 1   0.73  0.29  
   14185     15303  direct      fr 1   0.71  0.34  
   14203     15303  direct      fr 1   0.71  0.12  

   15282     15926  direct      fr 3   0.60  0.33  
   15339     15926  direct      fr 3   0.65  0.10  
   15432     15926  direct      fr 3   0.69  0.41  
   15555     15926  direct      fr 3   0.68  0.60  
   15564     15926  direct      fr 3   0.68  0.30  
   15642     15926  direct      fr 3   0.64  0.04  

   15939     16346  direct      fr 3   0.65  0.20  
   16095     16346  direct      fr 3   0.83  0.56  
   16134     16346  direct      fr 3   0.83  0.02  
   16224     16346  direct      fr 3   0.86  0.27  

   16350     17015  direct      fr 3   0.81  0.11  
   16413     17015  direct      fr 3   0.85  0.61  
   16455     17015  direct      fr 3   0.85  0.20  
   16494     17015  direct      fr 3   0.84  0.03  

   17025     18212  direct      fr 3   0.67  0.12  
   17193     18212  direct      fr 3   0.65  0.31  
   17205     18212  direct      fr 3   0.65  0.06  

   18205     18927  direct      fr 1   0.57  0.03  
   18283     18927  direct      fr 1   0.60  0.60  
   18691     18927  direct      fr 1   0.71  0.21  

   18940     20274  direct      fr 1   0.63  0.79  
   19126     20274  direct      fr 1   0.58  0.15  
   19231     20274  direct      fr 1   0.57  0.22  

   20271     21626  direct      fr 3   0.74  0.70  
   20277     21626  direct      fr 3   0.74  0.83  
   20775     21626  direct      fr 3   0.74  0.35  
   20931     21626  direct      fr 3   0.73  0.45  
   21159     21626  direct      fr 3   0.63  0.01  

   21789     22040  direct      fr 3   0.60  0.56  

   22018     22638  direct      fr 1   0.80  0.05  
   22042     22638  direct      fr 1   0.83  0.54  
   22099     22638  direct      fr 1   0.89  0.38  
   22117     22638  direct      fr 1   0.88  0.86  
   22165     22638  direct      fr 1   0.87  0.01  

   22757     22990  direct      fr 2   0.55  0.14  
   22913     22990  direct      fr 2   0.57  0.22  

   22995     25514  direct      fr 3   0.77  0.25  
   23040     25514  direct      fr 3   0.78  0.23  
   23079     25514  direct      fr 3   0.79  0.60  
   23094     25514  direct      fr 3   0.79  0.59  
   23172     25514  direct      fr 3   0.79  0.55  

   25519     26655  direct      fr 1   0.63  0.77  
   25630     26655  direct      fr 1   0.62  0.77  
   25663     26655  direct      fr 1   0.61  0.12  
   25687     26655  direct      fr 1   0.60  0.06  

   26926     27348  direct      fr 1   0.51  0.20  

   27357     27623  direct      fr 3   0.67  0.94  
   27363     27623  direct      fr 3   0.70  0.95  
   27402     27623  direct      fr 3   0.77  0.06  
   27408     27623  direct      fr 3   0.77  0.03  
   27420     27623  direct      fr 3   0.75  0.07  

   27642     27953  complement  fr 2   0.68  0.77  
   27642     27923  complement  fr 2   0.71  0.14  
   27642     27692  complement  fr 2   0.60  0.43  

   27943     28167  complement  fr 3   0.59  0.35  
   27943     28059  complement  fr 3   0.56  0.07  

   28139     28573  complement  fr 1   0.89  0.37  
   28139     28570  complement  fr 1   0.88  0.35  
   28139     28699  complement  fr 1   0.84  0.55  
   28139     28549  complement  fr 1   0.88  0.81  

   28692     28871  complement  fr 2   0.56  0.64  
   28692     28886  complement  fr 2   0.50  0.45  
   28692     28844  complement  fr 2   0.60  0.45  

   28864     29331  complement  fr 3   0.73  0.15  
   28864     29322  complement  fr 3   0.73  0.34  
   28864     29274  complement  fr 3   0.77  0.10  
   28864     29250  complement  fr 3   0.79  0.15  

   29424     29729  complement  fr 2   0.79  0.56  
   29424     29708  complement  fr 2   0.77  0.59  
   29424     29690  complement  fr 2   0.76  0.20  

   29968     31896  complement  fr 3   0.82  0.38  
   29968     31839  complement  fr 3   0.82  0.12  
   29968     31599  complement  fr 3   0.84  0.18  

   32088     32729  complement  fr 2   0.84  0.16  
   32088     32615  complement  fr 2   0.87  0.73  
   32088     32576  complement  fr 2   0.87  0.03  
   32088     32813  complement  fr 2   0.82  0.94  
   32088     32465  complement  fr 2   0.83  0.41  

   32884     33945  complement  fr 3   0.65  0.66  
   32884     33822  complement  fr 3   0.67  0.05  
   32884     33723  complement  fr 3   0.65  0.25  
   32884     33720  complement  fr 3   0.65  0.04  
   32884     33714  complement  fr 3   0.65  0.14  

   33963     35015  complement  fr 2   0.81  0.67  
   33963     34940  complement  fr 2   0.80  0.32  
   33963     34880  complement  fr 2   0.79  0.20  

   35094     35585  complement  fr 2   0.82  0.78  
   35094     35258  complement  fr 2   0.65  0.74  
   35094     35945  complement  fr 2   0.71  0.79  

   36228     37877  direct      fr 3   0.83  0.77  
   36273     37877  direct      fr 3   0.85  0.64  
   36300     37877  direct      fr 3   0.84  0.06  
   36309     37877  direct      fr 3   0.84  0.06  

   37877     38383  direct      fr 2   0.75  0.81  
   37898     38383  direct      fr 2   0.78  0.89  
   38030     38383  direct      fr 2   0.77  0.64  
   38099     38383  direct      fr 2   0.75  0.15  

   38383     38892  direct      fr 1   0.60  0.52  
   38440     38892  direct      fr 1   0.62  0.02  
   38449     38892  direct      fr 1   0.62  0.01  
   38473     38892  direct      fr 1   0.63  0.27  
   38518     38892  direct      fr 1   0.62  0.37  
   38620     38892  direct      fr 1   0.68  0.06  
   38713     38892  direct      fr 1   0.68  0.16  

   38938     39273  direct      fr 1   0.64  0.12  
   38974     39273  direct      fr 1   0.69  0.23  
   38992     39273  direct      fr 1   0.70  0.14  
   39022     39273  direct      fr 1   0.74  0.40  
   39064     39273  direct      fr 1   0.77  0.65  

   39252     39446  direct      fr 3   0.57  0.40  
   39261     39446  direct      fr 3   0.60  0.24  
   39306     39446  direct      fr 3   0.72  0.52  
   39378     39446  direct      fr 3   0.54  0.05  

   39587     39790  direct      fr 2   0.62  0.13  
   39632     39790  direct      fr 2   0.75  0.72  
   39722     39790  direct      fr 2   0.51  0.66  

   39777     40142  direct      fr 3   0.70  0.14  
   39810     40142  direct      fr 3   0.77  0.60  
   39855     40142  direct      fr 3   0.78  0.60  
   39900     40142  direct      fr 3   0.75  0.01  
   39969     40142  direct      fr 3   0.72  0.16  

   40139     40591  direct      fr 2   0.58  0.68  
   40187     40591  direct      fr 2   0.64  0.20  
   40226     40591  direct      fr 2   0.62  0.18  
   40262     40591  direct      fr 2   0.63  0.02  
   40286     40591  direct      fr 2   0.65  0.37  
   40361     40591  direct      fr 2   0.60  0.42  

   40595     40804  direct      fr 2   0.73  0.36  
   40718     40804  direct      fr 2   0.93  0.21  

   40808     40996  direct      fr 2   0.57  0.58  

   40999     41232  direct      fr 1   0.58  0.04  
   41017     41232  direct      fr 1   0.64  0.02  
   41053     41232  direct      fr 1   0.73  0.27  
   41068     41232  direct      fr 1   0.78  0.48  
   41125     41232  direct      fr 1   0.74  0.23  
   41152     41232  direct      fr 1   0.67  0.03  

   41229     41471  complement  fr 2   0.72  0.41  
   41229     41420  complement  fr 2   0.68  0.65  
   41229     41495  complement  fr 2   0.66  0.72  
   41229     41357  complement  fr 2   0.57  0.42  

   41492     41758  complement  fr 1   0.78  0.49  
   41492     41800  complement  fr 1   0.74  0.04  
   41492     41743  complement  fr 1   0.77  0.64  
   41492     41626  complement  fr 1   0.59  0.02  

   41934     42266  complement  fr 2   0.59  0.05  
   41934     42257  complement  fr 2   0.58  0.13  
   41934     42347  complement  fr 2   0.63  0.90  
   41934     42245  complement  fr 2   0.56  0.26  

   42344     42652  complement  fr 1   0.75  0.99  
   42344     42646  complement  fr 1   0.76  1.00  
   42344     42628  complement  fr 1   0.78  0.94  
   42344     42622  complement  fr 1   0.77  0.72  

   42643     43014  complement  fr 3   0.59  0.70  
   42643     42996  complement  fr 3   0.62  0.73  
   42643     42984  complement  fr 3   0.61  0.67  
   42643     42915  complement  fr 3   0.53  0.08  

   43011     45386  complement  fr 2   0.80  0.81  
   43011     45320  complement  fr 2   0.80  0.10  
   43011     45311  complement  fr 2   0.80  0.20  
   43011     45239  complement  fr 2   0.82  0.36  
   43011     45233  complement  fr 2   0.82  0.52  

   45842     46672  direct      fr 2   0.69  0.00  
   45851     46672  direct      fr 2   0.69  0.01  
   45896     46672  direct      fr 2   0.74  0.16  
   45905     46672  direct      fr 2   0.75  0.10  
   45920     46672  direct      fr 2   0.76  0.27  
   45941     46672  direct      fr 2   0.78  0.83  
   45980     46672  direct      fr 2   0.78  0.75  
   46004     46672  direct      fr 2   0.78  0.06  

   46891     47535  direct      fr 1   0.92  0.62  
   46915     47535  direct      fr 1   0.96  0.09  
   47170     47535  direct      fr 1   0.98  0.20  
   47245     47535  direct      fr 1   0.99  0.15  

List of Regions of interest
(regions from stop to stop codon w/ a signal in between)

   LEnd      REnd    Strand      Frame
 --------  --------  ----------- -----
       46      2076  direct      fr 1
     2102      2395  direct      fr 2
     2335      4098  direct      fr 1
     4067      5269  direct      fr 2
     5267      5692  direct      fr 2
     5598      6800  direct      fr 3
     6784      6969  direct      fr 1
     6897      7280  direct      fr 3
     6976      7170  direct      fr 1
     7274      7666  direct      fr 2
     7644      8348  direct      fr 3
     8346      9863  direct      fr 3
     9870     10073  direct      fr 3
    10046     10258  direct      fr 2
    10256     10744  direct      fr 2
    10681     10944  direct      fr 1
    10942     12804  direct      fr 1
    12786     14057  direct      fr 3
    14041     15303  direct      fr 1
    15138     15926  direct      fr 3
    15924     16346  direct      fr 3
    16344     17015  direct      fr 3
    17013     18212  direct      fr 3
    18190     18927  direct      fr 1
    18925     20274  direct      fr 1
    20214     21626  direct      fr 3
    21771     22040  direct      fr 3
    21991     22638  direct      fr 1
    22694     22990  direct      fr 2
    22965     25514  direct      fr 3
    25456     26655  direct      fr 1
    26857     27348  direct      fr 1
    26875     27249  complement  fr 3
    27339     27623  direct      fr 3
    27642     27989  complement  fr 2
    27943     28182  complement  fr 3
    28139     28795  complement  fr 1
    28692     28946  complement  fr 2
    28864     29436  complement  fr 3
    29424     29936  complement  fr 2
    29968     32181  complement  fr 3
    32088     32840  complement  fr 2
    32839     33495  direct      fr 1
    32884     33960  complement  fr 3
    33963     35096  complement  fr 2
    35094     35987  complement  fr 2
    35981     36235  direct      fr 2
    36156     37877  direct      fr 3
    37862     38383  direct      fr 2
    38368     38892  direct      fr 1
    38896     39273  direct      fr 1
    39045     39446  direct      fr 3
    39386     39790  direct      fr 2
    39768     40142  direct      fr 3
    40079     40591  direct      fr 2
    40589     40804  direct      fr 2
    40802     40996  direct      fr 2
    40951     41232  direct      fr 1
    41229     41516  complement  fr 2
    41492     41839  complement  fr 1
    41934     42374  complement  fr 2
    42344     42745  complement  fr 1
    42643     43068  complement  fr 3
    43011     45413  complement  fr 2
    43596     43823  direct      fr 3
    45361     45648  complement  fr 3
    45806     46672  direct      fr 2
    46855     47535  direct      fr 1


Protein translations of predicted genes

>Translation: 55..2076 (direct), 674 amino acids
MYSASDYYAESAERVIEGLLATTDEREFLTPSQWAEKKRHLPQQHSPMPGPFSFDDAPYW
REVVDCFDPYSPVHFVAVKKGAQVGATVSVLENLIGYGIDYVKTASMIFATVDDDVTKRR
VTNFILPMLRAAQASRSLIQANDFSKGAKRRGASAKGIEWAGGGVLYPFGARSPGKMRSF
PVPWLLRDEVSGWPQSVGKDGDPMKLTETRTNSFANSRKILDLSTPLLAGTDTITQRFEK
GDQRYYEVPCKHCGEYQRLYFRGNAKNGQGRLIWETEGGVLVPDSVRYVCPHCGGEMINE
DKVTIMGEGNWVPTAKPTRPDFRSYHLSAMYAPYYARSWQEIAQAWIECWDDERNQAKDL
DELQVFYNNDLGEAYELKSSRVKPREVYAHRRDYLRGEVPNEHAMAHTGGPIEAITMSVD
VQHTWLAVCTIGWAPSADRSGYAPYIIDYEHVEGDCKTIEGEGWQKLTEIISTRQYTSGG
RVYDIARVGIDASELTDVVYEYCNDWGENVMPIRGRDLPIKGAQIKYFNRQVNEKGVEYL
SVTVDLYKDRWSPTLRKEWSGQGEMPRGMLSAPVDMPDKHLKELTVEYKREERDPDTNKI
IRQVWHRPGGSRNELWDTLIYNTAIFESMVLEACEDIVGLEALVWPEFWAIAGKDPRRGA
TGGLCWDIAPDES*

>Translation: 2117..2395 (direct), 93 amino acids
MATTFDIQQRDRLAAILNAYLDAITALLTANVSEYTLNTGQGSQRVTRLDLQQLQENYGL
LYRQYDALVSRCGNGGVVQLVPEGAAPWLDRL*

>Translation: 2461..4098 (direct), 546 amino acids
MPVVADPSGPVMSVENTSRFNGRGDLAHAGYRYTRHDGEKYEGGLGPIEILYTDYWSQRR
RSNTFFKRNLYARGIIRRIVGNVVNTGQVLAANPNAAMLPIDEDAADTWAENVNDLFEAW
GELPEVCDFKKLYTFGQWQRVAKAEALIAGDVLVVEHHNSATGLPTYELIDGDMVQTPYD
AQNKIGKSHRIDNGVEMTKDGEHVAFWVVQEGFKHKRIKATGANGRRIAWLYYGTDKRHT
EARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGFIQRQIGGKTAGTKPFSSGAQRR
VQDTDFVGDGTERRTNIAETPFGMMVEGLAEGEEIKAFTNDTTDEKFGEFEEAILRAVAW
ALGMPYSVLAMQFDSSYSASRGEIKEFEAVIHEMRDRDADTLLRPVYRSWLRAMVLNRRI
DAPGFLAAARNPREFATYAAWTRSQWYGMVKEAVDLEKETRGRGEQIKLGALTRTKAARM
LTGTSFKTNMRKLAKENQMLADAMRPILELKKEFGEDVVDEEINARDGAGPSLIELEGGR
NAVAN*

>Translation: 4082..5269 (direct), 396 amino acids
MPWLIDQEVLQLMADAPDLTQEQVDKCMASSPLFLGNGPTANADRVMTVAGDTARINMQG
VMTSTPDFFAMLFGGGNCLYCDLFEAIDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKI
RNAKKPSTALVSTASSAAYLAASQADEIVAASRVSRVGNIGAVLSMRRPSTSVYVDVTST
DAPNKRPDPESEEGRRVIREQSLDPLHNMFAQAVADGRGTTLSDVNQNFGRGGSLFAETA
LKMGMIDKILTAETESATSTGGAEANAQDDTTGVQTMDLEKLKSEHPAVYAQAIEEGKQI
GLTEERNRVAFHCNMGLKNGAADVALKACQDGASMNDGTVLSDYITAGMNKTELAARESE
ETELSDNQVPENTEEAAKKANVTKLFAGAHMQVQL*

>Translation: 5285..5692 (direct), 136 amino acids
MAKNGLINDRVRDINGVVLFDAKFKPVELAFAAAETWPAGAVLGRVTATGRYVRYNPAAS
DGSQVPSAILTEAVTQSAAGNITYSVAVSGEFRVGDLTDATGTALTANSSADFALRDYGF
ILRDVYETTFRDNNA*

>Translation: 5706..6800 (direct), 365 amino acids
MANELATGWVQAFIERRSPTMFLSSMFTTKPGGIYYGKTVEIDVKRFSENVAVVITKLSG
PNFNDASLISTKEFEPPEYGEAFATDVDDLLQRLIGVNPYDDANIAYSSKLVGRLMDYFM
EANDMIMRGVEIQASQILQTGRLNLLDRDGETAYEIDYAPKATHFPTVTTAWSDDGADPI
DDLRALFEVIRADGKVNPDMIIMGEQALRWALRNANFQEELDNRRIDTGMIDPRMMASGA
TLYGNVWVGSYMAQIWTYPEGYSHPQTRAFTKYVNDDSVIVLSSQTRLDRVSAIVPLPLG
PDQRVSQMLPDLVPGRMVSEGDDLDVTPNLYPTPNGRTIIAELLSRILLVPVQIDGFGCL
DVNP*

>Translation: 7188..7280 (direct), 31 amino acids
MIDEGKEIKAEWLPGGEDQLAYLIGKGLVK*

>Translation: 7280..7666 (direct), 129 amino acids
MNLRQLAEQDLAITLEDADAGFGWPVTLVDPSGASANLTAQSQDIGLIVDPDTGVAVSGR
TASAVLRVSSLVRSGLQVPEGEPRGSANPWELTTTTTNGVAVRFKVDYAAHDRILGTVTL
QLGVLGNA*

>Translation: 7659..8348 (direct), 230 amino acids
MPRPLTVLINQRDSFEIVRDQVAQILADESANQMALATAEGEDPANWQLNVYRERSIPWD
LLDDGAAQSHSVNVWFDSANVDESAADPTNRQTYIGTINIDIVFGSIALKLADTGYVPAD
QAAALGANRITRLVRNILMSDSYTYLNLRQVDAPRVAVGKRWVRSVNTFQPQLDNQTAHH
IVGMRLELAVRYSEFSPQYVAPTLQTVDCTIAEDAQGRVLAGAQFNYTP*

>Translation: 8361..9863 (direct), 501 amino acids
MGISNAIDNSVRARVLGIKTEFRNFNTGRTFFLPQHVALLGQGNTAATYELTPFRATSAA
QVGQRFGFGSPLHLAALQLLPDNNDGLRSIPLTIFPMGDNDAGVAAEGVLDLSGTASATA
TVRVRIGSQRSSLVTIPTGTTAEQAAALLVAGIQGNPFMPMTAAVDGTNANEVDVTAKWQ
GLSGNDLVVSIEGSIPGITVAITQPTGGAADPEVADTLALFGENVWYTQIVSCFNTANTD
ALDAFETFGEGRWDPILRRPLAVFTGTNETDPNTLAAIGDARRAQRTNLVTPVPGAQNLP
CEIAAAWVARVARSANNDPASDFARLTLPGLTPGTDAQQWTHTQRDLLVKAGISTSIVRD
GVAEISDTVTTYHPTGEVNPGYLFYKDTVKASNVLFNLDLIFNTREWDGAPLIPDDQPTT
NARAKRPKDAVAVLSVLANNLGLDAIISDVPFTLENIRASINDQNPNRLDIIFPAKNSGN
ANIISADYFFGFFLGTQAAV*

>Translation: 10049..10258 (direct), 70 amino acids
LQLSIDDLNDDQEFLQQVADSNNMEPILITYASGAAYQGEGTITGDLQTSSQNTTATVTL
MGQGNLSRQ*

>Translation: 10343..10744 (direct), 134 amino acids
MNEIEYKVDRETAEAEFDQMCDVMGVETDEDIMAKDDREDFAKHKERVIKSIMRGVVVLN
DGVPTVHCSDGDKVTFKEPKGGAMIQPMKKNEDELNRIYKIAGTLTGGTAHLAKRHMRDY
RPLLSLTSLFISM*

>Translation: 10840..10944 (direct), 35 amino acids
MDYAGLPDVRTLDMSQVRFFYEGLAPSIIKAMKG*

>Translation: 10993..12804 (direct), 604 amino acids
MSRPIRNIQRNVERFTRTATRNMAKLRRETKKVTESINGLGAKALAAGGTMATALGLASR
AGIEFEHVINRATVKMGDNVTKGTDAYQGLVDIAKEVGATTQFSATQAAQGIDFLAMAGF
NAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLAK
TSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFFLR
LAAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLAVIQKVFGAEG
LAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRFFE
LSGGPMRDVVDQATAWIRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATLWAFN
TIVTTITGAITLLNLVIAANPIVLVITAAIVAVAALAAAIYLHWEPIKTFFTDLWDGIGN
AFDTFATNLANKWEALIAPIRSSIQWLLEQADALLGVGGAANTVVPAGQGFAVAGGGSGF
VPGAAAPTAPPIVTPEERQAAVFSEQIKRNVVELRINDPQGRVDLDGESATDDNVRITKT
GNF*

>Translation: 12804..14057 (direct), 418 amino acids
MVWQTRLREAAYTPPSGNRLTFIYTGVSEEFDQKGGPFDFAGAQGTYVQFLGVTGRRYPM
TIIVSGDDYDLDAAAWMRALAEQGEAVLEHPAYGRLTVAPVGTVKRSENFVNGAGQATIE
VTLFETTGAVYPSPQQDPVAAVEQAVAEADAAAAAELAAAPYAETVGEEASFIDAINDTL
DTVDNALRTSYQAVDEVERQVRQVQDSINRAIDVLVQQPLSLASQVQQLIQLPGRVVSQA
TARLSAYGDLAGQIAGSFDSDGNERPERRIAATRQTLYANNLAALGTASSTALASIRTEF
TSQEEAVEQAEQLLNMLDDLAAWRDQSFNELGQVDSGAGWQATQQAVATAAGALVDISFG
LQRQRIYVADRPRNIVELAYQLYGEVDSRLDELINNSNLSGDEIIEIPRGREVIYYE*

>Translation: 14050..15303 (direct), 418 amino acids
MSRSYTVRQGDTFESISRQVYGQEQYSDDIRQANPGAGTQPAVGAVLVIPNVPELQLSEG
RARNAQVTQDPDEVTLRINGLNFRFWRAVTITQHLDAVSTVSLHAPFDPNDQQSRDAFRP
YSYQPVAVDVGGERLFSGTLVNPQPTTAANERTVRASCYSTPGVLGDCTPPASAFPLQWD
EATLQTIAADLCRPFGVQVLAPNGTGQTFERIAVEPAEKVMAVIARLAAQRNLVVRSDEQ
GRLVLLRPDTQGEPVAEFIEGQQPPISVTPTFGNQDYYSHVTGITPTIVGLEGPQATVRN
PRLEGVLRPFVYNADDMDEADLVQAVQSKAGRMFAAAATYDVPVPTWRNANGGLWRVGDF
VILEAPGAQVYRRTLMQIKTVRFTATPAERTAVLELIIPGSLSGQLPEALPWEGSPQ*

>Translation: 15282..15926 (direct), 215 amino acids
VGRFAAITNFIRKAAGVSDVRCNPGGGATVDAYHAQPAGDDCHPLPTDTTVLVEVPRSNN
YAAVGIIDPNNAQTAGPGERRVYSRNANGEQVAEVFLHDDGRVRASNDNGSVDLAANGDI
VATNGSASAALVGASVTLSDGAGGSISIVNGVITLTGAAIRLAGPVDANGATISAAGQIT
DADGLSVHGHNHTQPNDSDGDVQQPTSTAQIPTP*

>Translation: 15939..16346 (direct), 136 amino acids
MTLAFSQNVRNQRLAVVAAAADAGSGPALIRIYNGTRPPTGGPVTTLLAQLEMSDPAFDA
PANGTMTARAITPEGSTPIGGTATWFRITDSEGNFVLDGDVGLDGSNAELELGDVNFLSN
QEVRIATMTINDGNE*

>Translation: 16413..17015 (direct), 201 amino acids
MQAGNFILTAATSDMTTADATLRASGEMRIIIRGSASLDIAPASMDASGETVAPPIVIEP
PGDLPPAGIRDVWMYQTVDDGNVYPVNGDLYRTDGLETAVYLSLYGGNPEDNGQDANRLG
WWGNADQDDPARQMVSRFQHLVEGIPLTSGNVQRLEDAAAADLEWLSSLGYDVRTSGRIA
GKDKLHMTINIDGDEFVITN*

>Translation: 17025..18212 (direct), 396 amino acids
MVAPTTPTTQSISDNIVAQISSQLGQGAPIFAKAFTRVLARALSGVVVTLYRYGGFMFLQ
MFIRTASNTPVTINGQTVTPLAEWGRILGAGPQRAAVQTQLDVSVAVVTQGGVLASGEQL
QGPNGIIYITTGDTLLNAATVTVRVTAAGDQQGGDGSGTIGNLADGAVLNFVQPLAALQP
TATVSSTARTGIDAETTEAYRSRIDERTQRRPLGGAPVDYQLWAETSSAVLNAYPYTGDT
PGTVAVYIESSTEADGIPTNGQLLEARTAINQSPNGRRNRAPVGTLVNTFPITRETYNVT
VSGLNVDNPADVRRDISNALAEYFLQREPFITGVTTGVRRDQITQIAVGGVVEGIVTAAG
GTLTGVSVTNEAGDAVVSRILPQGTKAKLGAVGYA*

>Translation: 18205..18927 (direct), 241 amino acids
MRNTLRLLLPRGRAWHTTLDKPLRKFMDALGDTFGDVRDHLGSAYTDLLPAYTRRLDDWE
RQFGLPTVDISEEDRRQRLEAAWRPIEGQGIDTLQDVLQASGFNVYFHNWWVPGTEPPPG
SHAPPSVRDPSEFLSLNEARVNCGEPLAQCGEPSAQAGNSLQMPPAPQGRLLVNIIQTDT
GTVEYAIPTDPAEFPHIIYVGGETFGSYASVPFARREEFEALILRIRPAQKLVGLLVNYT
*

>Translation: 18940..20274 (direct), 445 amino acids
MAIDYRNQPRYDGKINEADLANYPQGKAQNVTTPGDNTGTPWDQIGLNDEWGFLQSILAR
AGVTPNNQPDSVGNPQYLDALESLLAPPIGSLQDTRVVLSAPQWLLANGQAVSRTTYSEL
FAVLGTRFGQGDGSTTFNLPADGASKEPGQEFVNLGTGTGLPTYNSEARFARAPNGNMYF
VATGPTRLYRSTDNGLTWTTTNIGSGLPANFDNPAIMVGQNNYVYFIDRTNADLRRSTDD
GVSFTNVGVGGGLPSSIGAPFLASAPTTTGSNPEILYFIDSANDTLYVSTNEGINWTTFG
VGDGLPPNGLGEMIGAPDGYFYGIDIVSDVLYRQVPPSTTWSSVGIGNGLPSTISSPSMS
VDPDGTLYFIDGADRDRLYRSTDRGVTWSSQFLGQGLPDGGNPSLGTSATGSVFYIEPES
DALYASLSNDVHVAGQLATYIRAQ*

>Translation: 20277..21626 (direct), 450 amino acids
MALNYRDITAFEGVIDETDLENYPYGKPQNIIVEGDGRGSIWDQLNINDWWGFQQSILAR
AGVQPNNTPDSVGNPQYLDALTAITAPRVGQTLRTYGVLPEPQWLRTDGRAVSRTEYADL
FAVVGTQYGAGDGSTTFNIPQEAGQEPTPTPGETFVEKQSDGYPRDVRSTAIAGLPDGTI
LGAFGRFSGTPIRKSGDGGDTWLSTGLGNGLSSYNIPVMAANDSGTILLIDINNDSRPLF
RSQDGGENWSDTGIGNGLTINGGDLAAAPNGDFYYINRDETIFRSTDNGENWAQVSGQLP
SHSFAGITVDSTGAVYVFAVFDVVPSVTRVYRSVNNGANWTLVAEEGDGSGLPEVSNRSS
LGAGRNGDLYIIAPDRNQGYRSVDNGASWSPVSGLIDDFSFAYVTSTPNGNIYVVAGTFS
TGGVYASEGSGGPAPGTGLTAEFYIRAQQ*

>Translation: 21789..22040 (direct), 84 amino acids
MELTSGGRCPNHPDQAKKSDPNGGDHPKGNGVDIRVHNRQHYDKIALLAGRHGFNAIGDG
LKYGFIHLGRRPENGDRVSAWGY*

>Translation: 22042..22638 (direct), 199 amino acids
MTNVWDIVKTVGSAVISHAVPGGGLVLDLINGFLDDDKKLPVTATGEQAMRAIQSLDPAQ
QSALLSKQLDVKIEDIKQSHDTLRTMLNADKDNPQTTRPWIAKWAFVFTALFSGVVGLVI
IVAYAYAVYRQDVELVKAIVDGWPFVLGILTAVTGTFSLLLRAYFGVLTKEHENRIGAAT
GRYKPSAISGLLAAMRPK*

>Translation: 22757..22990 (direct), 78 amino acids
MARTTKTLTASRQEVATGRCVITILRSGKYALNDTDTATAQSLSYFKAGEQVVQTERKPT
YASAPDGNGLIIVDEEG*

>Translation: 22995..25514 (direct), 840 amino acids
MLVKLSRSTRYLGALMGVGGSGGGGGGSVPDERVFADLDARQAWTATNASSLEPADECIV
LDDIDGVSFYAWTGFGWNSVDKIYQGLKGSKGDDGASMVSVAFSGDDILVGLSDGSTIPL
EGGKSELTGDSVASVAFDGLDMVFYDENNVEIDRILDAANTLRGARGQSFFVRFARTSAG
PWSDVFDSDPVSGSFFWQFSYDNKNTWTEPQPCRQLTTVNIENPITLDLGNVKLGSSGDS
FAIYNADRRISGYPVMSGIYDDGTTVRAQELVPDGPKVLNAVFGGAPSGVAVDCDYTYNN
PFSAAVLATRVLPMETYSGKINVDVYSNSGELISRHVQTVNVTVGQPAYLSMEDGPKQVA
EASRTYRMLVTKTADNQPLQVQGSDNTGPNPGGPHRAVDYQPLAYRQSLAVGDYELIFDE
LNNAPEGPGLNAEKVKGIEAFIENKIGTDRGNIDFSGDLSQFSPANKSDYWTAVIASGSE
SIGGLTFNNGDRLIAVQSNDVVPTTLNSPSWRVDPKTVPQASATTLGGVKISTTSALEIN
ASTGNLDVKAATANQAGALKADAVGGAPTIGPDGKLKRSVIPVQTGTQTKFLTLESELLT
LPITSDAYIVNVSSTSRRWGLDSNADPSKLENWALLGVFGDSVTSFNDRPGDVEPLAGDY
DADMVDETVDRVFMTPAQSGKLDRYNETARYADFETDRTYEKYDRVRNAAGFIIEANKRT
PANAVPLEGDWDVVSYDYVVVESDRNVGEDKAHMSYVVPATRGATTLRLATAFPTAAHSY
DIYVRGDRDETTEVKVQLTAGVTAGPGTDADGYRIRSAAKLEVRCDASQVNVNYILLDG*

>Translation: 25519..26655 (direct), 379 amino acids
MTGTIIEKPKTLTSDSVVGESGTPSAGAPASSEWLAEVAAKALNKDTSVLPDLDDAVPTD
TNKVPNAADTKEKLDKKLDKALTNAASKVIVTDADGNIIASADVSLDKLKAVGALSTDGA
IVAVDAEGKVSETSLTKTKAEALQALATKGAVVVVGEDGQLIETGLTFTATDENGTEVNR
LSQTAAPASTASAETDMLTKKDADGLYQPIGAALEGITLPSRGIHTLEPDEDEDTKIFEF
DTLAPLTGGTPTTVSVYASTGGDQVEFYLSSRVPYFAKFQGTKAVTTRTNGEVTASEVSV
VCSAARRWTSATAGHATAQTLTTSSFNREDCVSLIMQFGKGGGSVHNFTCACQIELTPVD
SGQMNLTVTPLSGKQQSS*

>Translation: 26869..27348 (direct), 160 amino acids
MALADFSMSDLILFVTVLLVFRLRGAELVAAGAFVLCSILHTVATLVVPVSDIVYYASAV
VADFVALIVLCGVRPVCGVVYGLARVCVASMALNLVGLLCWWLYLPAWPYNLAFMVLYVV
ALGIIGSGGANGMGGRAVRWRFSRLAFPASASGAGCNKA*

>Translation: 27357..27623 (direct), 89 amino acids
MNVIERFTQLINDVKVAMGIGVGTASGGVGQWLDLLPDNALTKTATIIGMCLSLVLIYTH
LKRHIREERMAKVDLQIKLKALKEEGPS*

>Translation: 27642..27953 (reverse), 104 amino acids
MIADKYGVGTVLFENEDKPHPCAQFGDGKLRIFTTVWSPRNVGLKIARTVEDLPPFSAHT
YDEENPPHQTDGNEIHLLFDDPRSIDAMITQLEYIKGVMLGGE*

>Translation: 27943..28167 (reverse), 75 amino acids
MKQPRKRSFKARALEAETERDLLRTQLFNARAALGLVANGGLDTDPKVVQYWAGELVKGL
DETLKKTERKQDDS*

>Translation: 28139..28699 (reverse), 187 amino acids
MYKYFQVNSDVGHYLVDKYNREVNPKREELVTDLLGAVGAVGVVLYREWGMPATIQALVF
PADHDICRAEGVKLTEHRQGMVVQFDNDSQYAGIYYEPISHLNQQLLAYPDFSDWVVHTM
GVTRYALADTDGEAKKKIATYSHLLTDGRIIFAVPLGNVGQKPINPDRRMVEITKADYEA
ATQAQF*

>Translation: 28692..28871 (reverse), 60 amino acids
MTDTPKIISVQVTPNDTHWQGALIGLDDQGTTYISDRVGGANKWVEYVPALKPETKDNV*

>Translation: 28864..29418 (reverse), 185 amino acids
MRKPLTEINWLRLGTTDEHKELKHELAHRVRAVAGGDYGDRPLPAIILVFNEHGWTVVVP
EGVSVAASERLLAVCIDLARAAAIDEVCMHDILRTFVELGVRLHDVDPRPALTPERADGR
TFIEFGARPVAVPIAETLEQKVRAALAELNDRTEISDLDLQDDEGDRDAVLDFIQNIREA
VHND*

>Translation: 29424..29909 (reverse), 162 amino acids
MTNNAKTTDEALDAIRGVKGLYAQPISNYTASIKILDSKNLARLTVELPLDQLSDRRDAR
MFDPRNPMFDTLKMVPFLAFVDVSEFEERFAEPTAAPKLGANGENVETEFLKSDYLTHHD
DFVRACEIARDASDGEGGELYWQRQINTLARLAEDYPFKRG*

>Translation: 29968..32028 (reverse), 687 amino acids
LKDGKGRRAYIPGGPKPTDLIDHIANGGPIAAWNITFEFWIWNMVLAKSEGWPWLQLEQC
YCDMAKARRFSLPGSLDTAAKALRTAEKDKKGKQLIGKLCRPVSATKARPEPRWTLYTAP
QDYADLYRYCDQDVKSEDHVSALVPDMTPAERETWLADQRINARGVLVDVESLDNALDIV
GQTTRRFTMELAAITNGAVGSVSEVAKLTDFLATVGCRMHNLKSETVAETLERDDLNPTA
RRILEIREALAGANVKKLFSLKAQISSDGRLRNQYNFFGAGTGRWSAGGVQLQNLTSKGP
KSKTCNGCGRIVGKSCRVEVMGAAGLCPECGANDWTDNKDWTVDAVRWALKDLKHRNLDL
IIDIWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRIEVFRTHGKI
YEMSASKISGVPFEEMMAYKEKNKHDHPLRKSLGKVAELASGYGGWIGAWKAFGAEEHFE
DDDAMKKAILGWRDASPEIVEFWGGQYRRDPYTRKWVREYYGLEGAVIMAILNPGKCYAV
GDITYAVFDDVLYCRLPSGRFLQYHQPKLIESDSWKGPEYQVTFMGYNTNSQKGPVGWVR
MDTYGGRLAENVTQAAAADVQAYAMKQCENNGYPVVMHTHDELIAEVPGDFGSVEEMAAL
MTGREPWREWWPIRAAGWEDDRYQKD*

>Translation: 32088..32813 (reverse), 242 amino acids
MNIEQWVGLAAAKEHEPRGLHLLRVESGMAYASDGHRAHWAPTYFADGYYDPRTLQPVPG
AHTVDLVGHLQKHVWGTTIVQWEGGDLREGTLHYFPKDQLAQAQPIGNLFINDNHQVTGV
GFSEGMFTIAACKGVEKPAQTPPAAEKPAEAPKVDPIKFLALEADTGRYVATMTKPDGHY
MKPVEADQLPAGTVIHNVGMGRSTVYPSFDFETYSEAGFVINPETGKVHTAAKTGNKNGM
R*

>Translation: 32884..33951 (reverse), 356 amino acids
MPMHNDNFLKLVDATVVWDGINRPDQLEAKPGQAPGNKWSVKIVFPPNHPDLPLLENLAW
AELNAGEFQGTLPNGGMMPVSDVGPNEFNGMFPGWKCVNVSTFQQPQVFNQGQMLDPMQW
NQFVYSGQRVSVLLHCKTYNNKSRGVAARLDGLEVLTEFNAPRLQFGGGANAMDALGGGN
NGGGQPQNNGGQPQQQQPQQQQPQQQQQQWGQPQNNGGQPQQQQQQWGQPQNNGGQPQQQ
QQQYEQPQQQPQQQQQQYEQPQQQQQQQQQQQQWGQPQQQQQQQQQQQWGQPQQQQQQYE
QPQQQQQQQQYEQPQQQQYNNGQPQQYEQPQQQQPQNNNNGGGQYPQQAQNFMPQ*

>Translation: 33963..35090 (reverse), 376 amino acids
MGGQHASIPPSGAPIWGHCSGYLGVMHKVADFDTDATRAGTASHWAFSEAVLNDLDAEDY
IGHADPDGTIVDEGMAHGAQLMVDMVREDMEKHPGGQLYIEQRVYMPDIHPDNWGTLDYA
YVLPAIKRVILGDYKHGHLEVSPERNLQLVDYLKGLENLVGYTFDGWRIDLKVCQPYAYS
PHGPKKVWQASRADIVPLWAQLSEQAHSDPHMTTGKHCRYCPLVGRCSAARQAGYSLISY
VKEPFEVDTMTGADLEAERDILTEGSVILKARLEDIEEQLQNRLRKGEKGIGLTLRQTTG
RAKWTQSDKVVITALKSIGLDVSKETTITPTQAKDKAKTEAQKAAVKALQRRPNGKIELV
KLKDSPAHRAFGANQ*

>Translation: 35094..35945 (reverse), 284 amino acids
MLKIEFDAANKPLAAAIGQALLNYADAKLDPASAAIVSQVKANHSDPKESTDNVSAPGSD
TPSESNSKTFTETPSETQTGESTTTAQSAESDDANYVFDQHWKSASGLDYQGHPVHPEYG
VPVDDKGYPLIDIYGVKHDPDWCQKAAKPFYASGKDKGKWKAKVGTDKDAFAEWHYAQLS
ELGKQSTDGAAGGPTDEPQADAASAFGSQAQQPANNGGAARPTANIGDLMGWIGEKQAAQ
VLTTDQINKAYTDCGTDIASLAAPDNTAGREAVFGYLSGICGV*

>Translation: 36032..36235 (direct), 68 amino acids
MEYIDEHVRAHLANYPKGRTVNQLARELGVPRATLRACMQRLETTGAAFTGDATFAGNKY
WKARAHV*

>Translation: 36228..37877 (direct), 550 amino acids
MFKLRDYQQLAYNRIMAAWEKYRSVLAVLPTGAGKTVIFSKIIHDHTGAAAAIVHRREIV
AQISLSLASFGVKHRVIAPAKTLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLTSKSTLN
NAAIMRWVNQVTLGVYDEGHHYVEQGIWAKGVHVFENAKLLFVTATPERADGVGLGKGEG
GFAEVMIEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKNGDFNAKALKARVVESHI
VGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLAQF
EEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIIDP
VRNWERHGPPTAVRNWSLMGREKGQGSGGDTIPQKTCDACTVPYEAYYKACPYCGNVNEI
ADRSAPDKVDGDLVELDLDAWNALFAEIERAKMSDDDYELDMIKRGVPQVGRGPELRRHR
AAKYRREVLDNVVRWWVGMQQQLGRDMSEIYKRFYFRFGVDIGTAFTLNQKDTDALTAKI
RERFHEDLN*

>Translation: 37877..38383 (direct), 169 amino acids
MSNTFLTMHGNFDFDLSGDKFDLPIKTIAYHLSHINRFNGAVGQYSVAQHCVQVAALLPA
NLKLAGLLHDATEAVLCDVPAPLKRMLPDYQAIENRLQDAVDARFKVKTRHKRVREADLS
MLAAEARDFGLDLGPLGFEPVSATIKPWPAKTAEQAWLAAFHAYKGTY*

>Translation: 38383..38892 (direct), 170 amino acids
MSIEFDQWAARWPQAAAELVAVMADTAPSPVTSPASEARAQQEARMEAGRRGGVLWRNNV
GATKAKEPHVCPNCAFKFEVRKPPLRYGLCNDSEKLNAKIKSSDLIGIKPVLITPDMVGQ
TIGQFWAVEVKAPGEPINLRDERQKGQAAFGALVERFNGTFEFSHGGLT*

>Translation: 38938..39273 (direct), 112 amino acids
MAKQVTKRPAPAVRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQL
KRAIVRGAIKADDGLVVAMAVKAGDFQRAKVDELLYIRGEHQLEGWSPCLS*

>Translation: 39261..39446 (direct), 62 amino acids
MPELVEFIVSPCGCRVTKYHGGQLDDFQLLSTPEAADKWVEREQKVYEHRVPRVEVKVTR
L*

>Translation: 39632..39790 (direct), 53 amino acids
MRLIDLFIILAGALCGFCVYLATEGHILSIMALAGLLCVIGIIYEVTKNAIN*

>Translation: 39810..40142 (direct), 111 amino acids
MPRSLVREFKRLNDIMFDGSLDLTLDEEKQVLEHVRTIEGARNLVTIGQFLRIMPTDKRR
RSALLAYLFDEDRPSGYACDACYGIDTQYTKRAAELLVRKVQQAQELVNA*

>Translation: 40139..40591 (direct), 151 amino acids
MKLRDYQRAALLGLEGVCGEPTLEPAIRFMELHKSAATGKSVAPSATIVMATPTARRLLG
GPELQNVEPPEFVKMSYADFEAAVMARFAENVGVDLAQLVVSIQRMGRALRPDMPEVPRW
DEVKPGTLKVAKNERTKGSHPQPHYYKGRW*

>Translation: 40595..40804 (direct), 70 amino acids
MTHQYKYRCAIAPADGLRLQQDGQNLRVDIEFGNGTGHTTPVLLDREQALQLARDLADYV
GLDKVTVRD*

>Translation: 40808..40996 (direct), 63 amino acids
MFKKIKVLYWQWVVGCAVEDIQHIDNELPYLNPLAKTQALRERRQAINDLITARERLLKL
EG*

>Translation: 41053..41232 (direct), 60 amino acids
VHFGKVVERILRNEAPSQDDISTLVQHYAFDPIMTTLDLQDVVDELNVRFGTNHTITGR*

>Translation: 41229..41495 (reverse), 89 amino acids
MKTTTLTDMQRKALEWFTGDRKRVLMAHINDEGVECFDGVQGLHTTMCARGLKKKRLIEQ
AGRVGDKTAYRISKKGRRVLAYNKEKRA*

>Translation: 41492..41800 (reverse), 103 amino acids
MSTQDNHKTTTTLSVPREFVGDVVKLVADKKREKQARTIVITPEDLGTMKYFIEEKGDVT
RWCDWDEKKAAIFELYPGLRRALNNLELAETQLRHELEGIDV*

>Translation: 41934..42347 (reverse), 138 amino acids
MKVLTGDAYELHAIAADVPHGEDVTGLVDAMTAAMTAAGGIGLAGNQLGVLKRVIVVRAP
KFKGCIVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHNKITVEGFDAHWQPIKVEAKGL
TAFCIQHEIDHLNGVTI*

>Translation: 42344..42646 (reverse), 101 amino acids
MNYTHDMDMHLEKYAQTRTAEQLAADLGAPNAKSIQNRCQRIGISLVKSGRNHRLAKLTA
EDIRLAKLLIDDGELTMAAIERKLGLYKGAAKRIKEGILT*

>Translation: 42643..43014 (reverse), 124 amino acids
MSGATRMWRVMFLDERHCEIAPDVLLDFDPRGQMGEPMHTCTARLMTWHIISERTRAVAV
VSDVFLRWGRVWDGYDLFQHRNYDNLGSWFNAHTVKPVLGARGDDGPYNRVPPRLEEVID
ECQ*

>Translation: 43011..45386 (reverse), 792 amino acids
MNQTAQQLKAHGLGAFPCLQNKMPAVPKGTSWKDWAHAELNALPWPSDIVGVPIPSGVVV
LDLDTYKGITREYVEAGAGFAIPWDAAHIQTTQSGGQHYAFRAPNWPVKNISNAKHNETG
VQFEGLDVRSAGKGYIATGEPYYTPTHLGGALAMAFPQMLPPLPEGLRPWLEAVAHESSE
RVEVTDEDAKTIREALRHIDPGETREEWVKVGLSLKSGFGDDPQGLSLFDEWSSGALWRD
GDEPANYVPEHIETQWDSFKAEGGRTIATVYHKAIEGGWQPPAGINAFDVFGEGATDSTT
FAAIVDSLQQHGGDPKQTQTLIDTIIALPCSDMQRAMLAATLSRELKDADLLTKEVRAQI
ARITGNTAAGAASKVPVTPKGQRLAVNQPMHPDLWAPFHTTGKDMKPRGTIDNFEIMMSA
YGVQIGFDEISKELSIVVPGVNYGGALKDEAALSEIESIGNLNHYPKSDIKGMIAYMAHR
YAHNPVKAWVESAPWDGLDHVGLLFSHITLTPDEDRPMCEHLFRKWMRAAVSAGVGDQEG
CEPVLVFVDEQGGVGKTRFFRTLCPEPFRADSVLLDVKDKDSVKQAISYWLVELGELDGT
FSRSESNALKAFLSRTKDDIRLPYGRTNMKYPRMTAYVGSVNEAEFLVDTTSNRRFIPMK
VAALNHQHRVNTQQAWAQALAEARSGAVTYVEAHEVAERNASFQATSAIDDVLSSRLNEA
TGERNVHMTVTDLLKRCGMHNPTKRDLNDAGKWLRSNGYEKRIRSGVRGFMLPDMSIGAQ
AFGGPQLEVVK*

>Translation: 45428..45601 (direct), 58 amino acids
MRIIRGLTLKPASLLSLNIRTTVRVVQPKASAASLVEIWGLLSIVLALFVLLVTGEI*

>Translation: 45920..46672 (direct), 251 amino acids
MCTLPARVLESGRTDSEGKTMFEINSKQMRQLEDALQQTNDLGVKFAQRATVNDYAFETA
KTAKENIRRDFINRNTFTQRSVRVDKATRFIDSSEVGSTADYMREQEEGARTAAPLNVPT
PAASGESVRANKRLKAVRKANRMTSITLKKRSRRAGGLTRKQQNIAAVKQAASEGRKFVY
LDRGKRKGIFRLYGGKRRPRVRLVQDLSRSVRVVPQHQWLEPASDKTMERRAEIYWRRLA
QQLRRANFPR*


Input Sequence
Title (optional):


Sequence:


Sequence File upload:


Use alternate genetic code:
      Mycoplasma (TGA = Trp)

Output Options
Email Address: (required for graphical output or sequences longer than 1000000 bp)


Generate PostScript graphics
Print GeneMark 2.4 predictions in addition to GeneMark.hmm predictions
Translate predicted genes into protein


Run 

Web pages maintained by GeneMark administrator, genemark-admin@amber.biology.gatech.edu. Please send any suggestions for improvements or problems to the web page maintainer.