Heuristic Approach for Gene Prediction in Prokaryotes (Reload this page)
Reference:Besemer J. and Borodovsky M., Heuristic approach to deriving models for gene finding, NAR, 1999, Vol. 27, No. 19, pp. 3911-3920.
[ Download PDF ]

The models used by GeneMark.hmm 2.0 and GeneMark 2.4 are derived from parameters measured from the input sequences and knowledge gained through the study of various bacterial genomes.These models have been shown to accurately predict genes in bacterial, viral and plasmid DNA sequences. Please note that email is the only way to receive output for sequences larger than 1 MB.

UPDATE (June 1, 2001): Web site has been redesigned and moved a to new, more powerful server
Listing of previous updates


Gene Prediction Results

Information on input sequence

Sequence title: >clear061902
Length:         47537 bp
G+C percentage: 59.38 %

Parse predicted by GeneMark.hmm 2.0

GeneMark.hmm PROKARYOTIC (Version 2.1)
Sequence file name: sequence,	RBS: N
Model file name: heuristic_no_rbs.mat
Model organism: Heuristic_model
Fri Jul 12 15:41:00 2002

Predicted genes
   Gene    Strand    LeftEnd    RightEnd       Gene     Class
    #                                         Length
    1        -          <2          85           84        1
    2        +         124         837          714        1
    3        +         821        2842         2022        1
    4        +        2883        3161          279        1
    5        +        3227        4864         1638        1
    6        +        4848        6035         1188        1
    7        +        6051        6458          408        1
    8        +        6472        7566         1095        1
    9        +        7954        8046           93        1
   10        +        8046        8432          387        1
   11        +        8425        9114          690        1
   12        +        9127       10629         1503        1
   13        +       10815       11024          210        1
   14        +       11109       11510          402        1
   15        +       11606       11710          105        1
   16        +       11759       13570         1812        1
   17        +       13570       14823         1254        1
   18        +       14816       16069         1254        1
   19        +       16048       16692          645        1
   20        +       16705       17112          408        1
   21        +       17116       17781          666        1
   22        +       17791       18960         1170        1
   23        +       18973       19695          723        1
   24        +       19708       21042         1335        1
   25        +       21045       22394         1350        1
   26        +       22557       22808          252        1
   27        +       22810       23406          597        1
   28        +       23525       23758          234        1
   29        +       23763       26282         2520        1
   30        +       26287       27408         1122        1
   31        +       27638       28117          480        1
   32        +       28126       28392          267        1
   33        -       28411       28722          312        1
   34        -       28712       28936          225        1
   35        -       28908       29468          561        1
   36        -       29461       29640          180        1
   37        -       29633       30187          555        1
   38        -       30193       30678          486        1
   39        -       30737       32797         2061        1
   40        -       32857       33582          726        1
   41        -       33653       34714         1062        1
   42        -       34732       35859         1128        1
   43        -       35863       36714          852        1
   44        +       36801       37004          204        1
   45        +       36997       39153         2157        1
   46        +       39153       39662          510        1
   47        +       39708       40043          336        1
   48        +       40031       40216          186        1
   49        +       40402       40560          159        1
   50        +       40580       40912          333        1
   51        +       40909       41361          453        1
   52        +       41365       41574          210        1
   53        +       41578       41766          189        1
   54        +       41823       42002          180        1
   55        -       41999       42265          267        1
   56        -       42262       42570          309        1
   57        -       42704       43117          414        1
   58        -       43114       43416          303        1
   59        -       43413       43784          372        1
   60        -       43781       46156         2376        1
   61        +       46198       46371          174        1
   62        +       46690       47442          753        1

Listing of GeneMark Predictions

                              GENEMARK PREDICTIONS

Sequence: >clear061902
Sequence file: gm_sequence
Sequence length: 47537
GC Content:  59.38%
Window length: 96
Window step: 12
Threshold value: 0.500
---
Matrix: Heuristic model
Matrix author: MB/JDB
Matrix order: 2

List of Open reading frames predicted as CDSs, shown with alternate starts
(regions from start to stop codon w/ coding function >0.50)

Left      Right     DNA         Coding Avg   Start
end       end       Strand      Frame  Prob  Prob
--------  --------  ----------  -----  ----  ----

     124       837  direct      fr 1   0.89  0.47  
     148       837  direct      fr 1   0.92  0.11  
     403       837  direct      fr 1   0.92  0.29  
     478       837  direct      fr 1   0.91  0.08  
     484       837  direct      fr 1   0.90  0.05  

     821      2842  direct      fr 2   0.78  0.80  
     962      2842  direct      fr 2   0.80  0.55  
    1007      2842  direct      fr 2   0.80  0.19  
    1037      2842  direct      fr 2   0.80  0.48  

    2883      3161  direct      fr 3   0.64  0.68  
    2976      3161  direct      fr 3   0.66  0.01  

    3143      4864  direct      fr 2   0.82  0.43  
    3227      4864  direct      fr 2   0.84  0.15  
    3236      4864  direct      fr 2   0.85  0.08  
    3260      4864  direct      fr 2   0.86  0.30  
    3266      4864  direct      fr 2   0.86  0.25  
    3473      4864  direct      fr 2   0.89  0.54  
    3491      4864  direct      fr 2   0.89  0.75  

    4848      6035  direct      fr 3   0.73  0.95  
    4872      6035  direct      fr 3   0.75  0.96  
    4884      6035  direct      fr 3   0.75  0.07  
    4929      6035  direct      fr 3   0.74  0.03  
    4986      6035  direct      fr 3   0.73  0.48  

    6051      6458  direct      fr 3   0.66  0.67  
    6081      6458  direct      fr 3   0.65  0.89  
    6099      6458  direct      fr 3   0.63  0.26  

    6472      7566  direct      fr 1   0.81  0.62  
    6499      7566  direct      fr 1   0.83  0.33  
    6532      7566  direct      fr 1   0.84  0.42  
    6547      7566  direct      fr 1   0.84  0.60  
    6622      7566  direct      fr 1   0.83  0.02  

    7915      8046  direct      fr 1   0.61  0.74  
    7939      8046  direct      fr 1   0.65  0.24  
    7954      8046  direct      fr 1   0.62  0.83  

    8343      8432  direct      fr 3   0.55  0.61  
    8349      8432  direct      fr 3   0.55  0.51  

    8425      9114  direct      fr 1   0.73  0.42  
    8488      9114  direct      fr 1   0.77  0.46  
    8524      9114  direct      fr 1   0.75  0.05  
    8644      9114  direct      fr 1   0.71  0.32  

    9127     10629  direct      fr 1   0.70  0.19  
    9415     10629  direct      fr 1   0.73  0.49  
    9436     10629  direct      fr 1   0.72  0.14  
    9490     10629  direct      fr 1   0.71  0.03  
    9496     10629  direct      fr 1   0.72  0.09  
    9526     10629  direct      fr 1   0.72  0.84  
    9574     10629  direct      fr 1   0.72  0.08  

   10866     11024  direct      fr 3   0.65  0.05  
   10884     11024  direct      fr 3   0.62  0.04  

   11067     11510  direct      fr 3   0.73  0.21  
   11109     11510  direct      fr 3   0.77  0.88  
   11166     11510  direct      fr 3   0.73  0.13  
   11178     11510  direct      fr 3   0.72  0.04  

   11714     13570  direct      fr 2   0.86  0.35  
   11759     13570  direct      fr 2   0.87  0.05  
   11825     13570  direct      fr 2   0.89  0.51  
   11909     13570  direct      fr 2   0.89  0.42  
   11978     13570  direct      fr 2   0.89  0.53  

   13570     14823  direct      fr 1   0.80  0.04  
   13648     14823  direct      fr 1   0.83  0.70  
   13711     14823  direct      fr 1   0.83  0.05  
   13747     14823  direct      fr 1   0.82  0.86  

   14816     16069  direct      fr 2   0.70  0.64  
   14873     16069  direct      fr 2   0.72  0.37  
   14951     16069  direct      fr 2   0.71  0.33  
   14969     16069  direct      fr 2   0.71  0.10  
   15014     16069  direct      fr 2   0.71  0.65  

   16048     16692  direct      fr 1   0.59  0.44  
   16105     16692  direct      fr 1   0.63  0.08  
   16198     16692  direct      fr 1   0.66  0.43  
   16321     16692  direct      fr 1   0.67  0.70  
   16330     16692  direct      fr 1   0.67  0.36  
   16408     16692  direct      fr 1   0.63  0.04  

   16705     17112  direct      fr 1   0.59  0.23  
   16861     17112  direct      fr 1   0.72  0.54  
   16900     17112  direct      fr 1   0.72  0.01  
   16990     17112  direct      fr 1   0.80  0.37  

   17116     17781  direct      fr 1   0.80  0.23  
   17179     17781  direct      fr 1   0.84  0.49  
   17221     17781  direct      fr 1   0.83  0.30  
   17260     17781  direct      fr 1   0.84  0.04  
   17311     17781  direct      fr 1   0.87  0.53  

   17791     18960  direct      fr 1   0.66  0.15  
   17959     18960  direct      fr 1   0.64  0.31  
   17971     18960  direct      fr 1   0.64  0.08  

   18973     19695  direct      fr 1   0.54  0.03  
   19051     19695  direct      fr 1   0.56  0.62  
   19459     19695  direct      fr 1   0.66  0.16  

   19708     21042  direct      fr 1   0.59  0.80  
   19894     21042  direct      fr 1   0.54  0.08  
   19999     21042  direct      fr 1   0.53  0.21  

   21039     22394  direct      fr 3   0.70  0.61  
   21045     22394  direct      fr 3   0.70  0.84  
   21543     22394  direct      fr 3   0.66  0.26  
   21699     22394  direct      fr 3   0.65  0.61  
   21927     22394  direct      fr 3   0.52  0.01  

   22557     22808  direct      fr 3   0.53  0.34  

   22786     23406  direct      fr 1   0.79  0.05  
   22810     23406  direct      fr 1   0.82  0.28  
   22867     23406  direct      fr 1   0.88  0.52  
   22885     23406  direct      fr 1   0.88  0.91  
   22933     23406  direct      fr 1   0.87  0.02  

   23525     23758  direct      fr 2   0.52  0.24  

   23763     26282  direct      fr 3   0.75  0.36  
   23808     26282  direct      fr 3   0.76  0.13  
   23847     26282  direct      fr 3   0.76  0.70  
   23862     26282  direct      fr 3   0.76  0.68  
   23940     26282  direct      fr 3   0.76  0.58  

   26287     27408  direct      fr 1   0.61  0.56  
   26398     27408  direct      fr 1   0.61  0.71  
   26431     27408  direct      fr 1   0.60  0.09  
   26455     27408  direct      fr 1   0.59  0.06  

   27695     28117  direct      fr 2   0.50  0.16  

   28126     28392  direct      fr 1   0.64  0.96  
   28132     28392  direct      fr 1   0.67  0.97  
   28171     28392  direct      fr 1   0.77  0.03  
   28177     28392  direct      fr 1   0.75  0.02  
   28189     28392  direct      fr 1   0.74  0.07  

   28411     28722  complement  fr 3   0.65  0.75  
   28411     28692  complement  fr 3   0.67  0.08  

   28712     28828  complement  fr 1   0.56  0.07  
   28712     28936  complement  fr 1   0.58  0.32  

   28908     29468  complement  fr 2   0.81  0.68  
   28908     29342  complement  fr 2   0.88  0.35  
   28908     29339  complement  fr 2   0.88  0.33  
   28908     29318  complement  fr 2   0.88  0.87  

   29461     29640  complement  fr 3   0.55  0.60  
   29461     29613  complement  fr 3   0.58  0.42  

   29633     30100  complement  fr 1   0.70  0.15  
   29633     30187  complement  fr 1   0.68  0.64  
   29633     30091  complement  fr 1   0.70  0.31  
   29633     29794  complement  fr 1   0.71  0.28  
   29633     30043  complement  fr 1   0.75  0.07  
   29633     30019  complement  fr 1   0.79  0.09  

   30193     30498  complement  fr 3   0.81  0.53  
   30193     30477  complement  fr 3   0.80  0.56  
   30193     30459  complement  fr 3   0.79  0.20  

   30737     32665  complement  fr 1   0.80  0.24  
   30737     32608  complement  fr 1   0.80  0.13  
   30737     32368  complement  fr 1   0.81  0.15  

   32857     33498  complement  fr 3   0.83  0.13  
   32857     33582  complement  fr 3   0.81  0.92  
   32857     33384  complement  fr 3   0.85  0.61  
   32857     33345  complement  fr 3   0.84  0.05  

   34732     35784  complement  fr 3   0.77  0.79  
   34732     35709  complement  fr 3   0.75  0.28  
   34732     35649  complement  fr 3   0.74  0.11  

   35863     36354  complement  fr 3   0.79  0.80  
   35863     36027  complement  fr 3   0.60  0.80  
   35863     36714  complement  fr 3   0.69  0.81  

   36997     39153  direct      fr 1   0.83  0.67  
   37042     39153  direct      fr 1   0.84  0.57  
   37069     39153  direct      fr 1   0.83  0.06  
   37078     39153  direct      fr 1   0.83  0.05  

   39153     39662  direct      fr 3   0.51  0.72  
   39210     39662  direct      fr 3   0.52  0.00  
   39219     39662  direct      fr 3   0.51  0.01  
   39243     39662  direct      fr 3   0.52  0.14  
   39288     39662  direct      fr 3   0.51  0.26  
   39390     39662  direct      fr 3   0.60  0.03  
   39483     39662  direct      fr 3   0.63  0.32  

   39708     40043  direct      fr 3   0.60  0.07  
   39744     40043  direct      fr 3   0.65  0.07  
   39762     40043  direct      fr 3   0.67  0.08  
   39792     40043  direct      fr 3   0.72  0.38  
   39834     40043  direct      fr 3   0.77  0.69  

   40022     40216  direct      fr 2   0.56  0.36  
   40031     40216  direct      fr 2   0.56  0.12  
   40076     40216  direct      fr 2   0.69  0.65  
   40148     40216  direct      fr 2   0.51  0.01  

   40357     40560  direct      fr 1   0.64  0.15  
   40402     40560  direct      fr 1   0.73  0.68  

   40547     40912  direct      fr 2   0.61  0.08  
   40580     40912  direct      fr 2   0.68  0.51  
   40625     40912  direct      fr 2   0.71  0.71  
   40670     40912  direct      fr 2   0.65  0.01  
   40739     40912  direct      fr 2   0.64  0.20  
   40871     40912  direct      fr 2   0.54  0.02  

   40909     41361  direct      fr 1   0.59  0.74  
   40957     41361  direct      fr 1   0.64  0.14  
   40996     41361  direct      fr 1   0.61  0.20  
   41032     41361  direct      fr 1   0.63  0.02  
   41056     41361  direct      fr 1   0.65  0.38  
   41131     41361  direct      fr 1   0.65  0.51  
   41161     41361  direct      fr 1   0.59  0.14  

   41365     41574  direct      fr 1   0.74  0.28  
   41488     41574  direct      fr 1   0.90  0.20  

   41578     41766  direct      fr 1   0.52  0.50  

   41769     42002  direct      fr 3   0.55  0.06  
   41787     42002  direct      fr 3   0.61  0.03  
   41823     42002  direct      fr 3   0.69  0.26  
   41838     42002  direct      fr 3   0.73  0.39  
   41895     42002  direct      fr 3   0.68  0.27  
   41922     42002  direct      fr 3   0.58  0.06  

   41999     42241  complement  fr 1   0.74  0.47  
   41999     42265  complement  fr 1   0.68  0.74  
   41999     42190  complement  fr 1   0.74  0.63  
   41999     42127  complement  fr 1   0.64  0.32  

   42262     42570  complement  fr 3   0.77  0.04  
   42262     42528  complement  fr 3   0.78  0.41  
   42262     42396  complement  fr 3   0.58  0.01  
   42262     42513  complement  fr 3   0.77  0.58  

   42704     43117  complement  fr 1   0.58  0.92  
   42704     43036  complement  fr 1   0.53  0.04  
   42704     43027  complement  fr 1   0.52  0.09  
   42704     43015  complement  fr 1   0.50  0.21  

   43114     43416  complement  fr 3   0.74  1.00  
   43114     43398  complement  fr 3   0.74  0.93  
   43114     43422  complement  fr 3   0.75  0.99  

   43413     43766  complement  fr 2   0.66  0.79  
   43413     43754  complement  fr 2   0.65  0.79  
   43413     43685  complement  fr 2   0.57  0.04  

   43781     46156  complement  fr 1   0.76  0.80  
   43781     46090  complement  fr 1   0.76  0.09  
   43781     46081  complement  fr 1   0.76  0.21  
   43781     46009  complement  fr 1   0.77  0.25  

   46612     47442  direct      fr 1   0.66  0.00  
   46621     47442  direct      fr 1   0.67  0.01  
   46666     47442  direct      fr 1   0.70  0.13  
   46675     47442  direct      fr 1   0.71  0.07  
   46690     47442  direct      fr 1   0.72  0.21  
   46711     47442  direct      fr 1   0.74  0.84  
   46750     47442  direct      fr 1   0.75  0.78  
   46774     47442  direct      fr 1   0.74  0.04  
   46843     47442  direct      fr 1   0.71  0.04  

List of Regions of interest
(regions from stop to stop codon w/ a signal in between)

   LEnd      REnd    Strand      Frame
 --------  --------  ----------- -----
       88       837  direct      fr 1
      812      2842  direct      fr 2
     2868      3161  direct      fr 3
     3101      4864  direct      fr 2
     4833      6035  direct      fr 3
     6033      6458  direct      fr 3
     6364      7566  direct      fr 1
     7550      7735  direct      fr 2
     7663      8046  direct      fr 1
     7742      7936  direct      fr 2
     8040      8432  direct      fr 3
     8410      9114  direct      fr 1
     9112     10629  direct      fr 1
    10636     10839  direct      fr 1
    10812     11024  direct      fr 3
    11022     11510  direct      fr 3
    11708     13570  direct      fr 2
    13552     14823  direct      fr 1
    14807     16069  direct      fr 2
    15904     16692  direct      fr 1
    16690     17112  direct      fr 1
    17110     17781  direct      fr 1
    17779     18960  direct      fr 1
    18958     19695  direct      fr 1
    19693     21042  direct      fr 1
    20982     22394  direct      fr 3
    22539     22808  direct      fr 3
    22759     23406  direct      fr 1
    23462     23758  direct      fr 2
    23733     26282  direct      fr 3
    26224     27408  direct      fr 1
    27626     28117  direct      fr 2
    27644     28018  complement  fr 1
    28108     28392  direct      fr 1
    28411     28758  complement  fr 3
    28712     28951  complement  fr 1
    28908     29564  complement  fr 2
    29461     29715  complement  fr 3
    29633     30205  complement  fr 1
    29774     30214  direct      fr 2
    30193     30705  complement  fr 3
    30737     32950  complement  fr 1
    32857     33609  complement  fr 3
    33608     34264  direct      fr 2
    33653     34729  complement  fr 1
    34732     35865  complement  fr 3
    35863     36756  complement  fr 3
    36750     37004  direct      fr 3
    36925     39153  direct      fr 1
    39138     39662  direct      fr 3
    39258     39683  complement  fr 2
    39666     40043  direct      fr 3
    39815     40216  direct      fr 2
    40156     40560  direct      fr 1
    40538     40912  direct      fr 2
    40849     41361  direct      fr 1
    41359     41574  direct      fr 1
    41572     41766  direct      fr 1
    41721     42002  direct      fr 3
    41999     42286  complement  fr 1
    42262     42609  complement  fr 3
    42704     43144  complement  fr 1
    43114     43515  complement  fr 3
    43413     43838  complement  fr 2
    43781     46183  complement  fr 1
    44366     44593  direct      fr 2
    46576     47442  direct      fr 1

POSSIBLE SEQUENCE FRAMESHIFTS DETECTED
 From   To
 Frame  Frame  At base...
 -----  -----  ----------
   2      1        15240 +/- 11 bp  (direct)


Protein translations of predicted genes

>Translation: 124..837 (direct), 238 amino acids
MTKNNGRIVSRAELLRRLDRSRTRFYEDACKTGRLRNALTADGRVDLDHIDAINYCHEFN
YKEPDVKAEVAKALAAEREKAAAPKAGPTQAVDVHPNLAADAAVDVPDDNDEKSPDEFMN
MTLNELIKRFGTQSKFLDYARARKVLTEIQAKDEDSARRRGEHVPFSLVLALEGHIDALQ
TALLSETTIAVRTKVSALIKAGADDKTIERAVHDMISRTIKTTKATTERMIRDVQRK*

>Translation: 821..2842 (direct), 674 amino acids
MYSASDYYAESAERVIEGLLATTDEREFLTPSQWAEKKRHLPQQHSPMPGPFSFDDAPYW
REVVDCFDPYSPVHFVAVKKGAQVGATVSVLENLIGYGIDYVKTASMIFATVDDDVTKRR
VTNFILPMLRAAQASRSLIQANDFSKGAKRRGASAKGIEWAGGGVLYPFGARSPGKMRSF
PVPWLLRDEVSGWPQSVGKDGDPMKLTETRTNSFANSRKILDLSTPLLAGTDTITQRFEK
GDQRYYEVPCKHCGEYQRLYFRGNAKNGQGRLIWETEGGVLVPDSVRYVCPHCGGEMINE
DKVTIMGEGNWVPTAKPTRPDFRSYHLSAMYAPYYARSWQEIAQAWIECWDDERNQAKDL
DELQVFYNNDLGEAYELKSSRVKPREVYAHRRDYLRGEVPNEHAMAHTGGPIEAITMSVD
VQHTWLAVCTIGWAPSADRSGYAPYIIDYEHVEGDCKTIEGEGWQKLTEIISTRQYTSGG
RVYDIARVGIDASELTDVVYEYCNDWGENVMPIRGRDLPIKGAQIKYFNRQVNEKGVEYL
SVTVDLYKDRWSPTLRKEWSGQGEMPRGMLSAPVDMPDKHLKELTVEYKREERDPDTNKI
IRQVWHRPGGSRNELWDTLIYNTAIFESMVLEACEDIVGLEALVWPEFWAIAGKDPRRGA
TGGLCWDIAPDES*

>Translation: 2883..3161 (direct), 93 amino acids
MATTFDIQQRDRLAAILNAYLDAITALLTANVSEYTLNTGQGSQRVTRLDLQQLQENYGL
LYRQYDALVSRCGNGGVVQLVPEGAAPWLDRL*

>Translation: 3227..4864 (direct), 546 amino acids
MPVVADPSGPVMSVENTSRFNGRGDLAHAGYRYTRHDGEKYEGGLGPIEILYTDYWSQRR
RSNTFFKRNLYARGIIRRIVGNVVNTGQVLAANPNAAMLPIDEDAADTWAENVNDLFEAW
GELPEVCDFKKLYTFGQWQRVAKAEALIAGDVLVVEHHNSATGLPTYELIDGDMVQTPYD
AQNKIGKSHRIDNGVEMTKDGEHVAFWVVQEGFKHKRIKATGANGRRIAWLYYGTDKRHT
EARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGFIQRQIGGKTAGTKPFSSGAQRR
VQDTDFVGDGTERRTNIAETPFGMMVEGLAEGEEIKAFTNDTTDEKFGEFEEAILRAVAW
ALGMPYSVLAMQFDSSYSASRGEIKEFEAVIHEMRDRDADTLLRPVYRSWLRAMVLNRRI
DAPGFLAAARNPREFATYAAWTRSQWYGMVKEAVDLEKETRGRGEQIKLGALTRTKAARM
LTGTSFKTNMRKLAKENQMLADAMRPILELKKEFGEDVVDEEINARDGAGPSLIELEGGR
NAVAN*

>Translation: 4848..6035 (direct), 396 amino acids
MPWLIDQEVLQLMADAPDLTQEQVDKCMASSPLFLGNGPTANADRVMTVAGDTARINMQG
VMTSTPDFFAMLFGGGNCLYCDLFEAIDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKI
RNAKKPSTALVSTASSAAYLAASQADEIVAASRVSRVGNIGAVLSMRRPSTSVYVDVTST
DAPNKRPDPESEEGRRVIREQSLDPLHNMFAQAVADGRGTTLSDVNQNFGRGGSLFAETA
LKMGMIDKILTAETESATSTGGAEANAQDDTTGVQTMDLEKLKSEHPAVYAQAIEEGKQI
GLTEERNRVAFHCNMGLKNGAADVALKACQDGASMNDGTVLSDYITAGMNKTELAARESE
ETELSDNQVPENTEEAAKKANVTKLFAGAHMQVQL*

>Translation: 6051..6458 (direct), 136 amino acids
MAKNGLINDRVRDINGVVLFDAKFKPVELAFAAAETWPAGAVLGRVTATGRYVRYNPAAS
DGSQVPSAILTEAVTQSAAGNITYSVAVSGEFRVGDLTDATGTALTANSSADFALRDYGF
ILRDVYETTFRDNNA*

>Translation: 6472..7566 (direct), 365 amino acids
MANELATGWVQAFIERRSPTMFLSSMFTTKPGGIYYGKTVEIDVKRFSENVAVVITKLSG
PNFNDASLISTKEFEPPEYGEAFATDVDDLLQRLIGVNPYDDANIAYSSKLVGRLMDYFM
EANDMIMRGVEIQASQILQTGRLNLLDRDGETAYEIDYAPKATHFPTVTTAWSDDGADPI
DDLRALFEVIRADGKVNPDMIIMGEQALRWALRNANFQEELDNRRIDTGMIDPRMMASGA
TLYGNVWVGSYMAQIWTYPEGYSHPQTRAFTKYVNDDSVIVLSSQTRLDRVSAIVPLPLG
PDQRVSQMLPDLVPGRMVSEGDDLDVTPNLYPTPNGRTIIAELLSRILLVPVQIDGFGCL
DVNP*

>Translation: 7954..8046 (direct), 31 amino acids
MIDEGKEIKAEWLPGGEDQLAYLIGKGLVK*

>Translation: 8046..8432 (direct), 129 amino acids
MNLRQLAEQDLAITLEDADAGFGWPVTLVDPSGASANLTAQSQDIGLIVDPDTGVAVSGR
TASAVLRVSSLVRSGLQVPEGEPRGSANPWELTTTTTNGVAVRFKVDYAAHDRILGTVTL
QLGVLGNA*

>Translation: 8425..9114 (direct), 230 amino acids
MPRPLTVLINQRDSFEIVRDQVAQILADESANQMALATAEGEDPANWQLNVYRERSIPWD
LLDDGAAQSHSVNVWFDSANVDESAADPTNRQTYIGTINIDIVFGSIALKLADTGYVPAD
QAAALGANRITRLVRNILMSDSYTYLNLRQVDAPRVAVGKRWVRSVNTFQPQLDNQTAHH
IVGMRLELAVRYSEFSPQYVAPTLQTVDCTIAEDAQGRVLAGAQFNYTP*

>Translation: 9127..10629 (direct), 501 amino acids
MGISNAIDNSVRARVLGIKTEFRNFNTGRTFFLPQHVALLGQGNTAATYELTPFRATSAA
QVGQRFGFGSPLHLAALQLLPDNNDGLRSIPLTIFPMGDNDAGVAAEGVLDLSGTASATA
TVRVRIGSQRSSLVTIPTGTTAEQAAALLVAGIQGNPFMPMTAAVDGTNANEVDVTAKWQ
GLSGNDLVVSIEGSIPGITVAITQPTGGAADPEVADTLALFGENVWYTQIVSCFNTANTD
ALDAFETFGEGRWDPILRRPLAVFTGTNETDPNTLAAIGDARRAQRTNLVTPVPGAQNLP
CEIAAAWVARVARSANNDPASDFARLTLPGLTPGTDAQQWTHTQRDLLVKAGISTSIVRD
GVAEISDTVTTYHPTGEVNPGYLFYKDTVKASNVLFNLDLIFNTREWDGAPLIPDDQPTT
NARAKRPKDAVAVLSVLANNLGLDAIISDVPFTLENIRASINDQNPNRLDIIFPAKNSGN
ANIISADYFFGFFLGTQAAV*

>Translation: 10815..11024 (direct), 70 amino acids
LQLSIDDLNDDQEFLQQVADSNNMEPILITYASGAAYQGEGTITGDLQTSSQNTTATVTL
MGQGNLSRQ*

>Translation: 11109..11510 (direct), 134 amino acids
MNEIEYKVDRETAEAEFDQMCDVMGVETDEDIMAKDDREDFAKHKERVIKSIMRGVVVLN
DGVPTVHCSDGDKVTFKEPKGGAMIQPMKKNEDELNRIYKIAGTLTGGTAHLAKRHMRDY
RPLLSLTSLFISM*

>Translation: 11606..11710 (direct), 35 amino acids
MDYAGLPDVRTLDMSQVRFFYEGLAPSIIKAMKG*

>Translation: 11759..13570 (direct), 604 amino acids
MSRPIRNIQRNVERFTRTATRNMAKLRRETKKVTESINGLGAKALAAGGTMATALGLASR
AGIEFEHVINRATVKMGDNVTKGTDAYQGLVDIAKEVGATTQFSATQAAQGIDFLAMAGF
NAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLAK
TSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFFLR
LAAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLAVIQKVFGAEG
LAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRFFE
LSGGPMRDVVDQATAWIRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATLWAFN
TIVTTITGAITLLNLVIAANPIVLVITAAIVAVAALAAAIYLHWEPIKTFFTDLWDGIGN
AFDTFATNLANKWEALIAPIRSSIQWLLEQADALLGVGGAANTVVPAGQGFAVAGGGSGF
VPGAAAPTAPPIVTPEERQAAVFSEQIKRNVVELRINDPQGRVDLDGESATDDNVRITKT
GNF*

>Translation: 13570..14823 (direct), 418 amino acids
MVWQTRLREAAYTPPSGNRLTFIYTGVSEEFDQKGGPFDFAGAQGTYVQFLGVTGRRYPM
TIIVSGDDYDLDAAAWMRALAEQGEAVLEHPAYGRLTVAPVGTVKRSENFVNGAGQATIE
VTLFETTGAVYPSPQQDPVAAVEQAVAEADAAAAAELAAAPYAETVGEEASFIDAINDTL
DTVDNALRTSYQAVDEVERQVRQVQDSINRAIDVLVQQPLSLASQVQQLIQLPGRVVSQA
TARLSAYGDLAGQIAGSFDSDGNERPERRIAATRQTLYANNLAALGTASSTALASIRTEF
TSQEEAVEQAEQLLNMLDDLAAWRDQSFNELGQVDSGAGWQATQQAVATAAGALVDISFG
LQRQRIYVADRPRNIVELAYQLYGEVDSRLDELINNSNLSGDEIIEIPRGREVIYYE*

>Translation: 14816..16069 (direct), 418 amino acids
MSRSYTVRQGDTFESISRQVYGQEQYSDDIRQANPGAGTQPAVGAVLVIPNVPELQLSEG
RARNAQVTQDPDEVTLRINGLNFRFWRAVTITQHLDAVSTVSLHAPFDPNDQQSRDAFRP
YSYQPVAVDVGGERLFSGTLVNPQPTTAANERTVRASCYSTPGVLGDCTPPASAFPLQWD
EATLQTIAADLCRPFGVQVLAPNGTGQTFERIAVEPAEKVMAVIARLAAQRNLVVRSDEQ
GRLVLLRPDTQGEPVAEFIEGQQPPISVTPTFGNQDYYSHVTGITPTIVGLEGPQATVRN
PRLEGVLRPFVYNADDMDEADLVQAVQSKAGRMFAAAATYDVPVPTWRNANGGLWRVGDF
VILEAPGAQVYRRTLMQIKTVRFTATPAERTAVLELIIPGSLSGQLPEALPWEGSPQ*

>Translation: 16048..16692 (direct), 215 amino acids
VGRFAAITNFIRKAAGVSDVRCNPGGGATVDAYHAQPAGDDCHPLPTDTTVLVEVPRSNN
YAAVGIIDPNNAQTAGPGERRVYSRNANGEQVAEVFLHDDGRVRASNDNGSVDLAANGDI
VATNGSASAALVGASVTLSDGAGGSISIVNGVITLTGAAIRLAGPVDANGATISAAGQIT
DADGLSVHGHNHTQPNDSDGDVQQPTSTAQIPTP*

>Translation: 16705..17112 (direct), 136 amino acids
MTLAFSQNVRNQRLAVVAAAADAGSGPALIRIYNGTRPPTGGPVTTLLAQLEMSDPAFDA
PANGTMTARAITPEGSTPIGGTATWFRITDSEGNFVLDGDVGLDGSNAELELGDVNFLSN
QEVRIATMTINDGNE*

>Translation: 17116..17781 (direct), 222 amino acids
MARLGQRVNAGEPLAQCGEPQMQAGNFILTAATSDMTTADATLRASGEMRIIIRGSASLD
IAPASMDASGETVAPPIVIEPPGDLPPAGIRDVWMYQTVDDGNVYPVNGDLYRTDGLETA
VYLSLYGGNPEDNGQDANRLGWWGNADQDDPARQMVSRFQHLVEGIPLTSGNVQRLEDAA
AADLEWLSSLGYDVRTSGRIAGKDKLHMTINIDGDEFVITN*

>Translation: 17791..18960 (direct), 390 amino acids
MVAPTTPTTQSISDNIVAQISSQLGQGAPIFAKAFTRVLARALSGVVVTLYRYGGFMFLQ
MFIRTASNTPVTINGQTVTPLAEWGRILGAGPQRAAVQTQLDVSVAVVTQGGVLASGEQL
QGPNGIIYITTGDTLLNAATVTVRVTAAGDQQGGDGSGTIGNLADGAVLNFVQPLAALQP
TATVSSTARTGIDAETTEAYRSRIDERTQRRPLGGAPVDYQLWAETSSAVLNAYPYTGDT
PGTVAVYIESSTEADGIPTNGQLLEARTAINQSPNGRRNRAPVGTLVNTFPITRETYNVT
VSGLNVDNPADVRRDISNALAEYFLQREPFITGVTTGVRRDQITQIAVGGVVEGIVTAAG
GTLTGVSVTNEAGDAVVSLNLCRKALRLN*

>Translation: 18973..19695 (direct), 241 amino acids
MRNTLRLLLPRGRAWHTTLDKPLRKFMDALGDTFGDVRDHLGSAYTDLLPAYTRRLDDWE
RQFGLPTVDISEEDRRQRLEAAWRPIEGQGIDTLQDVLQASGFNVYFHNWWVPGTEPPPG
SHAPPSVRDPSEFLSLNEARVNCGEPLAQCGEPSAQAGNSLQMPPAPQGRLLVNIIQTDT
GTVEYAIPTDPAEFPHIIYVGGETFGSYASVPFARREEFEALILRIRPAQKLVGLLVNYT
*

>Translation: 19708..21042 (direct), 445 amino acids
MAIDYRNQPRYDGKINEADLANYPQGKAQNVTTPGDNTGTPWDQIGLNDEWGFLQSILAR
AGVTPNNQPDSVGNPQYLDALESLLAPPIGSLQDTRVVLSAPQWLLANGQAVSRTTYSEL
FAVLGTRFGQGDGSTTFNLPADGASKEPGQEFVNLGTGTGLPTYNSEARFARAPNGNMYF
VATGPTRLYRSTDNGLTWTTTNIGSGLPANFDNPAIMVGQNNYVYFIDRTNADLRRSTDD
GVSFTNVGVGGGLPSSIGAPFLASAPTTTGSNPEILYFIDSANDTLYVSTNEGINWTTFG
VGDGLPPNGLGEMIGAPDGYFYGIDIVSDVLYRQVPPSTTWSSVGIGNGLPSTISSPSMS
VDPDGTLYFIDGADRDRLYRSTDRGVTWSSQFLGQGLPDGGNPSLGTSATGSVFYIEPES
DALYASLSNDVHVAGQLATYIRAQ*

>Translation: 21045..22394 (direct), 450 amino acids
MALNYRDITAFEGVIDETDLENYPYGKPQNIIVEGDGRGSIWDQLNINDWWGFQQSILAR
AGVQPNNTPDSVGNPQYLDALTAITAPRVGQTLRTYGVLPEPQWLRTDGRAVSRTEYADL
FAVVGTQYGAGDGSTTFNIPQEAGQEPTPTPGETFVEKQSDGYPRDVRSTAIAGLPDGTI
LGAFGRFSGTPIRKSGDGGDTWLSTGLGNGLSSYNIPVMAANDSGTILLIDINNDSRPLF
RSQDGGENWSDTGIGNGLTINGGDLAAAPNGDFYYINRDETIFRSTDNGENWAQVSGQLP
SHSFAGITVDSTGAVYVFAVFDVVPSVTRVYRSVNNGANWTLVAEEGDGSGLPEVSNRSS
LGAGRNGDLYIIAPDRNQGYRSVDNGASWSPVSGLIDDFSFAYVTSTPNGNIYVVAGTFS
TGGVYASEGSGGPAPGTGLTAEFYIRAQQ*

>Translation: 22557..22808 (direct), 84 amino acids
MELTSGGRCPNHPDQAKKSDPNGGDHPKGNGVDIRVHNRQHYDKIALLAGRHGFNAIGDG
LKYGFIHLGRRPENGDRVSAWGY*

>Translation: 22810..23406 (direct), 199 amino acids
MTNVWDIVKTVGSAVISHAVPGGGLVLDLINGFLDDDKKLPVTATGEQAMRAIQSLDPAQ
QSALLSKQLDVKIEDIKQSHDTLRTMLNADKDNPQTTRPWIAKWAFVFTALFSGVVGLVI
IVAYAYAVYRQDVELVKAIVDGWPFVLGILTAVTGTFSLLLRAYFGVLTKEHENRIGAAT
GRYKPSAISGLLAAMRPK*

>Translation: 23525..23758 (direct), 78 amino acids
MARTTKTLTASRQEVATGRCVITILRSGKYALNDTDTATAQSLSYFKAGEQVVQTERKPT
YASAPDGNGLIIVDEEG*

>Translation: 23763..26282 (direct), 840 amino acids
MLVKLSRSTRYLGALMGVGGSGGGGGGSVPDERVFADLDARQAWTATNASSLEPADECIV
LDDIDGVSFYAWTGFGWNSVDKIYQGLKGSKGDDGASMVSVAFSGDDILVGLSDGSTIPL
EGGKSELTGDSVASVAFDGLDMVFYDENNVEIDRILDAANTLRGARGQSFFVRFARTSAG
PWSDVFDSDPVSGSFFWQFSYDNKNTWTEPQPCRQLTTVNIENPITLDLGNVKLGSSGDS
FAIYNADRRISGYPVMSGIYDDGTTVRAQELVPDGPKVLNAVFGGAPSGVAVDCDYTYNN
PFSAAVLATRVLPMETYSGKINVDVYSNSGELISRHVQTVNVTVGQPAYLSMEDGPKQVA
EASRTYRMLVTKTADNQPLQVQGSDNTGPNPGGPHRAVDYQPLAYRQSLAVGDYELIFDE
LNNAPEGPGLNAEKVKGIEAFIENKIGTDRGNIDFSGDLSQFSPANKSDYWTAVIASGSE
SIGGLTFNNGDRLIAVQSNDVVPTTLNSPSWRVDPKTVPQASATTLGGVKISTTSALEIN
ASTGNLDVKAATANQAGALKADAVGGAPTIGPDGKLKRSVIPVQTGTQTKFLTLESELLT
LPITSDAYIVNVSSTSRRWGLDSNADPSKLENWALLGVFGDSVTSFNDRPGDVEPLAGDY
DADMVDETVDRVFMTPAQSGKLDRYNETARYADFETDRTYEKYDRVRNAAGFIIEANKRT
PANAVPLEGDWDVVSYDYVVVESDRNVGEDKAHMSYVVPATRGATTLRLATAFPTAAHSY
DIYVRGDRDETTEVKVQLTAGVTAGPGTDADGYRIRSAAKLEVRCDASQVNVNYILLDG*

>Translation: 26287..27408 (direct), 374 amino acids
MTGTIIEKPKTLTSDSVVGESGTPSAGAPASSEWLAEVAAKALNKDTSVLPDLDDAVPTD
TNKVPNAADTKEKLDKKLDKALTNAASKVIVTDADGNIIASADVSLDKLKAVGALSTDGA
IVAVDAEGKVSETSLTKTKAEALQALATKGAVVVVGEDGQLIETGLTFTATDENGTEVNR
LSQTAAPASTASAETDMLTKKDADGLYQPIGAALEGITLPSRGIHTLEPDEDEDTKIFEF
DTLAPLTGGTPTTVSVYASTGGDQVEFYLSSRVPYFAKFQGTKAVTTRTNGEVTASEVSV
VCSAARRWTSATAGHATAQTLTTSSFNREDCVSLIMQFGKGGGSVHNFTCACQIELTPVD
SGQMNSYGDAAIG*

>Translation: 27638..28117 (direct), 160 amino acids
MALADFSMSDLILFVTVLLVFRLRGAELVAAGAFVLCSILHTVATLVVPVSDIVYYASAV
VADFVALIVLCGVRPVCGVVYGLARVCVASMALNLVGLLCWWLYLPAWPYNLAFMVLYVV
ALGIIGSGGANGMGGRAVRWRFSRLAFPASASGAGCNKA*

>Translation: 28126..28392 (direct), 89 amino acids
MNVIERFTQLINDVKVAMGIGVGTASGGVGQWLDLLPDNALTKTATIIGMCLSLVLIYTH
LKRHIREERMAKVDLQIKLKALKEEGPS*

>Translation: 28411..28722 (reverse), 104 amino acids
MIADKYGVGTVLFENEDKPHPCAQFGDGKLRIFTTVWSPRNVGLKIARTVEDLPPFSAHT
YDEENPPHQTDGNEIHLLFDDPRSIDAMITQLEYIKGVMLGGE*

>Translation: 28712..28936 (reverse), 75 amino acids
MKQPRKRSFKARALEAETERDLLRTQLFNARAALGLVANGGLDTDPKVVQYWAGELVKGL
DETLKKTERKQDDS*

>Translation: 28908..29468 (reverse), 187 amino acids
MYKYFQVNSDVGHYLVDKYNREVNPKREELVTDLLGAVGAVGVVLYREWGMPATIQALVF
PADHDICRAEGVKLTEHRQGMVVQFDNDSQYAGIYYEPISHLNQQLLAYPDFSDWVVHTM
GVTRYALADTDGEAKKKIATYSHLLTDGRIIFAVPLGNVGQKPINPDRRMVEITKADYEA
ATQAQF*

>Translation: 29461..29640 (reverse), 60 amino acids
MTDTPKIISVQVTPNDTHWQGALIGLDDQGTTYISDRVGGANKWVEYVPALKPETKDNV*

>Translation: 29633..30187 (reverse), 185 amino acids
MRKPLTEINWLRLGTTDEHKELKHELAHRVRAVAGGDYGDRPLPAIILVFNEHGWTVVVP
EGVSVAASERLLAVCIDLARAAAIDEVCMHDILRTFVELGVRLHDVDPRPALTPERADGR
TFIEFGARPVAVPIAETLEQKVRAALAELNDRTEISDLDLQDDEGDRDAVLDFIQNIREA
VHND*

>Translation: 30193..30678 (reverse), 162 amino acids
MTNNAKTTDEALDAIRGVKGLYAQPISNYTASIKILDSKNLARLTVELPLDQLSDRRDAR
MFDPRNPMFDTLKMVPFLAFVDVSEFEERFAEPTAAPKLGANGENVETEFLKSDYLTHHD
DFVRACEIARDASDGEGGELYWQRQINTLARLAEDYPFKRG*

>Translation: 30737..32797 (reverse), 687 amino acids
LKDGKGRRAYIPGGPKPTDLIDHIANGGPIAAWNITFEFWIWNMVLAKSEGWPWLQLEQC
YCDMAKARRFSLPGSLDTAAKALRTAEKDKKGKQLIGKLCRPVSATKARPEPRWTLYTAP
QDYADLYRYCDQDVKSEDHVSALVPDMTPAERETWLADQRINARGVLVDVESLDNALDIV
GQTTRRFTMELAAITNGAVGSVSEVAKLTDFLATVGCRMHNLKSETVAETLERDDLNPTA
RRILEIREALAGANVKKLFSLKAQISSDGRLRNQYNFFGAGTGRWSAGGVQLQNLTSKGP
KSKTCNGCGRIVGKSCRVEVMGAAGLCPECGANDWTDNKDWTVDAVRWALKDLKHRNLDL
IIDIWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRIEVFRTHGKI
YEMSASKISGVPFEEMMAYKEKNKHDHPLRKSLGKVAELASGYGGWIGAWKAFGAEEHFE
DDDAMKKAILGWRDASPEIVEFWGGQYRRDPYTRKWVREYYGLEGAVIMAILNPGKCYAV
GDITYAVFDDVLYCRLPSGRFLQYHQPKLIESDSWKGPEYQVTFMGYNTNSQKGPVGWVR
MDTYGGRLAENVTQAAAADVQAYAMKQCENNGYPVVMHTHDELIAEVPGDFGSVEEMAAL
MTGREPWREWWPIRAAGWEDDRYQKD*

>Translation: 32857..33582 (reverse), 242 amino acids
MNIEQWVGLAAAKEHEPRGLHLLRVESGMAYASDGHRAHWAPTYFADGYYDPRTLQPVPG
AHTVDLVGHLQKHVWGTTIVQWEGGDLREGTLHYFPKDQLAQAQPIGNLFINDNHQVTGV
GFSEGMFTIAACKGVEKPAQTPPAAEKPAEAPKVDPIKFLALEADTGRYVATMTKPDGHY
MKPVEADQLPAGTVIHNVGMGRSTVYPSFDFETYSEAGFVINPETGKVHTAAKTGNKNGM
R*

>Translation: 33653..34714 (reverse), 354 amino acids
MHNDNFLKLVDATVVWDGINRPDQLEAKPGQAPGNKWSVKIVFPPNHPDLPLLENLAWAE
LNAGEFQGTLPNGGMMPVSDVGPNEFNGMFPGWKCVNVSTFQQPQVFNQGQMLDPMQWNQ
FVYSGQRVSVLLHCKTYNNKSRGVAARLDGLEVLTEFNAPRLQFGGGANAMDALGGGNNG
GGQPQNNGGQPQQQQPQQQQPQQQQQQWGQPQNNGGQPQQQQQQWGQPQNNGGQPQQQQQ
QYEQPQQQPQQQQQQYEQPQQQQQQQQQQQQWGQPQQQQQQQQQQQWGQPQQQQQQYEQP
QQQQQQQQYEQPQQQQYNNGQPQQYEQPQQQQPQNNNNGGGQYPQQAQNFMPQ*

>Translation: 34732..35859 (reverse), 376 amino acids
MGGQHASIPPSGAPIWGHCSGYLGVMHKVADFDTDATRAGTASHWAFSEAVLNDLDAEDY
IGHADPDGTIVDEGMAHGAQLMVDMVREDMEKHPGGQLYIEQRVYMPDIHPDNWGTLDYA
YVLPAIKRVILGDYKHGHLEVSPERNLQLVDYLKGLENLVGYTFDGWRIDLKVCQPYAYS
PHGPKKVWQASRADIVPLWAQLSEQAHSDPHMTTGKHCRYCPLVGRCSAARQAGYSLISY
VKEPFEVDTMTGADLEAERDILTEGSVILKARLEDIEEQLQNRLRKGEKGIGLTLRQTTG
RAKWTQSDKVVITALKSIGLDVSKETTITPTQAKDKAKTEAQKAAVKALQRRPNGKIELV
KLKDSPAHRAFGANQ*

>Translation: 35863..36714 (reverse), 284 amino acids
MLKIEFDAANKPLAAAIGQALLNYADAKLDPASAAIVSQVKANHSDPKESTDNVSAPGSD
TPSESNSKTFTETPSETQTGESTTTAQSAESDDANYVFDQHWKSASGLDYQGHPVHPEYG
VPVDDKGYPLIDIYGVKHDPDWCQKAAKPFYASGKDKGKWKAKVGTDKDAFAEWHYAQLS
ELGKQSTDGAAGGPTDEPQADAASAFGSQAQQPANNGGAARPTANIGDLMGWIGEKQAAQ
VLTTDQINKAYTDCGTDIASLAAPDNTAGREAVFGYLSGICGV*

>Translation: 36801..37004 (direct), 68 amino acids
MEYIDEHVRAHLANYPKGRTVNQLARELGVPRATLRACMQRLETTGAAFTGDATFAGNKY
WKARAHV*

>Translation: 36997..39153 (direct), 719 amino acids
MFKLRDYQQLAYNRIMAAWEKYRSVLAVLPTGAGKTVIFSKIIHDHTGAAAAIVHRREIV
AQISLSLASFGVKHRVIAPAKTLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLTSKSTLN
NAAIMRWVNQVTLGVYDEGHHYVEQGIWAKGVHVFENAKLLFVTATPERADGVGLGKGEG
GFAEVMIEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKNGDFNAKALKARVVESHI
VGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLAQF
EEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIIDP
VRNWERHGPPTAVRNWSLMGREKGQGSGGDTIPQKTCDACTVPYEAYYKACPYCGNVNEI
ADRSAPDKVDGDLVELDLDAWNALFAEIERAKMSDDDYELDMIKRGVPQVGRGPELRRHR
AAKYRREVLDNVVRWWVGMQQQLGRDMSEIYKRFYFRFGVDIGTAFTLNQKDTDALTAKI
RERFHERFKLMSNTFLTMHGNFDFDLSGDKFDLPIKTIAYHLSHINRFNGAVGQYSVAQH
CVQVAALLPANLKLAGLLHDATEAVLCDVPAPLKRMLPDYQAIENRLQDAVDARFKVKTR
HKRVREADLSMLAAEARDFGLDLGPLGFEPVSATIKPWPAKTAEQAWLAAFHAYKGTY*

>Translation: 39153..39662 (direct), 170 amino acids
MSIEFDQWAARWPQAAAELVAVMADTAPSPVTSPASEARAQQEARMEAGRRGGVLWRNNV
GATKAKEPHVCPNCAFKFEVRKPPLRYGLCNDSEKLNAKIKSSDLIGIKPVLITPDMVGQ
TIGQFWAVEVKAPGEPINLRDERQKGQAAFGALVERFNGTFEFSHGGLT*

>Translation: 39708..40043 (direct), 112 amino acids
MAKQVTKRPAPAVRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQL
KRAIVRGAIKADDGLVVAMAVKAGDFQRAKVDELLYIRGEHQLEGWSPCLS*

>Translation: 40031..40216 (direct), 62 amino acids
MPELVEFIVSPCGCRVTKYHGGQLDDFQLLSTPEAADKWVEREQKVYEHRVPRVEVKVTR
L*

>Translation: 40402..40560 (direct), 53 amino acids
MRLIDLFIILAGALCGFCVYLATEGHILSIMALAGLLCVIGIIYEVTKNAIN*

>Translation: 40580..40912 (direct), 111 amino acids
MPRSLVREFKRLNDIMFDGSLDLTLDEEKQVLEHVRTIEGARNLVTIGQFLRIMPTDKRR
RSALLAYLFDEDRPSGYACDACYGIDTQYTKRAAELLVRKVQQAQELVNA*

>Translation: 40909..41361 (direct), 151 amino acids
MKLRDYQRAALLGLEGVCGEPTLEPAIRFMELHKSAATGKSVAPSATIVMATPTARRLLG
GPELQNVEPPEFVKMSYADFEAAVMARFAENVGVDLAQLVVSIQRMGRALRPDMPEVPRW
DEVKPGTLKVAKNERTKGSHPQPHYYKGRW*

>Translation: 41365..41574 (direct), 70 amino acids
MTHQYKYRCAIAPADGLRLQQDGQNLRVDIEFGNGTGHTTPVLLDREQALQLARDLADYV
GLDKVTVRD*

>Translation: 41578..41766 (direct), 63 amino acids
MFKKIKVLYWQWVVGCAVEDIQHIDNELPYLNPLAKTQALRERRQAINDLITARERLLKL
EG*

>Translation: 41823..42002 (direct), 60 amino acids
VHFGKVVERILRNEAPSQDDISTLVQHYAFDPIMTTLDLQDVVDELNVRFGTNHTITGR*

>Translation: 41999..42265 (reverse), 89 amino acids
MKTTTLTDMQRKALEWFTGDRKRVLMAHINDEGVECFDGVQGLHTTMCARGLKKKRLIEQ
AGRVGDKTAYRISKKGRRVLAYNKEKRA*

>Translation: 42262..42570 (reverse), 103 amino acids
MSTQDNHKTTTTLSVPREFVGDVVKLVADKKREKQARTIVITPEDLGTMKYFIEEKGDVT
RWCDWDEKKAAIFELYPGLRRALNNLELAETQLRHELEGIDV*

>Translation: 42704..43117 (reverse), 138 amino acids
MKVLTGDAYELHAIAADVPHGEDVTGLVDAMTAAMTAAGGIGLAGNQLGVLKRVIVVRAP
KFKGCIVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHNKITVEGFDAHWQPIKVEAKGL
TAFCIQHEIDHLNGVTI*

>Translation: 43114..43416 (reverse), 101 amino acids
MNYTHDMDMHLEKYAQTRTAEQLAADLGAPNAKSIQNRCQRIGISLVKSGRNHRLAKLTA
EDIRLAKLLIDDGELTMAAIERKLGLYKGAAKRIKEGILT*

>Translation: 43413..43784 (reverse), 124 amino acids
MSGATRMWRVMFLDERHCEIAPDVLLDFDPRGQMGEPMHTCTARLMTWHIISERTRAVAV
VSDVFLRWGRVWDGYDLFQHRNYDNLGSWFNAHTVKPVLGARGDDGPYNRVPPRLEEVID
ECQ*

>Translation: 43781..46156 (reverse), 792 amino acids
MNQTAQQLKAHGLGAFPCLQNKMPAVPKGTSWKDWAHAELNALPWPSDIVGVPIPSGVVV
LDLDTYKGITREYVEAGAGFAIPWDAAHIQTTQSGGQHYAFRAPNWPVKNISNAKHNETG
VQFEGLDVRSAGKGYIATGEPYYTPTHLGGALAMAFPQMLPPLPEGLRPWLEAVAHESSE
RVEVTDEDAKTIREALRHIDPGETREEWVKVGLSLKSGFGDDPQGLSLFDEWSSGALWRD
GDEPANYVPEHIETQWDSFKAEGGRTIATVYHKAIEGGWQPPAGINAFDVFGEGATDSTT
FAAIVDSLQQHGGDPKQTQTLIDTIIALPCSDMQRAMLAATLSRELKDADLLTKEVRAQI
ARITGNTAAGAASKVPVTPKGQRLAVNQPMHPDLWAPFHTTGKDMKPRGTIDNFEIMMSA
YGVQIGFDEISKELSIVVPGVNYGGALKDEAALSEIESIGNLNHYPKSDIKGMIAYMAHR
YAHNPVKAWVESAPWDGLDHVGLLFSHITLTPDEDRPMCEHLFRKWMRAAVSAGVGDQEG
CEPVLVFVDEQGGVGKTRFFRTLCPEPFRADSVLLDVKDKDSVKQAISYWLVELGELDGT
FSRSESNALKAFLSRTKDDIRLPYGRTNMKYPRMTAYVGSVNEAEFLVDTTSNRRFIPMK
VAALNHQHRVNTQQAWAQALAEARSGAVTYVEAHEVAERNASFQATSAIDDVLSSRLNEA
TGERNVHMTVTDLLKRCGMHNPTKRDLNDAGKWLRSNGYEKRIRSGVRGFMLPDMSIGAQ
AFGGPQLEVVK*

>Translation: 46198..46371 (direct), 58 amino acids
MRIIRGLTLKPASLLSLNIRTTVRVVQPKASAASLVEIWGLLSIVLALFVLLVTGEI*

>Translation: 46690..47442 (direct), 251 amino acids
MCTLPARVLESGRTDSEGKTMFEINSKQMRQLEDALQQTNDLGVKFAQRATVNDYAFETA
KTAKENIRRDFINRNTFTQRSVRVDKATRFIDSSEVGSTADYMREQEEGARTAAPLNVPT
PAASGESVRANKRLKAVRKANRMTSITLKKRSRRAGGLTRKQQNIAAVKQAASEGRKFVY
LDRGKRKGIFRLYGGKRRPRVRLVQDLSRSVRVVPQHQWLEPASDKTMERRAEIYWRRLA
QQLRRANFPR*


Input Sequence
Title (optional):


Sequence:


Sequence File upload:


Use alternate genetic code:
      Mycoplasma (TGA = Trp)

Output Options
Email Address: (required for graphical output or sequences longer than 1000000 bp)


Generate PostScript graphics
Print GeneMark 2.4 predictions in addition to GeneMark.hmm predictions
Translate predicted genes into protein


Run 

Web pages maintained by GeneMark administrator, gte851w@prism.gatech.edu. Please send any suggestions for improvements or problems to the web page maintainer.