Heuristic Approach for Gene Prediction in Prokaryotes (Reload this page)
Reference:Besemer J. and Borodovsky M., Heuristic approach to deriving models for gene finding, NAR, 1999, Vol. 27, No. 19, pp. 3911-3920.
[ Download PDF ]

The models used by GeneMark.hmm 2.0 and GeneMark 2.4 are derived from parameters measured from the input sequences and knowledge gained through the study of various bacterial genomes.These models have been shown to accurately predict genes in bacterial, viral and plasmid DNA sequences. Please note that email is the only way to receive output for sequences larger than 1 MB.

UPDATE (June 1, 2001): Web site has been redesigned and moved a to new, more powerful server
Listing of previous updates


Gene Prediction Results

Information on input sequence

Sequence title: SIO1
Length:         39898 bp
G+C percentage: 46.18 %

Parse predicted by GeneMark.hmm 2.0

GeneMark.hmm PROKARYOTIC (Version 2.1)
Sequence file name: sequence,	RBS: N
Model file name: heuristic_no_rbs.mat
Model organism: Heuristic_model
Tue Mar  5 18:00:09 2002

Predicted genes
   Gene    Strand    LeftEnd    RightEnd       Gene     Class
    #                                         Length
    1        +          <3          56           54        1
    2        +         441         851          411        1
    3        +         820        1521          702        1
    4        +        1581        2429          849        1
    5        +        2531        3346          816        1
    6        -        3161        4507         1347        1
    7        -        4688        5254          567        1
    8        -        5251        5661          411        1
    9        -        5663        6256          594        1
   10        -        6348        7970         1623        1
   11        -        7972        8463          492        1
   12        +        9334        9489          156        1
   13        +        9486       10076          591        1
   14        +       10100       10528          429        1
   15        +       10831       10980          150        1
   16        +       10980       11648          669        1
   17        +       11645       11959          315        1
   18        +       12008       12241          234        1
   19        +       12234       12602          369        1
   20        +       12595       12843          249        1
   21        +       12831       13067          237        1
   22        +       13069       13257          189        1
   23        +       13387       14940         1554        1
   24        +       14945       15166          222        1
   25        +       15171       15518          348        1
   26        +       15502       17244         1743        1
   27        +       17261       17839          579        1
   28        +       17832       18272          441        1
   29        +       18265       19128          864        1
   30        +       19118       19522          405        1
   31        +       19515       19700          186        1
   32        +       19693       19998          306        1
   33        +       19995       20177          183        1
   34        +       20174       22189         2016        1
   35        +       22298       22534          237        1
   36        +       22518       22967          450        1
   37        +       23274       23672          399        1
   38        +       23673       23861          189        1
   39        -       23917       24057          141        1
   40        -       24062       24316          255        1
   41        -       24306       24566          261        1
   42        -       24559       24897          339        1
   43        -       24897       25088          192        1
   44        -       25069       25671          603        1
   45        -       25671       26453          783        1
   46        -       26602       28359         1758        1
   47        -       28362       32816         4455        1
   48        -       32831       33265          435        1
   49        -       33267       33725          459        1
   50        -       33703       35850         2148        1
   51        -       35819       36136          318        1
   52        -       36123       36872          750        1
   53        -       36874       37446          573        1
   54        -       37564       38505          942        1
   55        -       38518       39285          768        1
   56        -       39288       39530          243        1
   57        -       39527       39826          300        1

Listing of GeneMark Predictions

                              GENEMARK PREDICTIONS

Sequence: SIO1
Sequence file: gm_sequence
Sequence length: 39898
GC Content:  46.18%
Window length: 96
Window step: 12
Threshold value: 0.500
---
Matrix: Heuristic model
Matrix author: MB/JDB
Matrix order: 2

List of Open reading frames predicted as CDSs, shown with alternate starts
(regions from start to stop codon w/ coding function >0.50)

Left      Right     DNA         Coding Avg   Start
end       end       Strand      Frame  Prob  Prob
--------  --------  ----------  -----  ----  ----

     441       851  direct      fr 3   0.71  0.69  
     618       851  direct      fr 3   0.75  0.81  
     624       851  direct      fr 3   0.75  0.78  
     687       851  direct      fr 3   0.66  0.17  

     712      1521  direct      fr 1   0.59  0.17  
     778      1521  direct      fr 1   0.63  0.02  
     802      1521  direct      fr 1   0.65  0.15  
     820      1521  direct      fr 1   0.67  0.19  

    1581      2429  direct      fr 3   0.61  0.75  
    1611      2429  direct      fr 3   0.63  0.13  
    1635      2429  direct      fr 3   0.62  0.03  
    1920      2429  direct      fr 3   0.62  0.21  

    2531      3346  direct      fr 2   0.83  0.45  
    2558      3346  direct      fr 2   0.86  0.55  
    2600      3346  direct      fr 2   0.87  0.18  
    2771      3346  direct      fr 2   0.86  0.10  
    2777      3346  direct      fr 2   0.85  0.01  

    3161      4222  complement  fr 1   0.67  0.71  
    3161      4045  complement  fr 1   0.67  0.71  
    3161      4507  complement  fr 1   0.66  0.43  
    3161      4039  complement  fr 1   0.67  0.85  

    3245      3346  direct      fr 2   0.50  0.18  

    6348      7970  complement  fr 2   0.52  0.14  
    6348      7916  complement  fr 2   0.53  0.44  
    6348      6743  complement  fr 2   0.66  0.01  
    6348      7838  complement  fr 2   0.52  0.74  
    6348      6935  complement  fr 2   0.65  0.10  

    7972      8442  complement  fr 3   0.78  0.11  
    7972      8463  complement  fr 3   0.77  0.66  
    7972      8319  complement  fr 3   0.77  0.28  
    7972      8283  complement  fr 3   0.76  0.02  

    9334      9489  direct      fr 1   0.55  0.76  

    9486     10076  direct      fr 3   0.68  0.22  
    9582     10076  direct      fr 3   0.73  0.40  
    9627     10076  direct      fr 3   0.71  0.20  
    9639     10076  direct      fr 3   0.70  0.03  

   10043     10528  direct      fr 2   0.56  0.53  
   10052     10528  direct      fr 2   0.57  0.26  
   10100     10528  direct      fr 2   0.59  0.35  
   10208     10528  direct      fr 2   0.52  0.30  
   10316     10528  direct      fr 2   0.67  0.68  
   10322     10528  direct      fr 2   0.71  0.63  

   10980     11648  direct      fr 3   0.66  0.50  
   11013     11648  direct      fr 3   0.68  0.09  
   11232     11648  direct      fr 3   0.70  0.17  
   11454     11648  direct      fr 3   0.60  0.10  

   12008     12241  direct      fr 2   0.70  0.81  
   12164     12241  direct      fr 2   0.54  0.02  

   12234     12602  direct      fr 3   0.82  0.80  
   12549     12602  direct      fr 3   0.56  0.22  

   12595     12843  direct      fr 1   0.58  0.13  
   12655     12843  direct      fr 1   0.61  0.81  
   12691     12843  direct      fr 1   0.55  0.05  

   12789     13067  direct      fr 3   0.60  0.44  
   12810     13067  direct      fr 3   0.65  0.75  
   12831     13067  direct      fr 3   0.71  0.11  
   12933     13067  direct      fr 3   0.80  0.21  
   12942     13067  direct      fr 3   0.79  0.11  
   12951     13067  direct      fr 3   0.77  0.29  

   13069     13257  direct      fr 1   0.66  0.66  
   13099     13257  direct      fr 1   0.68  0.04  

   13372     14940  direct      fr 1   0.62  0.12  
   13387     14940  direct      fr 1   0.63  0.05  
   13519     14940  direct      fr 1   0.67  0.53  
   13561     14940  direct      fr 1   0.67  0.42  
   13801     14940  direct      fr 1   0.63  0.12  
   13816     14940  direct      fr 1   0.62  0.01  

   14945     15166  direct      fr 2   0.68  0.03  

   15171     15518  direct      fr 3   0.63  0.81  
   15195     15518  direct      fr 3   0.67  0.25  
   15312     15518  direct      fr 3   0.74  0.55  
   15375     15518  direct      fr 3   0.68  0.17  
   15414     15518  direct      fr 3   0.60  0.73  

   15502     17244  direct      fr 1   0.83  0.92  
   15661     17244  direct      fr 1   0.84  0.45  
   15742     17244  direct      fr 1   0.83  0.04  
   15847     17244  direct      fr 1   0.84  0.26  

   17261     17839  direct      fr 2   0.86  0.73  
   17303     17839  direct      fr 2   0.89  0.71  
   17441     17839  direct      fr 2   0.86  0.17  
   17615     17839  direct      fr 2   0.83  0.12  

   18265     19128  direct      fr 1   0.68  0.71  
   18382     19128  direct      fr 1   0.67  0.00  
   18394     19128  direct      fr 1   0.67  0.00  
   18595     19128  direct      fr 1   0.68  0.35  

   19118     19522  direct      fr 2   0.67  0.20  
   19265     19522  direct      fr 2   0.68  0.06  
   19304     19522  direct      fr 2   0.65  0.62  

   19515     19700  direct      fr 3   0.65  0.74  
   19596     19700  direct      fr 3   0.78  0.36  

   19678     19998  direct      fr 1   0.65  0.94  
   19681     19998  direct      fr 1   0.67  0.93  
   19687     19998  direct      fr 1   0.67  0.83  
   19693     19998  direct      fr 1   0.70  0.70  
   19765     19998  direct      fr 1   0.76  0.62  

   19995     20177  direct      fr 3   0.51  0.23  

   20174     22189  direct      fr 2   0.64  0.05  
   20258     22189  direct      fr 2   0.66  0.44  
   20291     22189  direct      fr 2   0.66  0.13  
   20384     22189  direct      fr 2   0.65  0.04  
   20756     22189  direct      fr 2   0.63  0.03  

   22298     22534  direct      fr 2   0.62  0.29  
   22313     22534  direct      fr 2   0.65  0.14  
   22412     22534  direct      fr 2   0.56  0.07  

   22488     22967  direct      fr 3   0.73  0.09  
   22518     22967  direct      fr 3   0.78  0.65  

   23274     23672  direct      fr 3   0.87  0.04  
   23292     23672  direct      fr 3   0.87  0.28  
   23364     23672  direct      fr 3   0.84  0.11  

   23917     24057  complement  fr 3   0.58  0.91  
   23917     24039  complement  fr 3   0.57  0.78  

   24062     24316  complement  fr 1   0.53  0.57  
   24062     24286  complement  fr 1   0.61  0.04  
   24062     24100  complement  fr 1   0.59  0.00  

   24306     24503  complement  fr 2   0.74  0.60  
   24306     24566  complement  fr 2   0.67  0.38  
   24306     24485  complement  fr 2   0.73  0.15  

   24559     24897  complement  fr 3   0.56  0.10  
   24559     24894  complement  fr 3   0.56  0.04  
   24559     24837  complement  fr 3   0.65  0.71  
   24559     24900  complement  fr 3   0.56  0.21  

   24897     25088  complement  fr 2   0.63  0.65  
   24897     25151  complement  fr 2   0.51  0.01  

   25069     25485  complement  fr 3   0.83  0.07  
   25069     25671  complement  fr 3   0.80  0.26  
   25069     25431  complement  fr 3   0.81  0.38  
   25069     25398  complement  fr 3   0.80  0.05  

   25671     26414  complement  fr 2   0.55  0.37  
   25671     26405  complement  fr 2   0.56  0.07  
   25671     26378  complement  fr 2   0.56  0.27  
   25671     26351  complement  fr 2   0.55  0.52  

   26602     28296  complement  fr 3   0.83  0.04  
   26602     28134  complement  fr 3   0.88  0.29  
   26602     28119  complement  fr 3   0.88  0.22  
   26602     27981  complement  fr 3   0.87  0.18  

   28362     32708  complement  fr 2   0.84  0.34  
   28362     32477  complement  fr 2   0.84  0.31  
   28362     32471  complement  fr 2   0.84  0.09  

   32831     33265  complement  fr 1   0.57  0.26  
   32831     32941  complement  fr 1   0.62  0.47  

   33267     33725  complement  fr 2   0.70  0.38  
   33267     33563  complement  fr 2   0.71  0.57  
   33267     33491  complement  fr 2   0.66  0.06  

   33703     35424  complement  fr 3   0.56  0.12  
   33703     35055  complement  fr 3   0.57  0.27  
   33703     34905  complement  fr 3   0.59  0.09  
   33703     34761  complement  fr 3   0.60  0.45  
   33703     34743  complement  fr 3   0.60  0.25  
   33703     34461  complement  fr 3   0.54  0.10  

   35819     36136  complement  fr 1   0.79  0.71  
   35819     36088  complement  fr 1   0.81  0.25  

   36123     36881  complement  fr 2   0.60  0.72  
   36123     36872  complement  fr 2   0.61  0.90  
   36123     36866  complement  fr 2   0.61  0.73  
   36123     36851  complement  fr 2   0.61  0.59  
   36123     36827  complement  fr 2   0.61  0.19  
   36123     36743  complement  fr 2   0.57  0.06  
   36123     36455  complement  fr 2   0.58  0.30  
   36123     36449  complement  fr 2   0.58  0.31  

   37564     38409  complement  fr 3   0.78  0.47  
   37564     38220  complement  fr 3   0.74  0.24  
   37564     38211  complement  fr 3   0.74  0.07  

   38518     39261  complement  fr 3   0.69  0.06  
   38518     39198  complement  fr 3   0.70  0.68  
   38518     39186  complement  fr 3   0.70  0.48  

   39288     39530  complement  fr 2   0.53  0.93  
   39288     39416  complement  fr 2   0.51  0.23  

   39527     39826  complement  fr 1   0.82  ....  
   39527     39772  complement  fr 1   0.81  0.29  
   39527     39643  complement  fr 1   0.69  0.26  

List of Regions of interest
(regions from stop to stop codon w/ a signal in between)

   LEnd      REnd    Strand      Frame
 --------  --------  ----------- -----
      207       851  direct      fr 3
      700      1521  direct      fr 1
     1554      2429  direct      fr 3
     2390      3346  direct      fr 2
     2447      3163  complement  fr 1
     3161      4678  complement  fr 1
     4688      5311  complement  fr 1
     5251      5676  complement  fr 3
     5319      5579  direct      fr 3
     5663      6415  complement  fr 1
     6348      7979  complement  fr 2
     7078      7200  direct      fr 1
     7198      7323  direct      fr 1
     7525      7767  direct      fr 1
     7972      8508  complement  fr 3
     9313      9489  direct      fr 1
     9471     10076  direct      fr 3
    10001     10528  direct      fr 2
    10914     11648  direct      fr 3
    11585     11959  direct      fr 2
    11984     12241  direct      fr 2
    12219     12602  direct      fr 3
    12553     12843  direct      fr 1
    12777     13067  direct      fr 3
    13048     13257  direct      fr 1
    13251     13436  direct      fr 3
    13357     14940  direct      fr 1
    14882     15166  direct      fr 2
    15159     15518  direct      fr 3
    15481     17244  direct      fr 1
    17231     17839  direct      fr 2
    17763     18272  direct      fr 3
    18244     19128  direct      fr 1
    19106     19522  direct      fr 2
    19512     19700  direct      fr 3
    19672     19998  direct      fr 1
    19989     20177  direct      fr 3
    20147     22189  direct      fr 2
    22268     22534  direct      fr 2
    22425     22967  direct      fr 3
    22965     23672  direct      fr 3
    23917     24072  complement  fr 3
    24062     24331  complement  fr 1
    24306     24584  complement  fr 2
    24559     24906  complement  fr 3
    24897     25154  complement  fr 2
    25069     25680  complement  fr 3
    25671     26456  complement  fr 2
    26326     26604  complement  fr 3
    26602     28383  complement  fr 3
    28362     32828  complement  fr 2
    32831     33280  complement  fr 1
    33267     33731  complement  fr 2
    33703     35853  complement  fr 3
    35819     36163  complement  fr 1
    36123     36884  complement  fr 2
    36874     37461  complement  fr 3
    37564     38520  complement  fr 3
    38518     39348  complement  fr 3
    39288     39533  complement  fr 2
    39385     39624  direct      fr 1
    39527     39898  complement  fr 1

POSSIBLE SEQUENCE FRAMESHIFTS DETECTED
 From   To
 Frame  Frame  At base...
 -----  -----  ----------
   1      3        24060 +/- 11 bp  (complement)
   3      2        26400 +/- 11 bp  (complement)


Protein translations of predicted genes

>Translation: 441..851 (direct), 137 amino acids
MHCDIAVLDDVVVFENAYTNEGRNKVKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLM
GMEEDIYSKEGELVGKENIYEVMEKAVEDNGDGTGEFLWPRQLRKDGKFFGFDVQILAKK
RGQYLDRVQFRAQYYQ*

>Translation: 820..1521 (direct), 234 amino acids
VFSLEHSITNDPTDPDSQPIAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVDFAYSVS
KRADYTAIVVIGVDSENNVYVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAECTAAQS
AIVSELKDNYIKPNGLALKIDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRGGNCQVL
EEELVSYNPAHDDCKDCLAAAVEVAVKPSSAVKRKRSQDNNVVFHPKFGGVAF*

>Translation: 1581..2429 (direct), 283 amino acids
MAGTTIDIESMIDPHSLAVEIANRWTSWNNARSEKVKEWKELRNYIYATDTRTTSNNKLP
WSNSTTTPKLTQIADNLHANYFAALFPQKRWFRFEATDADSDTKIKRSIIQAYMQNKLRQ
SDFVNTTSKLVNDYIQYGNCFATVDFERKVTKYEDGDRIVNYVGPKVVRISPFDICFNPL
AANFSDTPKIVRSVLTLGEIQRMVENDSSKGYMADIFNKMLGNRGSARGNEVDINKSEGF
VADGFASLTDYYESDYVEVLTFYGDIYDKVRVSSLTTVSLLL*

>Translation: 2531..3346 (direct), 272 amino acids
MGPLDNLVGMQYRIDHLENLKADVFDQIAYPVLKIRGDVEDFDFEPNARIYLGDEGDVGY
LVPDSTALNADFQIQNIEAKMEMMAGAPREAMGIRSAGEKTAFEVGQLMTAAGRIFQHKT
AHFERVFLEPILNAMLETARRNMDYEDTAKVLNEDTGLYFFTQITRDDIKANGKIVPMGA
RHFAERAQRVQNLTTMYQIKASDPTVAAHLSGKEFARLLADELGEPALFKENVSVSEQMG
KTVFTVVMRWVVGNKFLFQNLTVTTTVVIHL*

>Translation: 3161..4507 (reverse), 449 amino acids
MVAYRVAWELTKNPTLRVLYISATSNLAQKQLSFIKNIFESDIHQKYWPEHLHKDESKRE
KWTTSEIALDHPDRKKEAIRDPSIFTGGLTTSLTGMHCDIAVLDDVVVFENAYTNEGRNK
VKSQYSLLSSIEGSEAQEWVVGTRYHPKDLYSDLMGMEEDIYSKEGELVGKENIYEVMEK
AVEDNGDGTGEFLWPRQLRKDGKFFGFDVQILAKKRGQYLDRVQFRAQYYNDPTDPDSQP
IAYEKFQYYDKKHLTRDGGQWFYKGQKLNVSAAVDFAYSVSKRADYTAIVVIGVDSENNV
YVLDIDRFKTDKISEYFRHILDLLNRWDFRKLRAECTAAQSAIVSELKDNYIKPNGLALK
IDEHRPNRHQGSKEERIAAILEPRYDNLQMYHYRGGNCQVLEEELVSYNPAHDDCKDCLA
HLFTDRYVLFEECWLTKFVCKKTGKLFA*

>Translation: 4688..5254 (reverse), 189 amino acids
MTLHLENISFVAVPKTATIAIEKAFSPYARGYTPHTHEPVNVVKRDGGQDCLGLIRNPHD
WIKSYYLYLKHSPYFYSASSAWGIGEKTFEEFVGRFCDGHRLWPEPMRLQSMYLLRNGVS
TDFIYRFENLLDAVAHLGAACGQMPKMSRHNVSPKCDVILSKKMTYLFEDAASEDFDLYE
SGADGVMT*

>Translation: 5251..5661 (reverse), 137 amino acids
MKYKSARYINSTKIDCEIEHPVHGWIPFTCDPSDAGSEFDVAALHAEMAANPETLPYVPP
TEAEILYEQTSEARYYRSQLLTEIVDPVATNPLRWDDLSQERKDEVSAYRRALLNITDQA
GFPSDIVWPEVPSFLA*

>Translation: 5663..6256 (reverse), 198 amino acids
MLEVTSDGSMVYQRYTTYNSVHTVYVRTYYSGTWYNWAKQWDNLNDGSGSGLDADLLDGQ
QGSYYYPASNPNGYTAYTNSDVDTHLNTGTATTDQVLSWNGTDYDWIDGGGAPPTDLHAV
GTYTVACTGVNVAQGGTTAGSNLRYRNIYALDRFTVLTTTSSTVTLTGTWRNVGGPVQYV
SGRGGGQRGVSLWVRIS*

>Translation: 6348..7970 (reverse), 541 amino acids
MAKKPTVTTLQSGFNSTEVLNENFENIRDAFDNTLSLDGSTPNAMQADLDLNGNNLIGAT
GLLINGTDYLADVEAAKAAALVAQAAAELAENNAETAEVNAEASETAAGLSATAAATSAT
NAGASETAASASATASATSATNSATSASQAATSASAAAVSEGNAATSETNAANSATSASG
SASTATTQASAASVSATNAATSASNAATSATNAAASQAAAATSETNAAASESTVTTSATN
AATSEANAATSASTATTQATNAATSASTASTSATNAATSETNAASSASSAASSATSAQAS
KDAALAALDSFDDRYLGQKVSDPTLDNDGNALVAGALYFNTTDGIMKVYDGSVWLAAYAS
LSGALIATNNLSDVLDVTASRTNLGLGTAATTASTDYATAAQGALAVSAVQPNDSPSFGS
VTVTGTVDGRNVAADGSKLDGIEAGATADQTASEILTSIKTVDGSGSGLDADLLDGIHAS
SFLQGNQTITLSGDVSGSGTTSILVTVADDSHNHIIANVDGLQTALDGKWSKGADIGGGC
*

>Translation: 7972..8463 (reverse), 164 amino acids
MAKKPSKMWFYETTLPDTRNEFTNYSLKKKDHKVGDKTYLSLHKIYIEMEDPTEYEFALS
VFGDYSVWENLCNLSWFKTHHVQMQKELMLKLKARTIRNMINDLEEGKASYNAQKYLADA
GYLDNGSKKRGRPSKDELDGALKQAALEKAETEDDATRIGLMN*

>Translation: 9334..9489 (direct), 52 amino acids
MHKYTVTVYDELEGGVDVLTVEAKTIDSVIYIVESNPKHEYEIIEIERLDQ*

>Translation: 9486..10076 (direct), 197 amino acids
MTNHVRNILKLYRQATQEDTINGVEWYARAERMAKAIASDAGLPLPTVIGVMAALSPNNR
WERNCRDAATMCKAWQNGDSMDSFKVSCYNTMKAKAWAILDLGLTDDEDILSHLNGQKIR
SFYSNIRGLDEVTIDGHALNIARGKREGLTSDKTNMGKREYRELQAAYVRAAKRVKVKPH
VLQAITWTTWKRVHNI*

>Translation: 10100..10528 (direct), 143 amino acids
MTNTQHKNILKHLKTAKGLTVREALIEYSISSLTKRVHELRGLGYDIESVRKKHPCNRSK
IYTLLSTRGARAMSMCGEIENTENAIKSLRAKEFNIQFNSIGLTDKERLTIVNTIREELL
EETVRLNKLLELANKAEGSHRA*

>Translation: 10831..10980 (direct), 50 amino acids
MLPVTLEEHLITMGLIPLSSFEELDTVVNPTGQEEDLPAYNEETQEYMF*

>Translation: 10980..11648 (direct), 223 amino acids
MTQIEATYIDHMGSDLSVVNAARVSFGKKSEWVYCGQSDGRDKGLSGRDTKLIKYLAKHK
HISPFGHAFASFHVKAPIFVARQLVKHKFLRWNEISRRYVDDEPEFYTPDVWRGRSADKK
QGSDGVVNPEYNPQYLDNKIKFAYLQALDIGISPEQARMLLPQSTMTEWYWSGSLDAFAD
MCRLRCKEDTQYESRVVADQISEKMADLYPVSWAALMEGEKQ*

>Translation: 11645..11959 (direct), 105 amino acids
MTNRKFTPPTEFPAEYVDGFGCKVTILGRSYYNKERPLVGFDDEGCACNYAENGAYWPDD
GGKYDLHDIQKRITTWHNVYEGWVGASNKVNRGVTENRLCVIPH*

>Translation: 12008..12241 (direct), 78 amino acids
MTDGNTYAINKHLDEREDYDALQEVTAELERAEARIEELEKELAGARADWFQMVFRAIDL
KKEVSTTLAELKGKTDE*

>Translation: 12234..12602 (direct), 123 amino acids
MSEAPERIWAWWDDAYDVGLLNKHGDKRYTPNDAKEYVRADRIEELETQLSEARQVGKEW
FEEQKKARKDANAAEAKLAKAIDFVAGVAGGFKFGESLVDWANLHVLKARTLLAELKGET
DE*

>Translation: 12595..12843 (direct), 83 amino acids
MSDYKQTSYAAGIITSRYSDMLKRLGDDYEDEMVVVSLMKYYNLCLDNDDPTILEAIERV
LEDYMCTADYSAWLLGRNKCLP*

>Translation: 12831..13067 (direct), 79 amino acids
MFTVEFEPECSVITCLDNHDAFDDVEVVVCEDNSVFIVQHMLDVDDTNIIYMSYQQLTEI
AAALDMPEGAYILGNKGD*

>Translation: 13069..13257 (direct), 63 amino acids
MIYGLYVFAIMYSLGAILLLAITEAAEEDNPNADLNLALLWPYIAVRVILERIVNGPYKD
DE*

>Translation: 13387..14940 (direct), 518 amino acids
MDYEHKKHQPCQSCGSSDGAYPYEDGIYCHVCKTKTFNDDGDTQVTEQKTLPPIRGKIMA
VPSRGLNKATAEKYRALTSDDKISLLYTTEGKVTSFKERGLSEKTFKFNGPAQTDLFGQS
AFSKGGKSVTITEGEFDAMAAYQMLYMSEPCVSVINGSSGAVKDCKRNYEWLDSFEKINI
AFDNDKAGQDAAMAVAELFDPRKVRLVTMTLNDPNDYIKQGRERDFIDAHKKAAPFTPDG
IIAGSSLYDLVSTPPEYDCVPYPFSGLNRMTKGLRTGELITFVAGTGVGKTQVMREILYS
LIQQDKGNVGTLFLEEPVRDTGLGMMSIQAEKPLHLTDTIYTKEEFDNAYENTLGTGRVF
LYDSFGSNSVERIVSMVRYLARSCDCKFIILDHISIVVSDHAKDERKALDEIATKLKTLT
VELDICLLMVSHLNRDKNRKPPEEGGTINLQDIRGTAGIGQLSNIIIALERNTQADDELE
RNTTRVRVIKNRFTGETGVADSLSYSRHTGRLKSYEG*

>Translation: 14945..15166 (direct), 74 amino acids
MEDRYYQATGYVALSSVGGVWDEVGEPCETLEDCQEVLQEYKDGIDEYGEEPVYFDIPYE
KYRIEYRTVKVVG*

>Translation: 15171..15518 (direct), 116 amino acids
MDLWDEYGMLKREFNTSDLSDGNYCLRIEGETGNLYVFDKRDNSVVHVWEGFFWNWLWIQ
NDSDDFHKMQHFSYLAGYTHAMLDMLYEVEDLDEEQKLDIRDLIKEKSLTYGSSF*

>Translation: 15502..17244 (direct), 581 amino acids
MEVVFDIETDALDATVIHVLVAKRVGQKGFYVVRDAETFKRLAKQVTLWIGHNVIGFDIP
QIKKLWGYGIPLKDVADTLVMSRLLDPTRKGGHSLDALSGNEKIDFHDFSTYTPEMLAYC
KQDVAINEKVYLQLKEELSNFGKASIQLEHQMQAIVCEQEKNGFMLDTDIAEEIYTTCLR
ETNRIEAEIKEFMVPIAVPVKEVIIKRKKDGSIYSNQLLEGCNVQGDYTKIAWEEFNLGS
PAQVNKRLDRLGWKPTVKTKSGNSYKICPENLATIPDTAPEAVKGLKAWKVLETRWKLAQ
EWLQKSQETGRVHGRVILTGAVTHRAAHQGPNMANIPSVPHGKDGILWKMEGMYGAECRQ
AFKVPEGKLLVGTDAAGIQLRVLAHYMNDPIYTEQVIDGDIHTFNKEALGRYCKDRPTAK
TFIYAFLLGAGTGMIASILGCNNRQANEAMANFYEAIPSLKKLKSQASQAASMGWMKGLD
GRVLRIGSDHLALSVYLQGGETVIMRLANVFWQRQAKKEGINFKQCAWVHDEWQTEVDED
QAQRLGEIQVQAIKDAGTFFKLNCPMDGEAKIGKNWLETH*

>Translation: 17261..17839 (direct), 193 amino acids
MYNNQTDQRHKEINMADKKIVLKNVEVSWAKLLEAGLKYKSETEYEFSVAVKANDQLRDL
MKSFKLNKQFKSKDSTFDGEEFIQLTLDTRTKSGWVRHGEVFDEFGDPTKDLIGNGSKMN
LFVSIGQSSYGNIIKLGHLEDMDMESKEMMFHFGQVMELVPFEAPSAVIKKEKEINESVE
AMAEDEMEIPFG*

>Translation: 17832..18272 (direct), 147 amino acids
LVDYLVTKKGDELEGLSLPLHSIEEEGVVLLYDGDLFLYSHDDVEEDHYADQGPLPEDAE
ARKNIPVYSGFFKYFPDAIVAVSHLSLIGGIQHGQTRETLHWDRSKSTDHTDALLRHLLE
EDWAAVAWRALAQLQKSIEEERKYDD*

>Translation: 18265..19128 (direct), 288 amino acids
MTEETKSIDTLIDDIYSVFTDGYSKSPDNEKLIDAFGEAMKGLMRSRLTPRESSGGTLRL
SAIGKPARQLWYDSRGVEKPDFTGDQLLKFFYGDVIEEVLLTLAKLSGHSVTNEQQKVVV
AGITGHMDAVIDGHVIDVKSASATAFKKFDQGSLVFDDPFGYMHQIAAYSEAVEGNKGSG
FLAMNKVDGKLTLFQPDPDFLPDTQERVDYLKEALASDTPPERCYEEVTETNGNKKLPMG
CAFCSFKKECWKDANDGSGLRGFGYSFGTVYLTHVEKAPRVGEVDVE*

>Translation: 19118..19522 (direct), 135 amino acids
MLNSKSSTRKRALKAGYRSGLEEQTAKDLKKRKVLFTYEETKIKWLDSKVRTYTPDFVLP
NGVIIETKGRFVAADRRKHLEIQKQFGTLYDIRFVFTNSKAKLYKGAKSSYADWCNKHGF
LYADKTIPEDWLNE*

>Translation: 19515..19700 (direct), 62 amino acids
MNEDLVTRIVDRFSIAELAEVVGITPRMFIDAFEDEILDNLTALADIDQGFVIEDDDDEY
E*

>Translation: 19693..19998 (direct), 102 amino acids
MNDYQQQAVETAIYPSTAQVTYPAMGLANEAGEVLGKVKKIIRDGTFNRDDIADELGDVL
WYAAALARDLNTDLSAIAQRNLDKLASRKERGTLKGSGDKR*

>Translation: 19995..20177 (direct), 61 amino acids
MTWFWRYINYLATWREHRKVIKQLNKFSDRELRDIGITRGDIDRLVWLKEDKDASGRETK
*

>Translation: 20174..22189 (direct), 672 amino acids
MNNYQEVSTRAEVVTRRTYNRPLNDEGTVFETWEQTVDRVIDHQRWLWERQQRAELNEKQ
SAELTELRQLMLDRKATTSGRTLWLGGTDVAKKREASQFNCSFGNVETVHDVVDAQHLLL
QGCGVGFYPSVGILSGFTAPVEVQVVRSEKQKSDPKGAEDNVEAFYQEDGKSVWEVVVGD
SAEAWAKSFGKLVAMKKRVDIIRLNYREIRAAGMRLKGYGWISSGDSDLSVAQTAICGIL
NDRAGQLLRHMDILDLLNHMGTVLSSRRSAEIAVLPVTNPEIDAFISAKKDFWLHGNEHR
QQSNNSIMFDTKPTKWELSYIFDKIVEAGGSEPGFINAEAARKRAPWFKGVNPCAEILLG
NKSFCNLVEVDWGKFLGDHGGLERAVWLAARANYRQTCVNLDDGVLQRSWHELNEFLRLC
GVGGTGIVKFLDHYTGRNNVASMLQSLRAQAHKGANSMAEEFGTPMPKAVTTVKPSGTLS
KIMDTTEGVHKPLGKYIINNITFSKDDPIIASLEAAGYKVFDKPFEPSSKLVAMPVCYDD
VSFDEVETDRGTVHVNLESAVEQLDRYKLLMDNYVDHNCSVTISYDPSEVPAIIDWILTN
WDTYVGVSFIYRNDPTKTAEDLGYAYLPQEVITEEAYVAYTSQLKPLDLSGLTSDDDLVD
EACATGACPIR*

>Translation: 22298..22534 (direct), 79 amino acids
MQKYIMLTQDNCKYCTAALGLLESLGHKVEVYDVQDENMSLVVNMLRVTNTVPQLYAPDG
QFIGGYTELKEYLNDSST*

>Translation: 22518..22967 (direct), 150 amino acids
MTVVRKQFDRALYEAYDTPARDALVGYLELKGHIIVNNEENFNVDVVSQKGGYTYFNEVE
VKTAWKGDWPTHWSEIRIPQRKQRLLDKHVDDGKSVLNFYIFRPDFKQAWRIKDTLLTQE
SLKEAKGRYIQKGEKFFHIPFTQAELIKL*

>Translation: 23274..23672 (direct), 133 amino acids
MPWALPVLDVLEKHLGKGAVETGIKNGNIEMAPLALMRGRSFDNAFIIVDETQNISTHEL
KMLLTRVGEGTTIVLNGDAQQSDLKEADGLSKVIHLAKKHQLPVPIIEFGVDDIVRSDIT
AMWVRTFLKEGL*

>Translation: 23673..23861 (direct), 63 amino acids
MFTAIILMCTLDGGLCRAVAHPVVVNDLISCTLLLGEGVKKVEEQAGWKVASYRCLPWGE
PT*

>Translation: 23917..24057 (reverse), 47 amino acids
MDNKATMGVLFAALLALLGWNISTTHELTLQVQKLEIILLNDAFSK*

>Translation: 24062..24316 (reverse), 85 amino acids
VPSKRLDKDKMKCNKPRTTPDHPTKSHVVKACDNGKEKIIRFGQKGVKGSPDGTPRNKAF
KARHAKNIKKGKMSAAYWADKVKW*

>Translation: 24306..24566 (reverse), 87 amino acids
VADNQWHLSKSVPVTFILAIVMQTIALVWFVASLDSEIESNTRELVRHETRLIALEASVQ
AQAVAMARIDENINAIKGMMERDRAK*

>Translation: 24559..24897 (reverse), 113 amino acids
MVRFIVCLSLVLFLGGCLNPMSLLGGGGPNVAANVQAGAENNQTGVQVGDITKAETVYSG
VAPSGSVGSLNISNQDIPIWVILLLILGWVLPSPQEIWRGFLKTITLGKYRG*

>Translation: 24897..25088 (reverse), 64 amino acids
MRVQKRKKFVEARKGGNAEDALSVLFSAGPVEVASILAYAKQEELKEKGKPYETEMVINL
GKL*

>Translation: 25069..25671 (reverse), 201 amino acids
MASSWNLAAAQTSSAVEDVASSGTSLWQSVVDGAAEALASGKEAIIDVAEDVYDYLPEQE
DVMDAAATGVEAAGSVLPAIMDTAQAAGEGLMGSGLSEFRFLGSNFFDAGGKFTEKDLNV
SDISALGRAVKKAKAEGRSNVDYNDFGTAEGEVLKGGILAGVFDPDLRMARTVGGFKFEE
DAEGNTILRNTYNFNEGPKT*

>Translation: 25671..26453 (reverse), 261 amino acids
VVKKIHLLSKGPQMKKVSISITVYLVVLILLIPQMEKLTESLNMTSAWNKGQTLSKDMPD
YKPELLNLSSYQAKASPSTGGATYQAGAGTTPTSATSGITALQPTSTTQTTLRKVEAQSY
DTLYGNFERQDTPFKDVSVSSMTIGELADFSRASGEYGRYVKPRLPKNTYAYKKGLTSTP
MGKYQIVGSTLRDLTRRMNLPADTVFNKETQDKMFLFLAKEAVGGGKTSSEKRSRLRGIW
EGFKHVDNNTLNKVIAEIEQ*

>Translation: 26602..28359 (reverse), 586 amino acids
MAFTLDQNVSGAASAPQPVQSMSGTSIAASALGGLLGAAERSAKLRAGSGPTQTDRNNAV
FVENLRNVNQDLASGMSPDQVASKYGATFANLSLNSEQKAVLTKTIGEDIFYVPKQVETP
VDTATEMFNNTDETVRLGLIDIETKKAEAEGETISQDVAAERARNNYAAFQVAANAGILQ
GNIDFSTGFDQNMKTLDSFIKAVSAGLQVEQRGENLSLETLRQLQDGFLLLKSNPSFQKP
TGKAALEQWALMETRLQSIEDVFTRLEDYDAKGATAKAKQLMGVISLSGKSPLSALAAKD
SDFMIKMAATIADDITADLANMGEIPEVDHSALNFDPVILELMGVAPSGQQTGDVPEGGF
DIPPSPFPTELGEKYEGMKIVEKAKFRAYHRSAITGLEAGSLDSPEAINAYASSVTALSY
DLTQNDKPSSQNFDVLFSNRNIARVNALQAQGGEAGRVAGNLRAQMGAALQHNQARYGVL
AAGLVSRIPQVKIDETNGKFTLDVKDDPVLQEIAAVVSVYYGGDFEALWAEGGTAKISLK
NRLAREGKITPDSPEFERFDAATKVLESSLWKGLASRYNTVRGIP*

>Translation: 28362..32816 (reverse), 1485 amino acids
MDSIASFEELEGGLEEETTQSKSKPIDPNSPLEIETVEAQAYLLDEDPQVISETRANQDF
THEGLAKQYPDLTRYLDTLYEAGVSVEEAGRLAREHVERKAIAVAPQEFIFSSMLMVDDE
TVNPESLRVLTNYEKISSRISKRLEENDPSTFKWLTAGALNTARDFTVGVLEMAIRRDSS
LSQKYADSLFMEEDEFNDFWDKEIADAEAKGLFNIREYESLKELQALVDNFGTDTDAGFN
QLLALADIATLGGTKTVGRLASAGVKKAATGTRSVVSDLLASKTVSEAVTATKGIAEGGK
ATVKQLNSARPSPAVAYKAGPSTMDPKPSVNTPSAAVVTEGTKRSMLFEKMAEMLSSPFA
GKTFTTKSLAEATTEVADRLVAQSTNAFVKVSRRRAEGSDNYIYTALLGKADTGKPFTTK
AAAMKAVKDDPRYKAVRLNSDRTKAFGVDEDKRGWYLEYSERVDTSRLATEIEDVNVEEG
FLKRSAASLFSAGQTALGPRLGFMLNAAEGLVARVSKEADVAFKDISKLSKGESEELNKI
ITSYRDSPLGDTELDLAAQRGAPSSAKFEQDFMAVNGRVPSEQQMKAYRALVDFNNSSWN
VKATEILKKVTERGGWTVDVSEGYTTIGVPVTVADDAVVFSRLQGSVRGSAVGDRIVYKL
DEPFEDANGNFFEYVTDVADARVPQKSDVLGYNFGGSRNNETTNFFVGTLFDVEFAGGKK
GKGGFRSLLGSYSSKEATSAARELNNIQETIGGFLKATGLKSISKLELSGEDLDRVNDII
SRNNKWNPSVENFDDLKDLSQRHKESFANKFEVKRRDQQVDTEITDGYGLSVGDYQSRRV
ARKRGDAPLMEYGGGRVSNQDPITNILEQFQSTAYRYTHSKATQAAVNGWVDKARRMGNV
TFDGPVPSDPNDFLRRAKIKGSEGVDADMAEQQAVIKRRMGLEERTDKENNFFIRMGQHI
YDEGVFMGKGKGIKTNPEDWLDGAAGRVRAMVFHMKMGIFNPDQLVLNASHVAQIMAISP
KAGLKATAAVPIIAQLMRKTSKAAAKDIDALYANGFAGMTKEELLATVKYMRESGRDIVG
TSVLERSGTAFDPKKNAASEFLELGLTPYKMGELFGRIASVATAVVEHNAKKISDDVFSE
AGLQYVANREQVLSFRMTSGQKGAYQEGAIMGLATQWMSYTNRFVDNILIGRDLTKAERA
RMVGVNTVLFGTRGMGFSPKMTAALVAFGVDPEDENSTATLNAVKFGLFDFMLSQLVGED
VSLGSRIAPMGGIVQQYSELFREDPLYATLGGPSAQIGFDTYKAIKATLETITGGHKQIA
VEEFKVLMRNVKSVDIYAKVVELIETGEYRSKRRGLAGEFDEVGTGLAASVLGGATPMKV
LNHYDAKDISYKEDAKFKDARRRIDTWATKALDLISTGEPDKMKQGRELYNDAMNLIEDG
GFSEENQTKLYRAVVNLETMTDLVKRATGQSKASQLTAKASQGE*

>Translation: 32831..33265 (reverse), 145 amino acids
MVAVTGTVLAIAGTTAGIVGTVKAGKAARRSARAQQQAQEVQAKRQRRAAIRSNILASAR
AKASAQAAGTSQSSGLSGAIGAGRSQLGAELGFGSQLSGLSANISKFDMQAQTYGDIAKL
GFSAAGNAPTIGGYAEDLYGFFKK*

>Translation: 33267..33725 (reverse), 153 amino acids
VLKTTATKIRTATPDDVFDILILAKEFSKEAPQSHKWNKEKTEQFILSALQNTNMTIFVI
DVDGEIEGALVGLLSELYMSYTVQATELAWFVSKDYRGKPASLKLIKAFEKWAKESGAKQ
IGMGDIEGISSLEKLYNRLGYERAETVYLKEL*

>Translation: 33703..35850 (reverse), 716 amino acids
MPQSLTQRTVNTFIKGLITEASELTFPENASVDELNCSLGRDGTRRRRKAVTLEDNHVLS
DVVVPEGALVQTLDWYNVAGQVNLEFLVVQVNNILYFYEKSTDPLSANKYSGSVDLNTHS
ASNNLSPSEERVQVTSLNGYLIVASPAINTFYLGFNTSTEAFTATSISFKERDFEWQGSD
VDVTSLYFGEGTSVSNQRIYDTYNVGWVGPKGSAALNTYGSYIVYPALTHPWYSGKDANG
AFNKADWLEIYTGSSLASNGHYVLDVFNKARTGLTTEVETGRFRSVAAYAGRVFYAGIDS
AKNGGKVYFSRLTERMSDVGNCYQVNDPTSEVLSDLLDTDGGVVRIPDAHNIRKLHVLGA
SLLVFAENGVWAVAGVDNVFRATEYAITRISDVGLSNENSFVVADGIPIWWGKTGIYAVQ
QSENLNTPTAQNLSLSTIQTLWNNISNAKKAQVTVEYDKINQRVFWFYPDNDESVDYKYN
NILVMDLALQAFYPWRVEDEASSTSYIIGTSYYGGLGSTSTETQVVNGADVVVNGSDNVV
ATLYRDYLEGDSEIKLLVRDGTTGKMTFATFRGDTYLDWGSADYKSFAEAGYDFMGDITT
FKNAPYVTTYMRVTEDGYVASGAGYEFINPSSCLMSVSWNLSKSGSTPREIYKLKDVPVV
NPNDLSSINYPTDTVVTKSKVRGRGRSMKFRFESVAGKDFHLVGYEVIGAKNNSY*

>Translation: 35819..36136 (reverse), 106 amino acids
MAEIEVSKSEDGQTYVVKSDKTEKALVVYKPQDGFKFYAVKYENGAPLPAGLQGSWTGAA
GALAAVKLHLAHKKPTARKAVNDRAKERQAEKEKLNATKPDPENG*

>Translation: 36123..36872 (reverse), 250 amino acids
MKMTLLEMVQNILSDMDSEEINSLVDSNEAEQIAKVVENTYFAMIATRFIPEHSQTLKLT
SFSSSARPTHFSFPTRVKNIEFLDYNVSKAVGGVEYRRLKYLSPDEFFGLSDGRDSLASN
VKQVADVGSDSILLIRNDAMPMYYTSFDDDTVVLDSYDASVDAILTSAKTRAYGVKYPTF
DSFSDTFVPDIDDTMFPFLLAEAKSTAMSLFKSGADPKIEQTARRQKVYVQNDMHKVNTG
RAKNNYGRN*

>Translation: 36874..37446 (reverse), 191 amino acids
MANVEHSSLTGSALHEPKGTATANSGEVYVANGSGSGTWQPIHRHLGAATAFSSTSPYAY
TIDTDTVEKFLSFPVSSSHVEGFTVLTSPNLRFRYDDPTEITSLINVTMSSSQAGGVGHA
VQWALFKNGVEIVGSRAIRTISSGSWGSISVTGVTTLNQNDYIEIKTKADADNVDVNYAN
IYVSIIGMSA*

>Translation: 37564..38505 (reverse), 314 amino acids
MAGNTVATLALAKRAEVWSAELKEILRDELQGMKYVKWLDQFPDGDTFKIPSLGDATINN
YSEDTAVTYDPIDDAQFTFSITEYLQAGNYITNKAMQDVYYANEIMSQFVPIQERALMER
LESDIMALGGQQTVDNANTINGVDHRMLGSGTGGKIGVADFAKANLALKTAKVPQKNLVA
IVDPSVEFELNTLSQLTNVSNNPRWEGVVRDGIATGMSFIANIYGFDVYTSNYLKTEAAE
TIGGTTVNNAITNMFFSADQAVLPFVGAWRQMPEVDTEYNKDFQRTEFVTTARYGLKLYR
PENLATVMTAPLA*

>Translation: 38518..39285 (reverse), 256 amino acids
MSVFSEEQVTPATQSEQVSSFEEPTSPSVVNELVGEGRKFNDVEALPKGKLEADRFIEQM
KQENAALKADLEKQAYRLGVTEHLKETASASTAELSDPNNNIGGTADVANTKPSSSEADI
ESLVEQTLRKREQESVAKSNIALVESEREKAYGTEAAATVQQKASELGLPMAELQSMAAK
SPAAFMQLMGKPAPRSNPLVQGSIRTEGSTMQASSERDFGYYQRLRKENSSLYYKPSTQR
AMMADADRLGDNFYR*

>Translation: 39288..39530 (reverse), 81 amino acids
MKAAWFKDCKSKKEKEAVAQTLQSNRDSLDRLKEILEPMLKETTPAADYDSPSWAYKQAD
RNGFNRAVTTVLDLINLDKE*

>Translation: 39527..39826 (reverse), 100 amino acids
MGARHFAERAQRVQNLTTMYQIKASDPTVAAHLSGKEFARLLADELGEPALFKENVSVSE
QMETQKVATEAQIEFEAEQEEMVDQGSQELQPAPEEPLE*


Input Sequence
Title (optional):


Sequence:


Sequence File upload:


Use alternate genetic code:
      Mycoplasma (TGA = Trp)

Output Options
Email Address: (required for graphical output or sequences longer than 1000000 bp)


Generate PostScript graphics
Print GeneMark 2.4 predictions in addition to GeneMark.hmm predictions
Translate predicted genes into protein


Run 

Web pages maintained by GeneMark administrator, genemark-admin@amber.biology.gatech.edu. Please send any suggestions for improvements or problems to the web page maintainer.