-----Messaggio originale-----

Da: GENSCAN at GeniusNet [mailto:genome@dkfz-heidelberg.de]

Inviato: martedì 18 marzo 2003 12:49

A: iacono@mailserver.unimi.it

Oggetto: GenScan Output Sequence

GENSCAN 1.0 Date run: 18-Mar-103    Time: 12:48:51

Sequence Noname : 8460 bp : 51.00% C+G : Isochore 3 (51.00 - 57.00 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..

----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init -    766    536  231  1  0   90  107   430 0.854  43.35

 1.00 Prom -   2059   2020   40                              -5.91

 2.00 Prom +   2474   2513   40                              -3.81

 2.01 Init +   2677   2749   73  0  1   38  115     1 0.029  -0.72

 2.02 Intr +   6408   6575  168  1  0   46   15   181 0.518   7.03

 2.03 Intr +   7204   7335  132  2  0   98  101   108 0.977  14.22

 2.04 Term +   8209   8345  137  2  2   47   54   181 0.851   9.09

 2.05 PlyA +   8450   8455    6                               1.05

Predicted peptide sequence(s):

>Noname|GENSCAN_predicted_peptide_1|77_aa

MPGVKLTTQAYCKMVLHGAKYPHCAVNGLLVAEKQKPRKEHLPLGGPGAHHTLFVDCIPL

FHGTLALAPMLEVALTL

>Noname|GENSCAN_predicted_peptide_2|169_aa

MLATRVFSLVGKRAISTSVCVRAHESVVKSEDFSLPAYMDRRDHPLPEVAHVKHLSASQK

ALKEKEKASWSSLSMDEKVELYRIKFKESFAEMNRGSNEWKTVVGGAMFFIGFTALVIMW

QKHYVYGPLPQSFDKEWVAKQTKRMLDMKVNPIQGLASKWDYEKNEWKK

Explanation

Gn.Ex : gene number, exon number (for reference)

Type  : Init = Initial exon

        Intr = Internal exon

        Term = Terminal exon

        Sngl = Single-exon gene

        Prom = Promoter

        PlyA = poly-A signal

S     : DNA strand (+ = input strand; - = opposite strand)

Begin : beginning of exon or signal (numbered on input strand)

End   : end point of exon or signal (numbered on input strand)

Len   : length of exon or signal (bp)

Fr    : reading frame (a codon ending at x is in frame f = x mod 3)

Ph    : net phase of exon (length mod 3)

I/Ac  : initiation signal or acceptor splice site score (x 10) Do/T  :

donor splice site or termination signal score (x 10) CodRg : coding

region score (x 10)

P     : probability of exon (sum over all parses containing exon)

Tscr  : exon score (depends on length, I/Ac, Do/T and CodRg scores)

CommentsThe SCORE of a predicted feature (e.g., exon or splice site) is

a log-odds measure of the quality of the feature based on local sequence

properties. Thus, for example, a predicted donor splice site with score

> 100 is excellent; 50-100 is acceptable; 0-50 is weak; and below 0 is

poor (probably not a real donor site).

The PROBABILITY of a predicted exon is the estimated probability under

GENSCAN's model of genomic sequence structure that the exon is correct.

This probability depends in general on global as well as local sequence

properties.  This information can be used to assess the reliability of

the predicted exon, e.g., it would be better to design PCR primers based

on a predicted exon with probability > 0.95 than one with lower

probability.