COMPOSITIONAL ANALYSIS

The composition of the input sequence is evaluated relative to the residue usage quantile  table  specified with the `-s species' flag. Low usage in the 1% quantile is indicated by the label -- (e.g., Y-- means  that  the input  sequence uses tyrosine as little as the 1% least tyrosine contain- ing proteins in the reference set); low usage in the 5% quantile is indi- cated by  the  label  `-'  (e.g., L-); high usage above the 95% quantile point is indicated by the label `+' (e.g., A+); and high usage above  the 99% quantile  point  is indicated by the label `++' (e.g., LIVFM++). The usage is evaluated for all 20 amino acids, positive (KR) and negative (ED) charge, total  charge  (KRED),  net  charge  (KR-ED),  major hydrophobics (LVIFM), and the groupings ST, AGP (encoded by CCN, GCN, and GGN codons), and FIKMNY (encoded by AAN, AUN, UAN, and UUN codons).

A :164(12.9%); C  :  0( 0.0%); D  : 56( 4.4%); E  : 66( 5.2%); F  : 49( 3.9%)

G :100( 7.9%); H--:  2( 0.2%); I  : 50( 3.9%); K  : 77( 6.1%); L- : 66( 5.2%)

M- : 4( 0.3%); N+ :104( 8.2%); P  : 36( 2.8%); Q  : 45( 3.5%); R- : 14( 1.1%)

S : 71( 5.6%); T++:176(13.9%); V++:147(11.6%); W  :  6( 0.5%); Y  : 35( 2.8%)

KR     :   91 (  7.2%);   ED      :  122 (  9.6%);   AGP     :  300 ( 23.7%);

KRED   :  213 ( 16.8%);   KR-ED   :  -31 ( -2.4%);   FIKMNY  :  319 ( 25.2%);

LVIFM  :  316 ( 24.9%);   ST    + :  247 ( 19.5%).