COMPOSITIONAL ANALYSIS
The composition of the input sequence is evaluated relative to the residue usage quantile table specified with the `-s species' flag. Low usage in the 1% quantile is indicated by the label -- (e.g., Y-- means that the input sequence uses tyrosine as little as the 1% least tyrosine contain- ing proteins in the reference set); low usage in the 5% quantile is indi- cated by the label `-' (e.g., L-); high usage above the 95% quantile point is indicated by the label `+' (e.g., A+); and high usage above the 99% quantile point is indicated by the label `++' (e.g., LIVFM++). The usage is evaluated for all 20 amino acids, positive (KR) and negative (ED) charge, total charge (KRED), net charge (KR-ED), major hydrophobics (LVIFM), and the groupings ST, AGP (encoded by CCN, GCN, and GGN codons), and FIKMNY (encoded by AAN, AUN, UAN, and UUN codons).
A :164(12.9%); C : 0( 0.0%); D : 56( 4.4%); E : 66( 5.2%); F : 49( 3.9%)
G :100( 7.9%); H--: 2( 0.2%); I : 50( 3.9%); K : 77( 6.1%); L- : 66( 5.2%)
M- : 4( 0.3%); N+ :104( 8.2%); P : 36( 2.8%); Q : 45( 3.5%); R- : 14( 1.1%)
S : 71( 5.6%); T++:176(13.9%); V++:147(11.6%); W : 6( 0.5%); Y : 35( 2.8%)
KR : 91 ( 7.2%); ED : 122 ( 9.6%); AGP : 300 ( 23.7%);
KRED : 213 ( 16.8%); KR-ED : -31 ( -2.4%); FIKMNY : 319 ( 25.2%);
LVIFM : 316 ( 24.9%); ST + : 247 ( 19.5%).