Table 9 Empirical occurrence rate of the identified sequon in PTMs of proteins

From: Ridge regression estimated linear probability model predictions of O-glycosylation in proteins with structural and sequence data

| Sequon^a (viewed from S/T/Y/H as "center") | O-GlcNAc glycosylated (% of sequences) | O-GalNAc glycosylated (% of sequences) | Phosphorylated (% of sequences) |
| --- | --- | --- | --- |
| N – P – S/T | 0.24% | 0.43% | 0.145% |
| N – ~P – S/T | 1.22% (0.5%, 2.8%)^b | 0.77% (0.5%, 1.2%)^b | 2.858% (2.8%, 2.9%)^b |
| ~N – X – S/T | 98.54% (96.9%, 99.3%)^b | 98.80% (98.2%, 99.2%)^b | 80.453% |
| ~N – X – Y | Not applicable | Not applicable | 15.748% |
| ~N – X – H | Not applicable | Not applicable | 0.006% |
| ~N – X – S/T/Y/H | Not applicable | Not applicable | 96.2% (96.1%, 96.3%)^b |
| W – S/T – W | 0% | 0% | 0.0066% (0.004%, 0.011%)^b |
| ~W – S/T – ~W | 100% | 98.94% (98.4%, 99.3%)^b | 82.3133% |
| ~W – S/T – W | 0% | 0.58% (0.33%, 1.0%)^b | 0.6422% |
| W – S/T – ~W | 0% | 0.48% (0.26%, 0.88%)^b | 0.4930% |
| W – Y/H – W | Not applicable | Not applicable | 0.0004% |
| ~W – Y/H – ~W | Not applicable | Not applicable | 16.2587% |
| W – Y/H – ~W | Not applicable | Not applicable | 0.1365% |
| ~W – Y/H – W | Not applicable | Not applicable | 0.1462% |
| Sequence count | 411 | 2,079 | 227,810 |
a: X denotes any amino acid. b: 95% confidence interval [44].
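
The table gives 95% confidence intervals for several of the occurrence rates and cites [44] for them, but the interval method is not named on this page. As an illustration only, the sketch below assumes a Wilson score interval for a binomial proportion. The function name `wilson_interval` is hypothetical, and the count of 10 sequences out of 2,079 is an assumption back-calculated from the 0.48% entry in the W – S/T – ~W row of the O-GalNAc column; it is not stated in the source.

```python
from math import sqrt
from statistics import NormalDist


def wilson_interval(successes: int, n: int, confidence: float = 0.95) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion.

    Assumed method: the source cites [44] for its intervals without
    naming the formula here, so this is one plausible choice only.
    """
    if n <= 0:
        raise ValueError("n must be positive")
    if not 0 <= successes <= n:
        raise ValueError("successes must lie between 0 and n")

    # Two-sided critical value of the standard normal (about 1.96 for 95%).
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    p_hat = successes / n

    denom = 1 + z**2 / n
    center = (p_hat + z**2 / (2 * n)) / denom
    half_width = z * sqrt(p_hat * (1 - p_hat) / n + z**2 / (4 * n**2)) / denom

    # Clamp to [0, 1] to absorb floating-point noise at the boundaries.
    return max(0.0, center - half_width), min(1.0, center + half_width)


if __name__ == "__main__":
    # Hypothetical count: 10 of 2,079 sequences (inferred from 0.48%).
    low, high = wilson_interval(10, 2079)
    print(f"{10 / 2079:.2%} ({low:.2%}, {high:.2%})")
```

Under these assumptions the script prints `0.48% (0.26%, 0.88%)`, which agrees with the corresponding table entry. That agreement is consistent with a Wilson-type interval but does not by itself confirm the method used in the source.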