Home UH Manoa

Publication Detail

Probabilistic prediction of protein phosphorylation sites using kernel machines.

Menor M, Poisson G, Baek K.


Menor M, Poisson G, Baek K. (2012) Probabilistic prediction of protein phosphorylation sites using kernel machines. Proceedings of the 2012 ACM Symposium on Applied Computing BIO track 1393-1398


Phosphorylation is an important post-translational modification of proteins that is essential to the regulation of many cellular process. The in vivo and in vitro discovery of phos-phorylation sites is an expensive, time-consuming and laborious task. In this preliminary study, we assess the viability of using our proposed probabilistic Classification Relevance Units Machine (CRUM) for in silico phosphorylation site prediction. We conduct a comparison with the popular Support Vector Machine (SVM) and the Relevance Vector Machine (RVM) that, unlike the SVM, has not been applied to phosphorylation site prediction. The resulting CRUM and RVM predictors offer comparable predictive performance to the SVM. The main advantages of CRUM and RVM over the SVM are:

  1. An estimation of the posterior probability of the site being phosphorylatable, providing biologists an important measurement of the uncertainty of the prediction.

  2. A more parsimonious model, leading to a reduction in prediction run-time that is important in predictions on large-scale data.

Furthermore, the CRUM training algorithm has lower runtime and memory complexity and has a simpler parameter selection scheme than the RVM learning algorithm. Therefore we conclude that the CRUM is the most viable kernel machine for probabilistic prediction of protein phosphorylation sites.

Link: http://dl.acm.org/citation.cfm?id=2231997