8 Experimental setup

As stated in a comprehensive review [37], valid dataset construction for training and testing, and unbiased measurements for evaluating the performance of predictors are two indispensable steps in establishing a statistical protein predictor. This chapter will focus on these two important parts for an experimental setup. Datasets construction and performance metrics for predicting single- and multi-label proteins will be presented.

8.1 Prediction of single-label proteins

This section will focus on constructing datasets and will introduce performance metrics for single-location proteins which are used in GOASVM and InterProGOSVM.

8.1.1 Datasets construction

8.1.1.1 Datasets for GOASVM

Two benchmark datasets (EU16 [46] and ...

Get Machine Learning for Protein Subcellular Localization Prediction now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.