I'm wondering how important the coef0
parameter is for SVCs under the polynomial and sigmoid kernels. As I understand it, it is the intercept term, just a constant as in linear regression to offset the function from zero. However to my knowledge, the SVM (scikit uses libsvm) should find this value.
What's a good general range to test over (is there one?). For example, generally with C
, a safe choice is 10^-5 ... 10^5, going up in exponential steps.
But for coef0
, the value seems highly data dependent and I'm not sure how to automate choosing good ranges for each grid search on each dataset. Any pointers?