A support vector machine (SVM) is a supervised learning algorithm that can be used for binary classification or regression. Support vector machines are popular in applications such as natural language processing, speech and image recognition, and computer vision.
A support vector machine constructs an optimal hyperplane as a decision surface such that the margin of separation between the two classes in the data is maximized. Support vectors refer to a small subset of the training observations that are used as support for the optimal location of the decision surface.
Support vector machines fall under a class of machine learning algorithms called kernel methods and are also referred to as kernel machines.
Training for a support vector machine has two phases:
Only the support vectors chosen from the training data are required to construct the decision surface. Once trained, the rest of the training data are irrelevant.
Popular kernels used with SVMs include:
|Type of SVM||Mercer Kernel||Description|
|Gaussian or Radial Basis Function (RBF)||One class learning. σ is the width of the kernel|
|Linear||K(x1,x2) = x1Tx2||Two class learning.|
|Polynomial||K(x1,x2) = (x1Tx2 + 1)p||p is the order of the polynomial|
|Sigmoid||K(x1,x2) = tanh (β0 x1Tx2 + β1)||It is a mercer kernel for certain β0 and β1 values only|