python 双支持向量机

双支持向量机

原文:https://www.geeksforgeeks.org/dual-support-vector-machine/

支持向量机的拉格朗日乘子方程。该方程可由下式给出:

$\underset{\vec{w},b}{min} \underset{\vec{a}\geq 0}{max} \frac{1}{2}\left | w \right |^{2} - \sum_{j}a_j\left [ \left ( \vec{w} \cdot \vec{x}_{j} \right )y_j - 1 \right ]$

现在，根据对偶原理，上述优化问题既可以看作是原始的(最小化超过 w 和 b) 也可以看作是对偶的(最大化超过 a )。

$\underset{\vec{a}\geq 0}{max}\underset{\vec{w},b}{min} \frac{1}{2}\left | w \right |^{2} - \sum_{j}a_j\left [ \left ( \vec{w} \cdot \vec{x}_{j} \right )y_j - 1 \right ]$

凸优化的斯莱特条件保证了这两个问题是等价的。

为了获得最小 wrt w 和 b，这些变量的一阶偏导数 wrt 必须为 0:

$\frac{\partial L}{\partial w} = w -\sum_j a_j y_j x_j =0 \ w = \sum_j a_j y_j x_j \ \ \ Wrt \, b \ \ \frac{\partial L}{\partial b} = -\sum_j a_j y_j =0 \ \ \sum_j a_j y_j =0$

现在，把上面的方程放在拉格朗日乘数方程中，并简化它。

$L = \frac{1}{2}\left ( \sum_i \alpha_i y_i x_i \right )\cdot\left ( \sum_j \alpha_j y_j x_j \right ) - \left ( \sum_i \alpha_i y_i x_i \right )\cdot\left ( \sum_j \alpha_j y_j x_j \right ) - \sum_i\left ( \alpha_i y_i b \right ) + \sum_i\left ( \alpha_{i} \right ) \$

在上式中，术语

$\sum_i\left ( \alpha_i y_i b \right ) = 0$

because, b is just a constant and the rest is from the above equation”

$L = \sum \alpha_i - \frac{1}{2}\sum_i \sum_j \alpha_{i} \alpha_{j} y_i y_j \left ( x_i \cdot x_j \right ) \alpha_j \geq 0 \forall j$

为了找到 b，我们也可以使用上面的等式和约束

$\alpha_j > 0 \,for \,some\, j$

$y_j\left ( \vec{w}\cdot\vec{x} + b \right ) = 1 \ \ y_jy_j\left ( \vec{w}\cdot\vec{x} + b \right ) = y_j \ \ y_j \in \left { -1,1 \right } \ \ \left ( \vec{w}\cdot\vec{x} + b \right ) = y_j \ \ b = y_k - w \cdot x_k \forall k where \, \alpha_k > 0$

现在，决策规则可以由下式给出:

$y_i = sign(\sum \alpha_{i} y_i \left ( \vec{x}_i \cdot \vec{x} \right ) +b )$

注意，从上面的规则我们可以观察到，拉格朗日乘数只是依赖于 x _i 与未知变量 x 的点积，这个点积定义为核函数，用 K 表示

$L = \sum \alpha_i - \frac{1}{2}\sum_i \sum_j \alpha_{i} \alpha_{j} y_i y_j K(x_i,x_j) \ \ where K = (x_i.x_j)$

现在，对于线性不可分的情况，对偶方程变成:

$\underset{\alpha}{max} \sum_i \alpha_i - \sum_{i,j} \alpha_i \alpha_j y_i y_j x_i \cdot x_j \ \ for, \ \ \sum_i \alpha_i y_i =0 \ \ 0 \leq \alpha_i \leq C$

这里，我们添加了一个常数 C，之所以需要它是因为以下原因:

它阻止

$\alpha$
的值来自

$\alpha \to \infty$
。 * 它还防止模型过度拟合，这意味着一些错误分类是可以接受的。

图像描绘变换

我们将变换应用到另一个空间，如下所示。注意，我们不需要专门计算变换函数，我们只需要找到那些得到核函数的点积，然而，这个变换函数可以很容易地建立起来。

$K = \phi(i) \cdot \phi(j)$

哪里，

$\phi()$

is the transformation function.

直觉告诉我们，很多时候，一个数据可以被一个更高维度的超平面分割开来。让我们更详细地看看这个:

假设我们有一个数据集，它只包含 1 个自变量和 1 个因变量。下图显示了数据:

现在，在上面的图中，很难分离清楚地分离不同类的数据点的 1D-超平面(点)。但是当通过使用某种转换转换到 2d 时，它提供了分离类的选项。

在上面的例子中，我们可以看到 SVM 线可以清楚地将数据集的两个类别分开。

有一些非常常用的著名内核:

n 次多项式

$K(u,v) = (u \cdot v)^{n}$

n 次以下的多项式

$K(u,v) = (u \cdot v + 1)^{n}$

高斯/径向基函数核

$K(\vec{u}, \vec{v}) = e^{-\frac{\left | \vec{u}-\vec{v} \right |_{2}^{2}}{2 \sigma^2}}$

履行

Python 3

# code
import numpy as np
import matplotlib.pyplot as plt
from sklearn import svm, datasets

# import some data
cancer = datasets.load_breast_cancer()
X = cancer.data[:,:2]
Y = cancer.target

X.shape, Y.shape

# perform svm with different kernel, here c is the regularizer
h = .02
C=100
lin_svc = svm.LinearSVC(C=C)
svc = svm.SVC(kernel='linear', C=C)
rbf_svc = svm.SVC(kernel='rbf', gamma=0.7, C=C)
poly_svc = svm.SVC(kernel='poly', degree=3, C=C)

# Fit the training dataset.
lin_svc.fit(X, Y)
svc.fit(X, Y)
rbf_svc.fit(X, Y)
poly_svc.fit(X, Y)

# plot the results
x_min, x_max = X[:, 0].min() - 1, X[:, 0].max() + 1
y_min, y_max = X[:, 1].min() - 1, X[:, 1].max() + 1
xx, yy = np.meshgrid(np.arange(x_min, x_max, h),np.arange(y_min, y_max, h))

titles = ['linear kernel',
          'LinearSVC (linear kernel)',
          'RBF kernel',
          'polynomial (degree 3) kernel']

plt.figure(figsize=(10,10))

for i, clf in enumerate((svc, lin_svc,rbf_svc, poly_svc )):
    # Plot the decision boundary using the above meshgrid we generated
    plt.subplot(2, 2, i + 1)
    Z = clf.predict(np.c_[xx.ravel(), yy.ravel()])

    # Put the result into a color plot
    Z = Z.reshape(xx.shape)
    plt.set_cmap(plt.cm.flag_r)
    plt.contourf(xx, yy, Z)

    # Plot also the training points
    plt.scatter(X[:, 0], X[:, 1], c=Y)

    plt.title(titles[i])

plt.show()

((569, 2), (569,))

SVM 使用不同的内核。

参考文献:

麻省理工开放式课程幻灯片 SVM

版权属于：月萌API www.moonapi.com，转载请注明出处

本文链接：https://moonapi.com/news/1427.html

python 查看更多书籍

《GeeksForGeeks Python 中文教程 2022-05-15》

分类

最近更新

python 双支持向量机

双支持向量机

履行

Python 3

参考文献:

留言

联系客服

数据知识

系统公告

开发文档

python查看更多书籍

《GeeksForGeeks Python 中文教程 2022-05-15》

python 双支持向量机

双支持向量机

履行

Python 3

参考文献:

留言

联系客服

python 查看更多书籍