python機器學習——感知器

本文轉載自查看原文 2019-11-11 23:52 424 機器學習

最近在看機器學習相關的書籍，順便把每天閱讀的部分寫出來和大家分享，共同學習探討一起進步！作為機器學習的第一篇博客，我准備從感知器開始，之后會慢慢更新其他內容。

在實現感知器算法前，我們需要先了解一下神經元（neuron）的工作原理，神經元有很多樹突和一個軸突，樹突（Dendrites）可以從其他神經元接收信息並將其帶到細胞體（Cell nucleus），軸突（Axon）可以從細胞體發送信息到其他神經元。樹突傳遞過來的信息在細胞體中進行計算處理后，如果結果超過某個閾值，軸突會傳遞信號到其他的神經元。人們通過認識神經元的工作過程，創造出了感知器學習算法。

感知器是Frank Rosenblatt在1975年就職於康奈爾實驗室時所發明的一種人工神經網絡，它被視為一種最簡單形式的前饋神經網絡，是一種二元線性分類器，不足在於不能處理線性不可分問題。

下圖為三種不同情況，左圖中的兩類可以使用一條直線（即線性函數）分開，即線性可分；中間和右邊由於不能使用線性函數分開，則為線性不可分。

我們直接來看一個實例，假設我們現在需要對花進行分類，數據集中有兩種花朵，分別將其記為1和-1，我們需要根據數據集含有的花的一些特征來進行分類，這里僅使用兩種花的特征，即萼片長度和花瓣長度，將這兩個特征用向量表示為：

\[x = \begin{bmatrix}x_1\\x_2\end{bmatrix} \]

x也叫做輸入向量，我們再定義一個相應的權重向量w：

\[w = \begin{bmatrix}w_1\\w_2\end{bmatrix} \]

將x和w線性組合后得到z：

\[z = w_1x_1+w_2x_2 \]

我們假設，如果樣本的激活值z大於等於事先設置好的閾值b，我們就說此樣本屬於類別1，否則屬於類別-1，公式表示如下：

\[\phi(z) = \begin{cases}1,\quad z\ge b \\\\-1,\quad otherwise\end{cases} \]

可以看出這個想法和神經元的工作原理很相似。為了方便，我們將閾值b移到等式的左邊並額外定義一個權重參數來代替-b，更新z為以下等式：

\[z = w_0x_0+w_1x_1+w_2x_2 \]

那么上式中的z大於等於0的情況也就等價於之前當z大於等於閾值b的情況，可以得到：

\[\phi(z) = \begin{cases}1,\quad z\ge0 \\\\-1,\quad otherwise\end{cases} \]

上面的函數也叫做激活函數，我們通過激活函數將z壓縮到了一個二元輸出（1，-1），也就是：

我們總結一下我們學到的，感知器的總體結構如下：

感知器將輸入變量x與權重向量w結合得到z，z經過激活函數后被壓縮到一為二元輸出。

我們可以看出權重向量w決定着分類是否准確，那么我們如何選擇合適的權重向量w呢？我們不能一個一個給w賦值，這樣工作量太大且沒有效率，其實感知器可以通過數據集中的樣本自動調整w，隨着訓練的進行，w的變化趨於平穩，分類的准確率也會大大提高。

我們更新權重向量w的公式為：

\[w_j = w_j + \Delta w_j \]

\[\Delta w_j = \eta(y^i-\hat{y^i})x^i_j \]

\[\eta-學習率\\w_j-w向量的第j個特征\\y^i-第i個樣本的真實類別\\\hat{y^i}-第i個樣本的預測類別\\x_j^i-第i個樣本的第j個特征 \]

其中學習率介於0.0和1.0之間，用於控制w更新的程度，權重向量w中的每一個參數都是同步更新的，即只有在w的每個參數的更新大小都計算出來后才會改變w的值，我們使用數據集中的大量訓練樣本x來更新w，來逐漸提高分類准確率。

感知器算法只有類別線性可分且學習率較小的情況下才能保證收斂，感知器接收訓練樣本x，將x與w線性結合得到z，再將z傳遞給激活函數，產生一個分類結果作為對樣本x的預測類別，之后按照更新規則來更新w，等收斂后感知器也就訓練完成了。

接下來我們開始實現感知器算法並使用Iris數據集訓練：

import pandas as pd

讀取數據集

df = pd.read_csv('http://archive.ics.uci.edu/ml/machine-learning-databases/iris/iris.data', header=None)

df.tail()

	0	1	2	3	4
145	6.7	3.0	5.2	2.3	Iris-virginica
146	6.3	2.5	5.0	1.9	Iris-virginica
147	6.5	3.0	5.2	2.0	Iris-virginica
148	6.2	3.4	5.4	2.3	Iris-virginica
149	5.9	3.0	5.1	1.8	Iris-virginica

由上表可以看到每個輸入向量x都包含4個特征（0、1、2、3）和1個正確的類別（4）

%matplotlib inline
import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

取出前100個訓練樣本的類別向量，若其類別輸入‘Iris-setosa’，則將其設置為-1，否則設置為1

y = df.iloc[0:100, 4].values

y = np.where(y == 'Iris-setosa', -1, 1)

取出前100個訓練樣本的前兩個特征向量

X = df.iloc[0:100, [0, 2]].values

畫出這100個訓練樣本的類別分布圖

plt.scatter(X[:50, 0], X[:50, 1], color='red', marker='o', label='setosa')
plt.scatter(X[50:100, 0], X[50:100, 1], color='blue', marker='x', label='versicolor')
plt.xlabel('petal length')
plt.ylabel('sepal length')
plt.legend(loc='upper left')
plt.show()

實現感知器

import numpy as np

class Perceptron(object):
    """Perceptron classifier.

    Parameters
    ----------
    eta:float
        Learning rate(between 0.0 and 1.0
    n_iter:int
        Passes over the training dataset.

    Attributes
    ----------
    w_:1d-array
        weights after fitting.
    errors_:list
        Number of miscalssifications in every epoch.

    """

    def __init__(self, eta=0.01, n_iter=10):
        self.eta = eta
        self.n_iter = n_iter

    def fit(self, X, y):
        """Fit training data.

        :param X:{array-like}, shape=[n_samples, n_features]
        Training vectors, where n_samples is the number of samples and
        n_features is the number of features.
        :param y: array-like, shape=[n_samples]
        Target values.
        :return:
        self:object

        """

        self.w_ = np.zeros(1 + X.shape[1]) # Add w_0
        self.errors_ = []

        for _ in range(self.n_iter):
            errors = 0
            for xi, target in zip(X, y):
                update = self.eta * (target - self.predict(xi))
                self.w_[1:] += update * xi
                self.w_[0] += update
                errors += int(update != 0.0)
            self.errors_.append(errors)
        return self

    def net_input(self, X):
        """Calculate net input"""
        return np.dot(X, self.w_[1:]) + self.w_[0]

    def predict(self, X):
        """Return class label after unit step"""
        return np.where(self.net_input(X) >= 0.0, 1, -1) #analoge ? :n in C++

ppn = Perceptron(eta = 0.1, n_iter = 10)

ppn.fit(X, y)

<__main__.Perceptron at 0x16680906978>

畫出訓練曲線

plt.plot(range(1, len(ppn.errors_) + 1), ppn.errors_, marker = 'o')
plt.xlabel('Epoches')
plt.ylabel('Number of misclassifications')
plt.show()

畫出分界線

from matplotlib.colors import ListedColormap

def plot_decision_region(X, y, classifier, resolution=0.02):
    # setup marker generator and color map
    markers = ('s', 'x', 'o', '^', 'v')
    colors = ('red', 'blue', 'lightgreen', 'gray', 'cyan')
    cmap = ListedColormap(colors[:len(np.unique(y))])
    
    # plot the decision surface
    x1_min, x1_max = X[:, 0].min() - 1, X[:, 0].max() + 1
    x2_min, x2_max = X[:, 1].min() - 1, X[:, 0].max() + 1
    
    xx1, xx2 = np.meshgrid(np.arange(x1_min, x1_max, resolution),
                          np.arange(x2_min, x2_max, resolution))
    Z = classifier.predict(np.array([xx1.ravel(), xx2.ravel()]).T)
    Z = Z.reshape(xx1.shape)
    
    plt.contourf(xx1, xx2, Z, alpha=0.4, cmap=cmap)
    plt.xlim(xx1.min(), xx1.max())
    plt.ylim(xx2.min(), xx2.max())
    
    #plot class samples
    for idx, cl in enumerate(np.unique(y)):
        plt.scatter(x=X[y == cl, 0], y=X[y == cl, 1],
                   alpha=0.8, c=cmap(idx), marker = markers[idx],
                   label=cl)

plot_decision_region(X, y, classifier=ppn)
plt.xlabel('sepal length [cm]')
plt.ylabel('petal length [cm]')
plt.legend(loc='upper left')
plt.show()

參考：

https://www.toutiao.com/a6669391886744027662/
https://zh.wikipedia.org/wiki/%E6%84%9F%E7%9F%A5%E5%99%A8

免責聲明！

本站轉載的文章為個人學習借鑒使用，本站對版權不負任何法律責任。如果侵犯了您的隱私權益，請聯系本站郵箱yoyou2525@163.com刪除。

猜您在找 機器學習 | 剖析感知器算法 & Python實現【機器學習】單層感知器機器學習:感知器（perceptron）機器學習算法--Perceptron(感知器)算法 Coursera機器學習基石第2講：感知器機器學習入門篇——感知器機器學習之感知器算法原理和Python實現【模式識別與機器學習】——3.6感知器算法3.7采用感知器算法的多類模式的分類《神經網絡與機器學習》學習筆記——第1章感知器機器學習之感知器和線性回歸、邏輯回歸以及SVM的相互對比