First, it is important to be clear that logistic regression, unlike linear regression, is a classification model, and specifically a binary classification model.
First we need to know the sigmoid function, whose formula is

g(z) = 1 / (1 + e^(-z))

Its curve is the familiar S-shape (the plot from the original post is omitted here).
What properties does the sigmoid function have?
1. It is symmetric about the point (0, 0.5).
2. Its range is (0, 1).
3. It is monotonically increasing.
4. It is smooth.
5. It is steep near the middle and flat toward both ends.
6. Its derivative is g(z)(1 - g(z)), so it can be computed directly from the value of the function itself (a quick numerical check follows this list).
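As a sanity check on property 6, here is a small sketch (my own addition, not from the original post) that compares the analytic derivative g(z)(1 - g(z)) with a central finite-difference estimate:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Compare g'(z) = g(z) * (1 - g(z)) with a central finite difference
z = np.linspace(-5, 5, 11)
eps = 1e-6
analytic = sigmoid(z) * (1 - sigmoid(z))
numeric = (sigmoid(z + eps) - sigmoid(z - eps)) / (2 * eps)
print(np.max(np.abs(analytic - numeric)))  # tiny difference: the two agree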
The hypothesis of logistic regression can therefore be written as

hθ(x) = g(θᵀx) = 1 / (1 + e^(-θᵀx))

where θ is the weight (parameter) vector and x is the input. The set of points where θᵀx = 0 (equivalently hθ(x) = 0.5) is the decision boundary, i.e. the boundary that separates the two classes of data.
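As a concrete illustration (the weights and sample below are made up for the example), a sample is assigned to class 1 when hθ(x) >= 0.5, which is the same test as θᵀx >= 0:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

theta = np.array([0.5, -1.0, 2.0])   # hypothetical learned weights
x = np.array([1.0, 3.0, 1.5])        # one input sample
h = sigmoid(theta.dot(x))            # h_theta(x) = g(theta^T x)
label = int(h >= 0.5)                # same as testing theta^T x >= 0
print(h, label)                      # 0.62..., 1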
Why use the sigmoid function?
1. Because of the properties of the sigmoid function listed above.
2. Because it can be derived, as follows.
We know the Bernoulli distribution:

f(x|p) = p^x (1 - p)^(1 - x),  x ∈ {0, 1}

so when x = 1, f(1|p) = p, and when x = 0, f(0|p) = 1 - p.

First, note that the Bernoulli distribution is a member of the exponential family, whose general form is

p(y; η) = b(y) · exp(η · T(y) - a(η))

Since

f(x|p) = p^x (1 - p)^(1 - x) = exp(x · log(p / (1 - p)) + log(1 - p))

we can read off

b(x) = 1,  T(x) = x,  η = log(p / (1 - p)),  a(η) = -log(1 - p)

So the natural parameter is η = log(p / (1 - p)). Because e^η = p / (1 - p), solving for p gives

p = 1 / (1 + e^(-η))

which is exactly the sigmoid function: the sigmoid arises naturally when a Bernoulli-distributed label is written in exponential-family form.
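This derivation can also be checked numerically: mapping a probability p to the natural parameter η = log(p / (1 - p)) and pushing η back through the sigmoid recovers p. A small sketch added here for illustration:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

p = np.array([0.1, 0.3, 0.5, 0.9])
eta = np.log(p / (1 - p))   # natural parameter (log-odds) of the Bernoulli distribution
print(sigmoid(eta))         # [0.1 0.3 0.5 0.9] -- the sigmoid inverts the log-odds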
The logistic regression cost function is

J(θ) = -(1/m) Σ [ y · log(hθ(x)) + (1 - y) · log(1 - hθ(x)) ]

where the sum runs over the m training samples. Why is it defined this way? Take a single sample as an example:

C(θ) = -log(hθ(x))       if y = 1
C(θ) = -log(1 - hθ(x))   if y = 0

The expression above is equivalent to:

C(θ) = -y · log(hθ(x)) - (1 - y) · log(1 - hθ(x))
When y = 1, the cost is -log(hθ(x)) (the plot from the original post is omitted here); that is, the closer hθ(x) is to 1, the smaller C(θ) becomes. Likewise, when y = 0 the cost is -log(1 - hθ(x)), so the closer hθ(x) is to 0, the smaller C(θ) becomes. In this way the two classes can be separated.
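To make this behaviour concrete, the snippet below (my own illustration) evaluates the single-sample cost for a few predictions; the cost shrinks as hθ(x) approaches the true label and grows sharply as it approaches the wrong one:

import numpy as np

def single_sample_cost(h, y):
    # C(theta) = -y * log(h) - (1 - y) * log(1 - h)
    return -y * np.log(h) - (1 - y) * np.log(1 - h)

h = np.array([0.01, 0.5, 0.99])
print(single_sample_cost(h, y=1))  # large cost near h=0, small cost near h=1
print(single_sample_cost(h, y=0))  # small cost near h=0, large cost near h=1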
The derivative of the cost function is

∂J(θ)/∂θ_j = (1/m) Σ (hθ(x) - y) · x_j

again summing over the m training samples. The derivation (shown as images in the original post, omitted here) uses the chain rule together with the sigmoid property g'(z) = g(z)(1 - g(z)): differentiating -y · log(g(θᵀx)) - (1 - y) · log(1 - g(θᵀx)) with respect to θ_j gives (g(θᵀx) - y) · x_j, and averaging over the samples yields the expression above.
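The gradient formula can be verified numerically. The sketch below (names and data are my own, purely for illustration) compares the analytic gradient (1/m) · Xᵀ(hθ(X) - y) with a finite-difference estimate of the cost:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def cost(theta, X, y):
    h = sigmoid(X.dot(theta))
    return -np.mean(y * np.log(h) + (1 - y) * np.log(1 - h))

rng = np.random.RandomState(0)
X = rng.randn(20, 3)
y = (rng.rand(20) > 0.5).astype(float)
theta = rng.randn(3)

# Analytic gradient: (1/m) * X^T (h - y)
analytic = X.T.dot(sigmoid(X.dot(theta)) - y) / len(y)

# Central finite differences of the cost, one parameter at a time
eps = 1e-6
numeric = np.zeros_like(theta)
for j in range(len(theta)):
    d = np.zeros_like(theta)
    d[j] = eps
    numeric[j] = (cost(theta + d, X, y) - cost(theta - d, X, y)) / (2 * eps)

print(np.max(np.abs(analytic - numeric)))  # tiny difference: the gradients match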
The above draws on:
https://blog.csdn.net/sun_wangdong/article/details/80780368
https://zhuanlan.zhihu.com/p/28415991
Next comes the code implementation. The code is from: https://github.com/eriklindernoren/ML-From-Scratch
from __future__ import print_function, division
import numpy as np
import math
from mlfromscratch.utils import make_diagonal, Plot
from mlfromscratch.deep_learning.activation_functions import Sigmoid

class LogisticRegression():
    """ Logistic Regression classifier.
    Parameters:
    -----------
    learning_rate: float
        The step length that will be taken when following the negative gradient during
        training.
    gradient_descent: boolean
        True or false depending if gradient descent should be used when training. If
        false then we use batch optimization by least squares.
    """
    def __init__(self, learning_rate=.1, gradient_descent=True):
        self.param = None
        self.learning_rate = learning_rate
        self.gradient_descent = gradient_descent
        self.sigmoid = Sigmoid()

    def _initialize_parameters(self, X):
        n_features = np.shape(X)[1]
        # Initialize parameters between [-1/sqrt(N), 1/sqrt(N)]
        limit = 1 / math.sqrt(n_features)
        self.param = np.random.uniform(-limit, limit, (n_features,))

    def fit(self, X, y, n_iterations=4000):
        self._initialize_parameters(X)
        # Tune parameters for n iterations
        for i in range(n_iterations):
            # Make a new prediction
            y_pred = self.sigmoid(X.dot(self.param))
            if self.gradient_descent:
                # Move against the gradient of the loss function with
                # respect to the parameters to minimize the loss
                self.param -= self.learning_rate * -(y - y_pred).dot(X)
            else:
                # Make a diagonal matrix of the sigmoid gradient column vector
                diag_gradient = make_diagonal(self.sigmoid.gradient(X.dot(self.param)))
                # Batch opt:
                self.param = np.linalg.pinv(X.T.dot(diag_gradient).dot(X)).dot(X.T).dot(diag_gradient.dot(X).dot(self.param) + y - y_pred)

    def predict(self, X):
        y_pred = np.round(self.sigmoid(X.dot(self.param))).astype(int)
        return y_pred
Note: np.linalg.pinv() computes the pseudo-inverse of a matrix. The first branch (gradient_descent=True) trains with gradient descent over the full training set; the second branch performs the least-squares style batch update.
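For readability, here is an equivalent, more verbose restatement of that batch update (my own sketch, not part of the original repository; it uses np.diag in place of make_diagonal):

import numpy as np

def batch_update(X, y, param):
    # One least-squares style batch update, equivalent to the else-branch of fit()
    z = X.dot(param)
    y_pred = 1 / (1 + np.exp(-z))
    # Diagonal matrix of sigmoid gradients: R = diag(g(z) * (1 - g(z)))
    R = np.diag(y_pred * (1 - y_pred))
    # Solve (X^T R X) param_new = X^T (R X param + y - y_pred) via the pseudo-inverse
    return np.linalg.pinv(X.T.dot(R).dot(X)).dot(X.T).dot(R.dot(X).dot(param) + y - y_pred)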
The make_diagonal() function, which converts a vector into a diagonal matrix, is as follows:
import numpy as np

def make_diagonal(x):
    """ Converts a vector into a diagonal matrix """
    m = np.zeros((len(x), len(x)))
    for i in range(len(m[0])):
        m[i, i] = x[i]
    return m
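Incidentally, for a 1-D input make_diagonal(x) produces the same matrix as NumPy's built-in np.diag(x) (assuming the function above is in scope):

import numpy as np

x = np.array([0.2, 0.5, 0.8])
print(make_diagonal(x))
print(np.allclose(make_diagonal(x), np.diag(x)))  # True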
The Sigmoid class is implemented as follows:
import numpy as np

class Sigmoid():
    def __call__(self, x):
        return 1 / (1 + np.exp(-x))

    def gradient(self, x):
        return self.__call__(x) * (1 - self.__call__(x))
Finally, the main script that runs everything:
from __future__ import print_function
from sklearn import datasets
import numpy as np
import matplotlib.pyplot as plt

# Import helper functions
import sys
sys.path.append("/content/drive/My Drive/learn/ML-From-Scratch/")
from mlfromscratch.utils import make_diagonal, normalize, train_test_split, accuracy_score
from mlfromscratch.deep_learning.activation_functions import Sigmoid
from mlfromscratch.utils import Plot
from mlfromscratch.supervised_learning import LogisticRegression


def main():
    # Load dataset
    data = datasets.load_iris()

    X = normalize(data.data[data.target != 0])
    y = data.target[data.target != 0]
    y[y == 1] = 0
    y[y == 2] = 1

    X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.33, seed=1)

    clf = LogisticRegression(gradient_descent=True)
    clf.fit(X_train, y_train)
    y_pred = clf.predict(X_test)

    accuracy = accuracy_score(y_test, y_pred)
    print("Accuracy:", accuracy)

    # Reduce dimension to two using PCA and plot the results
    Plot().plot_in_2d(X_test, y_pred, title="Logistic Regression", accuracy=accuracy)


if __name__ == "__main__":
    main()
Result:
Accuracy: 0.9393939393939394