Assignment Description
Exercise 2 (Week 3): implement logistic regression in Octave. Data sets: ex2data1.txt and ex2data2.txt.
Implement the sigmoid function, cost computation (Computing Cost), and gradient descent (Gradient Descent).
File List
- ex2.m - Octave/MATLAB script that steps you through the exercise
- ex2 reg.m - Octave/MATLAB script for the later parts of the exercise
- ex2data1.txt - Training set for the first half of the exercise
- ex2data2.txt - Training set for the second half of the exercise
- submit.m - Submission script that sends your solutions to our servers
- mapFeature.m - Function to generate polynomial features
- plotDecisionBoundary.m - Function to plot classifier’s decision boundary
- [*] plotData.m - Function to plot 2D classification data
- [*] sigmoid.m - Sigmoid Function
- [*] costFunction.m - Logistic Regression Cost Function
- [*] predict.m - Logistic Regression Prediction Function
- [*] costFunctionReg.m - Regularized Logistic Regression Cost
Files marked with [*] must be completed.
Conclusion
Regularization does not apply to the first parameter θ0.
Logistic Regression
Background: a university administrator wants to decide whether each applicant should be admitted, based on the applicant's historical scores on two exams.
In ex2data1.txt, the first two columns are the scores on the two exams, and the third column is the label y, either 0 or 1.
1. Plotting the Data
plotData.m:
positive = find(y == 1);
negative = find(y == 0);

plot(X(positive,1), X(positive,2), 'k+', 'MarkerFaceColor', 'g', 'MarkerSize', 7);
hold on;
plot(X(negative,1), X(negative,2), 'ko', 'MarkerFaceColor', 'y', 'MarkerSize', 7);
Running it produces the following scatter plot of the training data:
2. The Sigmoid Function
function g = sigmoid(z)
% Instructions: Compute the sigmoid of each value of z (z can be a matrix,
% vector or scalar).
g = 1 ./ (1 + exp(-z));
end
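A quick sanity check on the implementation: sigmoid(0) should return exactly 0.5, and large-magnitude inputs should saturate toward 0 or 1.

sigmoid(0)            % ans = 0.5000
sigmoid([-10 0 10])   % ans ≈ [4.54e-05, 0.5, 1.0]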
3. The Cost Function
costFunction.m:
function [J, grad] = costFunction(theta, X, y)

m = length(y); % number of training examples

part1 = -1 * y' * log(sigmoid(X * theta));
part2 = (1 - y)' * log(1 - sigmoid(X * theta));
J = 1 / m * (part1 - part2);

grad = 1 / m * X' * (sigmoid(X * theta) - y);

end
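A useful sanity check (assuming X already contains the intercept column, as in the ex2.m snippet later on): with θ = 0, every prediction is sigmoid(0) = 0.5, so the cost is −ln(0.5) ≈ 0.693 regardless of the data.

[J, grad] = costFunction(zeros(size(X, 2), 1), X, y);
fprintf('Cost at initial theta (zeros): %f\n', J);   % expect ≈ 0.693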
4. The Prediction Function
Given X and theta, return a vector of predictions, where each element is 0 or 1.
function p = predict(theta, X)
%PREDICT Predict whether the label is 0 or 1 using learned logistic
%regression parameters theta
%   p = PREDICT(theta, X) computes the predictions for X using a
%   threshold at 0.5 (i.e., if sigmoid(theta'*x) >= 0.5, predict 1)

m = size(X, 1); % Number of training examples

% At first I did not round, which caused wrong answers.
p = round(sigmoid(X * theta));

end
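Once fminunc (next section) has produced θ, the training accuracy is just the fraction of predictions that match the labels; on ex2data1.txt it should come out around 89%.

p = predict(theta, X);
fprintf('Train Accuracy: %f\n', mean(double(p == y)) * 100);   % ≈ 89% on ex2data1.txt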
5. Running Logistic Regression
The calls in ex2.m:
Loading the data:
data = load('ex2data1.txt');
X = data(:, [1, 2]); y = data(:, 3);

[m, n] = size(X);

% Add intercept term to x and X_test
X = [ones(m, 1) X];

initial_theta = zeros(n + 1, 1);
Calling fminunc:
options = optimset('GradObj', 'on', 'MaxIter', 400);
[theta, cost] = ...
    fminunc(@(t)(costFunction(t, X, y)), initial_theta, options);
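With the learned θ, the model can score an individual applicant; ex2.m evaluates a student with exam scores 45 and 85, for which the expected admission probability is about 0.776.

% Note the leading 1 for the intercept term
prob = sigmoid([1 45 85] * theta);
fprintf('Admission probability for scores 45 and 85: %f\n', prob);   % ≈ 0.776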
6. Plotting the Decision Boundary
plotDecisionBoundary.m
function plotDecisionBoundary(theta, X, y)
%PLOTDECISIONBOUNDARY Plots the data points X and y into a new figure with
%the decision boundary defined by theta
%   PLOTDECISIONBOUNDARY(theta, X, y) plots the data points with + for the
%   positive examples and o for the negative examples. X is assumed to be
%   either
%   1) Mx3 matrix, where the first column is an all-ones column for the
%      intercept.
%   2) MxN, N>3 matrix, where the first column is all-ones

% Plot Data
plotData(X(:,2:3), y);
hold on

if size(X, 2) <= 3
    % Only need 2 points to define a line, so choose two endpoints
    plot_x = [min(X(:,2))-2, max(X(:,2))+2];

    % Calculate the decision boundary line
    plot_y = (-1./theta(3)).*(theta(2).*plot_x + theta(1));

    % Plot, and adjust axes for better viewing
    plot(plot_x, plot_y)

    % Legend, specific for the exercise
    legend('Admitted', 'Not admitted', 'Decision Boundary')
    axis([30, 100, 30, 100])
else
    % Here is the grid range
    u = linspace(-1, 1.5, 50);
    v = linspace(-1, 1.5, 50);

    z = zeros(length(u), length(v));
    % Evaluate z = theta*x over the grid
    for i = 1:length(u)
        for j = 1:length(v)
            z(i,j) = mapFeature(u(i), v(j))*theta;
        end
    end
    z = z'; % important to transpose z before calling contour

    % Plot z = 0
    % Notice you need to specify the range [0, 0]
    contour(u, v, z, [0, 0], 'LineWidth', 2)
end
hold off

end
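The plot_y line in the two-feature branch comes from the decision threshold: the boundary is where sigmoid(θᵀx) = 0.5, i.e. where θᵀx = 0. Solving for the second feature:

$$\theta_0 + \theta_1 x_1 + \theta_2 x_2 = 0 \quad\Longrightarrow\quad x_2 = -\frac{1}{\theta_2}\left(\theta_1 x_1 + \theta_0\right)$$

which is exactly plot_y = (-1./theta(3)).*(theta(2).*plot_x + theta(1)) once Octave's 1-based indexing is taken into account.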
Regularized Logistic Regression
Background: predict whether microchips from a fabrication plant pass quality assurance (QA). During QA, each microchip goes through two tests to ensure it functions correctly.
In ex2data2.txt, the first two columns are the two test scores, and the third column is the label y, either 0 or 1.
1. Feature Mapping
With only two features, the two classes cannot be separated by a straight line. To fit the data better, the mapFeature function expands the two features x1 and x2 into all polynomial terms up to degree six. A sixth-degree curve is complex and prone to overfitting, which is why regularization is needed.
mapFeature.m
function out = mapFeature(X1, X2)
% MAPFEATURE Feature mapping function to polynomial features
%
%   MAPFEATURE(X1, X2) maps the two input features
%   to quadratic features used in the regularization exercise.
%
%   Returns a new feature array with more features, comprising of
%   X1, X2, X1.^2, X2.^2, X1*X2, X1*X2.^2, etc..
%
%   Inputs X1, X2 must be the same size
%

degree = 6;
out = ones(size(X1(:,1)));
for i = 1:degree
    for j = 0:i
        out(:, end+1) = (X1.^(i-j)).*(X2.^j);
    end
end

end
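The inner loop emits i + 1 terms at each degree i, so for degree 6 the output has 1 + 2 + 3 + ... + 7 = 28 columns, including the leading column of ones that doubles as the intercept term. A quick check (assuming data has been loaded as in the ex2_reg.m snippet below):

out = mapFeature(data(:,1), data(:,2));
size(out, 2)   % ans = 28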
2. The Cost Function
Note: θ0 does not participate in regularization.
The cost function for regularized logistic regression splits into three terms (two averaged log-likelihood terms plus the regularization term), and its gradient is the unregularized gradient plus a (λ/m)θ_j term for j ≥ 1:
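In the notation of the course, matching the implementation in costFunctionReg.m below:

$$J(\theta) = \frac{1}{m}\sum_{i=1}^{m}\left[-y^{(i)}\log\left(h_\theta(x^{(i)})\right) - (1-y^{(i)})\log\left(1-h_\theta(x^{(i)})\right)\right] + \frac{\lambda}{2m}\sum_{j=1}^{n}\theta_j^2$$

$$\frac{\partial J}{\partial \theta_0} = \frac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)x_0^{(i)}, \qquad \frac{\partial J}{\partial \theta_j} = \frac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)x_j^{(i)} + \frac{\lambda}{m}\theta_j \quad (j \geq 1)$$

where $h_\theta(x) = \mathrm{sigmoid}(\theta^T x)$.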
costFunctionReg.m:
function [J, grad] = costFunctionReg(theta, X, y, lambda)

m = length(y); % number of training examples

% theta0 does not participate in regularization. Copy theta into a new
% variable, zero out its first element, and use the copy wherever lambda
% is involved.
t = theta;
t(1) = 0;

% First term
part1 = -y' * log(sigmoid(X * theta));
% Second term
part2 = (1 - y)' * log(1 - sigmoid(X * theta));
% Regularization term
regTerm = lambda / 2 / m * t' * t;
J = 1 / m * (part1 - part2) + regTerm;

% Gradient
grad = 1 / m * X' * (sigmoid(X * theta) - y) + lambda / m * t;

end
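As a sanity check, θ = 0 makes every prediction 0.5 and the regularization term vanish, so the initial cost should again be about 0.693 for any λ:

initial_theta = zeros(size(X, 2), 1);
[J, grad] = costFunctionReg(initial_theta, X, y, 1);
fprintf('Cost at initial theta (zeros): %f\n', J);   % expect ≈ 0.693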
The calls in ex2_reg.m:
% Load the data
data = load('ex2data2.txt');
X = data(:, [1, 2]); y = data(:, 3);

% Map features (mapFeature also adds the intercept column)
X = mapFeature(X(:,1), X(:,2));

% Initialize fitting parameters
initial_theta = zeros(size(X, 2), 1);
lambda = 1;

% Call fminunc
options = optimset('GradObj', 'on', 'MaxIter', 400);
[theta, J, exit_flag] = ...
    fminunc(@(t)(costFunctionReg(t, X, y, lambda)), initial_theta, options);
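Training accuracy can then be computed with the predict function from the first part; with λ = 1 it should match the 83.05% reported in the next section.

p = predict(theta, X);
fprintf('Train Accuracy: %f\n', mean(double(p == y)) * 100);   % ≈ 83.05% for lambda = 1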
3. Parameter Tuning
(1) Before regularization is applied, the decision boundary looks as follows; overfitting is clearly visible.
(2) With λ = 1, the decision boundary looks as follows. The training set accuracy is 83.05%.
(3) With λ = 100, the curve looks as follows. The training set accuracy is 61.01%.
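A minimal sketch for reproducing this comparison (the loop is mine, not from the exercise scripts): refit the model for each λ and report the training accuracy.

% Compare the effect of different regularization strengths
for lambda = [0 1 100]
    initial_theta = zeros(size(X, 2), 1);
    options = optimset('GradObj', 'on', 'MaxIter', 400);
    theta = fminunc(@(t)(costFunctionReg(t, X, y, lambda)), ...
                    initial_theta, options);
    p = predict(theta, X);
    fprintf('lambda = %g: train accuracy = %.2f%%\n', ...
            lambda, mean(double(p == y)) * 100);
    % plotDecisionBoundary(theta, X, y);   % optional: visualize each boundary
end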
https://github.com/madoubao/coursera_machine_learning/tree/master/homework/machine-learning-ex2/ex2