training set, validation set, test set的區別

本文轉載自查看原文 2020-01-18 05:40 787 機器學習

training set：用來訓練模型
validation set : 用來做model selection
test set : 用來評估所選出來的model的實際性能

我們知道，在做模型訓練之前，我們必須選擇所訓練的模型的形式：線性模型(y = wx+b)或者非線性模型(SVM,decision tree,neural network….)。選擇好模型之后，我們才會開始訓練，訓練的目標是確定模型的參數，訓練一般是通過設計損失函數，然后對損失函數進行優化來完成訓練。

而很多時候我們並不知道哪種模型適合，所以往往我們需要對多種模型進行訓練，訓練完之后就會得到多個模型的結果，我們希望從這些訓練好的模型中選擇最適合的模型。我們通過用validation set對所有模型進行測試，然后選出error rate最小的那個模型。

所以說valaidation set主要是用來選擇模型的。

The main trick here is to 'hold out' a portion of our data from training and use the models performance on that sub-set of the data as a proxy for the true risk.

This data is known as 'validation' data. It contrasts with test data, because it's values are known at the model design time. However, in contrast to test data we don't use it to fit our model.

This means that it doesn't exhibit the same bias that the empirical risk does when estimating the true risk.

免責聲明！

本站轉載的文章為個人學習借鑒使用，本站對版權不負任何法律責任。如果侵犯了您的隱私權益，請聯系本站郵箱yoyou2525@163.com刪除。

猜您在找 訓練集，驗證集，測試集（以及為什么要使用驗證集？）（Training Set, Validation Set, Test Set）訓練集(train set) 驗證集(validation set) 測試集(test set) [綜] 訓練集(train set) 驗證集(validation set) 測試集(test set) Test and Set 數據集划分——train set, validate set and test set 機器學習入門06 - 訓練集和測試集 (Training and Test Sets) @RequestBody 和 @RequestParam(“test”) 的區別與聯系 set與map的區別 Set，Map有什么區別？ List、Set、Map的區別