python分割train和test

本文轉載自查看原文 2021-05-28 17:51 189 python related/ 研究僧學習

用sklearn庫中的train_test_split方法

from  sklearn.model_selection  import  train_test_split
train, test = train_test_split(data, random_state=2021, train_size=0.8)

自己用numpy寫

import numpy as np

# 從 0~n 中隨機選取 x 個數字
def getRandomIdx(n, x):
    return np.random.choice(np.arange(n), size=x, replace=False)

# 調用方法進行分割，獲取train、test的index
total_cnt = data.shape[0]
train_idx = np.array(getRandomIdx(total_cnt, int(total_cnt * 0.8)))
test_idx = np.delete(np.arange(total_cnt), train_idx)

# 得到訓練集和測試集
data_train = data[train_idx]
data_test = data[test_idx]

免責聲明！

本站轉載的文章為個人學習借鑒使用，本站對版權不負任何法律責任。如果侵犯了您的隱私權益，請聯系本站郵箱yoyou2525@163.com刪除。

猜您在找 train_test_split 分割測試集訓練集 train_test_split用法 train_test_split()函數 train loss和test loss分析 sklearn的train_test_split函數 sklearn的train_test_split，果然很好用啊！ train_test_split參數含義 train_test_split數據切分 train loss與test loss結果分析/loss不下降 train,dev,test數據集作用