Python: randomly splitting data into a training set and a test set


# -*- coding: utf-8 -*-
"""
Created on Tue Jun 23 15:24:19 2015

@author: hd
"""

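# sklearn.cross_validation was removed in scikit-learn 0.20;
# train_test_split now lives in sklearn.model_selection.
from sklearn.model_selection import train_test_split

filename = r'C:\Users\hd\Desktop\bookmarks\bookmarks.arff'
train_path = r'C:\Users\hd\Desktop\bookmarks\train.arff'
test_path = r'C:\Users\hd\Desktop\bookmarks\test.arff'

# Read every line of the source file into a list.
with open(filename) as f:
    c = f.readlines()

# Shuffle and split the lines; test_size=0.6 sends 60% of them to the test set.
# Note: any header lines (@relation, @attribute, @data) are split like data rows.
c_train, c_test = train_test_split(c, test_size=0.6)

# Write each subset to its own file; the with-blocks close the files on exit.
with open(train_path, 'w') as out_train:
    out_train.writelines(c_train)
with open(test_path, 'w') as out_test:
    out_test.writelines(c_test)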
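
The script above splits every line of the file, so the header of an ARFF file would be scattered across the two outputs. The following is a minimal sketch of one way to keep the header intact, not part of the original script. It assumes the header ends at the `@data` line, and it uses the standard library's `random.shuffle` in place of `train_test_split` so that it runs without scikit-learn. The helper name `split_arff_lines`, its `seed` parameter, and the sample data are hypothetical.

```python
import random


def split_arff_lines(lines, test_size=0.6, seed=None):
    """Split ARFF lines into (header, train_rows, test_rows).

    Hypothetical helper: everything up to and including the @data line
    is treated as the header; the remaining non-blank lines are data rows.
    If no @data line is found, every line is treated as a data row.
    """
    data_start = 0
    for idx, line in enumerate(lines):
        if line.strip().lower() == '@data':
            data_start = idx + 1
            break
    header = lines[:data_start]
    rows = [ln for ln in lines[data_start:] if ln.strip()]

    rng = random.Random(seed)  # passing a seed makes the split reproducible
    rng.shuffle(rows)

    # Number of rows sent to the test set, rounded to the nearest integer.
    n_test = int(round(len(rows) * test_size))
    return header, rows[n_test:], rows[:n_test]


# Made-up sample standing in for the contents of an ARFF file.
sample = [
    '@relation demo\n',
    '@attribute x numeric\n',
    '@data\n',
    '1\n', '2\n', '3\n', '4\n', '5\n',
]
header, train_rows, test_rows = split_arff_lines(sample, test_size=0.6, seed=0)
```

To produce the two files, write `header` followed by `train_rows` to the training file and `header` followed by `test_rows` to the test file, for example with `writelines`. Comment lines (starting with `%`) inside the data section are not handled specially here and would be treated as rows.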

  

