python 将数据随机分为训练集和测试集


# -*- coding: utf-8 -*-
"""
Created on Tue Jun 23 15:24:19 2015

@author: hd
"""

from sklearn import cross_validation

c = []
j=0
filename = r'C:\Users\hd\Desktop\bookmarks\bookmarks.arff' 
out_train = open(r'C:\Users\hd\Desktop\bookmarks\train.arff','w')
out_test = open(r'C:\Users\hd\Desktop\bookmarks\test.arff','w')

for line in open(filename):
#    items = line.strip().split()
    c.append(line)
 
c_train,c_test = cross_validation.train_test_split(c,test_size = 0.6)
for i in c_train:
    out_train.write(i)
for i in c_test:
    out_test.write(i)

  


免责声明!

本站转载的文章为个人学习借鉴使用,本站对版权不负任何法律责任。如果侵犯了您的隐私权益,请联系本站邮箱yoyou2525@163.com删除。



 
粤ICP备18138465号  © 2018-2025 CODEPRJ.COM