貼一個做數據清洗時寫的代碼,
做數據處理時,原文件數據在進行處理時需要轉換成一定格式,
原始文件數據:123.txt
1,3,4
2,3,5
1,2,3,5
2,5
利用Python轉換成二維列表:
#!/usr/bin/env python #coding=utf-8 def loadDataSet(): file = open("123.txt", "r") List_row = file.readlines() list_source = [] for list_line in List_row: list_line = list(list_line.strip().split(',')) s = [] for i in list_line: s.append(int(i)) list_source.append(s) return list_source
運行程序后結果如下:
[[1,3,4],[2,3,5],[1,2,3,5],[2,5]]
