The K-Nearest Neighbors Algorithm and Its Python Implementation


The Python code is given below.

knnClassify.py

import operator

import numpy as np


def creatDataSet():
    # Four 2-D training samples and their class labels
    group = np.array([[1.0, 1.1], [1.0, 1.0], [0, 0], [0, 0.1]])
    labels = ['A', 'A', 'B', 'B']
    return group, labels


def classify(inX, dataSet, labels, k):
    numSamples = dataSet.shape[0]
    # Repeat inX once per training sample, then take element-wise differences
    diffMat = np.tile(inX, (numSamples, 1)) - dataSet
    sqDiffMat = diffMat ** 2
    sqDistances = sqDiffMat.sum(axis=1)
    # Euclidean distance from inX to each training sample
    distances = sqDistances ** 0.5
    # Indices of the training samples, nearest first
    sortedDistIndicies = distances.argsort()
    # Count the labels of the k nearest samples
    classCount = {}
    for i in range(k):
        voteILabel = labels[sortedDistIndicies[i]]
        classCount[voteILabel] = classCount.get(voteILabel, 0) + 1
    # Sort (label, count) pairs by count, largest first
    maxLabel = sorted(classCount.items(),
                      key=operator.itemgetter(1), reverse=True)
    return maxLabel[0][0]


if __name__ == "__main__":
    g, l = creatDataSet()
    labelOfinX = classify([1.0, 1.2], g, l, 1)
    print(labelOfinX)
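
The same classifier can also be written more compactly. The following is only a sketch under a few assumptions, not part of the original listing: the name knn_predict is made up for illustration, NumPy broadcasting stands in for tile, and collections.Counter stands in for the hand-built vote dictionary. The sample data is the same as in creatDataSet, and the second query point is an extra example added here.

```python
from collections import Counter

import numpy as np


def knn_predict(query, samples, labels, k):
    """Return the majority label among the k samples closest to query.

    Hypothetical helper for illustration; not part of knnClassify.py.
    """
    samples = np.asarray(samples, dtype=float)
    query = np.asarray(query, dtype=float)
    # Broadcasting subtracts query from every row, so tile() is not needed.
    distances = np.linalg.norm(samples - query, axis=1)
    # Indices of the k smallest distances, nearest first.
    nearest = np.argsort(distances)[:k]
    # Tally the labels of those k neighbours and take the most frequent one.
    votes = Counter(labels[i] for i in nearest)
    return votes.most_common(1)[0][0]


samples = [[1.0, 1.1], [1.0, 1.0], [0.0, 0.0], [0.0, 0.1]]
labels = ["A", "A", "B", "B"]
print(knn_predict([1.0, 1.2], samples, labels, k=1))
print(knn_predict([0.1, 0.1], samples, labels, k=3))
```

With k=3 the second query has two 'B' neighbours and one 'A' neighbour, so the vote goes to 'B'. An odd k is a common choice for two-class data because it avoids tied votes.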

 

