Python Implementations of Common Classification Metrics with sklearn

Reposted article. Published 2017-03-12 20:01. Tags: tensorflow / convolutional neural network / scikit-learn / sklearn.metric / deep learning / sklearn / mnist


1. Abstract

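Common metrics for classification tasks include the confusion matrix, per-class precision, average accuracy, overall accuracy, and F1-score. Python's sklearn.metrics module covers most of the validation metrics commonly used in classification. This article picks several of them and shows code snippets that readers can reuse. A demo based on tensorflow-1.0 and the MNIST dataset is presented, together with its experimental results. Links to further material on the sklearn.metrics module are attached at the end of the article for advanced readers who want to dig deeper.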

2. Evaluation metrics covered in this article

Confusion matrix (CM)
Per-class precision
Per-class recall
Average accuracy (AA)
Overall accuracy (OA)
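To make the relationship between these metrics concrete, the following is a minimal sketch of how each one can be derived from a confusion matrix by hand. It assumes the sklearn convention that rows are true classes and columns are predicted classes. The function name `metrics_from_confusion_matrix` and the 3-class matrix `example_cm` are hypothetical and not part of the original article.

```python
import numpy as np


def metrics_from_confusion_matrix(cm):
    """Derive per-class and overall metrics from a confusion matrix.

    Assumes cm[i, j] counts samples whose true class is i and whose
    predicted class is j (the convention used by sklearn).
    Every class is assumed to appear at least once in both the true
    and the predicted labels, otherwise a division by zero occurs.
    """
    cm = np.asarray(cm, dtype=float)
    true_positives = np.diag(cm)
    # Column sums: number of samples predicted as each class
    per_class_precision = true_positives / cm.sum(axis=0)
    # Row sums: number of samples that truly belong to each class
    per_class_recall = true_positives / cm.sum(axis=1)
    # Correctly classified samples over all samples
    overall_accuracy = true_positives.sum() / cm.sum()
    return per_class_precision, per_class_recall, overall_accuracy


# Hypothetical 3-class confusion matrix, for illustration only
example_cm = [[8, 1, 1],
              [2, 6, 2],
              [0, 0, 10]]

precision, recall, oa = metrics_from_confusion_matrix(example_cm)
print('per-class precision  :', precision)
print('per-class recall     :', recall)
print('mean per-class recall:', recall.mean())
print('overall accuracy     :', oa)
```

Note that "average accuracy" is defined differently across sources: some take the mean of per-class recall, others the mean of per-class precision. It is worth stating which one is meant when reporting AA.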

3. Code snippet

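The code was tested with tensorflow-1.0 and Python 3.5. The tf 1.0 release changed the API considerably, so TensorFlow versions below 1.0 may fail to run it. Because of limited time, other environments have not been tested.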

from sklearn import metrics
import numpy as np

# Run the classification task first, then collect the ground-truth labels
# and the predicted labels as y_true and y_pred.

# Text report with per-class precision, recall, and F1-score
classify_report = metrics.classification_report(y_true, y_pred)
# Rows are true classes, columns are predicted classes
confusion_matrix = metrics.confusion_matrix(y_true, y_pred)
overall_accuracy = metrics.accuracy_score(y_true, y_pred)
# average=None returns one precision value per class
acc_for_each_class = metrics.precision_score(y_true, y_pred, average=None)
# Mean of the per-class precision values above (macro-averaged precision)
average_accuracy = np.mean(acc_for_each_class)
# Same call as overall_accuracy, kept under a second name
score = metrics.accuracy_score(y_true, y_pred)

print('classify_report : \n', classify_report)
print('confusion_matrix : \n', confusion_matrix)
print('acc_for_each_class : \n', acc_for_each_class)
print('average_accuracy: {0:f}'.format(average_accuracy))
print('overall_accuracy: {0:f}'.format(overall_accuracy))
print('score: {0:f}'.format(score))
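The snippet above expects y_true and y_pred to come from a trained classifier. As a quick way to try the same sklearn.metrics calls without a model, here is a minimal self-contained sketch. The label arrays are hypothetical toy data, not results from the MNIST demo, and the per-class recall line is an addition that the original snippet only reports indirectly through classification_report.

```python
import numpy as np
from sklearn import metrics

# Hypothetical labels standing in for real classifier output (3 classes)
y_true = np.array([0, 0, 0, 0, 1, 1, 1, 2, 2, 2])
y_pred = np.array([0, 0, 1, 1, 1, 1, 1, 2, 2, 0])

# Rows are true classes, columns are predicted classes
confusion = metrics.confusion_matrix(y_true, y_pred)
# average=None returns one value per class instead of a single average
precision_per_class = metrics.precision_score(y_true, y_pred, average=None)
recall_per_class = metrics.recall_score(y_true, y_pred, average=None)
overall_accuracy = metrics.accuracy_score(y_true, y_pred)

print('confusion matrix:\n', confusion)
print('per-class precision:', precision_per_class)
print('per-class recall   :', recall_per_class)
print('mean precision: {0:f}'.format(np.mean(precision_per_class)))
print('mean recall   : {0:f}'.format(np.mean(recall_per_class)))
print('overall accuracy: {0:f}'.format(overall_accuracy))
```

On this toy data the mean of per-class precision and the mean of per-class recall differ, which shows why the choice of definition for average accuracy matters.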