编程 Python

利用python中的matplotlib打印混淆矩阵实例

Posted in Python onJune 16, 2020

前面说过混淆矩阵是我们在处理分类问题时，很重要的指标，那么如何更好的把混淆矩阵给打印出来呢，直接做表或者是前端可视化，小编曾经就尝试过用前端（D5）做出来，然后截图，显得不那么好看。。

代码：

import itertools
import matplotlib.pyplot as plt
import numpy as np
 
def plot_confusion_matrix(cm, classes,
       normalize=False,
       title='Confusion matrix',
       cmap=plt.cm.Blues):
 """
 This function prints and plots the confusion matrix.
 Normalization can be applied by setting `normalize=True`.
 """
 if normalize:
  cm = cm.astype('float') / cm.sum(axis=1)[:, np.newaxis]
  print("Normalized confusion matrix")
 else:
  print('Confusion matrix, without normalization')
 
 print(cm)
 
 plt.imshow(cm, interpolation='nearest', cmap=cmap)
 plt.title(title)
 plt.colorbar()
 tick_marks = np.arange(len(classes))
 plt.xticks(tick_marks, classes, rotation=45)
 plt.yticks(tick_marks, classes)
 
 fmt = '.2f' if normalize else 'd'
 thresh = cm.max() / 2.
 for i, j in itertools.product(range(cm.shape[0]), range(cm.shape[1])):
  plt.text(j, i, format(cm[i, j], fmt),
     horizontalalignment="center",
     color="white" if cm[i, j] > thresh else "black")
 
 plt.tight_layout()
 plt.ylabel('True label')
 plt.xlabel('Predicted label')
 plt.show()
 # plt.savefig('confusion_matrix',dpi=200)
 
cnf_matrix = np.array([
 [4101, 2, 5, 24, 0],
 [50, 3930, 6, 14, 5],
 [29, 3, 3973, 4, 0],
 [45, 7, 1, 3878, 119],
 [31, 1, 8, 28, 3936],
])
 
class_names = ['Buildings', 'Farmland', 'Greenbelt', 'Wasteland', 'Water']
 
# plt.figure()
# plot_confusion_matrix(cnf_matrix, classes=class_names,
#      title='Confusion matrix, without normalization')
 
# Plot normalized confusion matrix
plt.figure()
plot_confusion_matrix(cnf_matrix, classes=class_names, normalize=True,
      title='Normalized confusion matrix')

在放矩阵位置，放一下你的混淆矩阵就可以，当然可视化混淆矩阵这一步也可以直接在模型运行中完成。

补充知识：混淆矩阵(Confusion matrix)的原理及使用(scikit-learn 和 tensorflow)

原理

在机器学习中, 混淆矩阵是一个误差矩阵, 常用来可视化地评估监督学习算法的性能. 混淆矩阵大小为 (n_classes, n_classes) 的方阵, 其中 n_classes 表示类的数量. 这个矩阵的每一行表示真实类中的实例, 而每一列表示预测类中的实例 (Tensorflow 和 scikit-learn 采用的实现方式). 也可以是, 每一行表示预测类中的实例, 而每一列表示真实类中的实例 (Confusion matrix From Wikipedia 中的定义). 通过混淆矩阵, 可以很容易看出系统是否会弄混两个类, 这也是混淆矩阵名字的由来.

混淆矩阵是一种特殊类型的列联表(contingency table)或交叉制表(cross tabulation or crosstab). 其有两维 (真实值 "actual" 和预测值 "predicted" ), 这两维都具有相同的类("classes")的集合. 在列联表中, 每个维度和类的组合是一个变量. 列联表以表的形式, 可视化地表示多个变量的频率分布.

使用混淆矩阵( scikit-learn 和 Tensorflow)

下面先介绍在 scikit-learn 和 tensorflow 中计算混淆矩阵的 API (Application Programming Interface) 接口函数, 然后在一个示例中, 使用这两个 API 函数.

scikit-learn 混淆矩阵函数 sklearn.metrics.confusion_matrix API 接口

skearn.metrics.confusion_matrix(
 y_true, # array, Gound true (correct) target values
 y_pred, # array, Estimated targets as returned by a classifier
 labels=None, # array, List of labels to index the matrix.
 sample_weight=None # array-like of shape = [n_samples], Optional sample weights
)

在 scikit-learn 中, 计算混淆矩阵用来评估分类的准确度.

按照定义, 混淆矩阵 C 中的元素 Ci,j 等于真实值为组 i , 而预测为组 j 的观测数(the number of observations). 所以对于二分类任务, 预测结果中, 正确的负例数(true negatives, TN)为 C0,0; 错误的负例数(false negatives, FN)为 C1,0; 真实的正例数为 C1,1; 错误的正例数为 C0,1.

如果 labels 为 None, scikit-learn 会把在出现在 y_true 或 y_pred 中的所有值添加到标记列表 labels 中, 并排好序.

Tensorflow 混淆矩阵函数 tf.confusion_matrix API 接口

tf.confusion_matrix(
 labels, # 1-D Tensor of real labels for the classification task
 predictions, # 1-D Tensor of predictions for a givenclassification
 num_classes=None, # The possible number of labels the classification task can have
 dtype=tf.int32, # Data type of the confusion matrix 
 name=None, # Scope name
 weights=None, # An optional Tensor whose shape matches predictions
)

Tensorflow tf.confusion_matrix 中的 num_classes 参数的含义, 与 scikit-learn sklearn.metrics.confusion_matrix 中的 labels 参数相近, 是与标记有关的参数, 表示类的总个数, 但没有列出具体的标记值. 在 Tensorflow 中一般是以整数作为标记, 如果标记为字符串等非整数类型, 则需先转为整数表示. 如果 num_classes 参数为 None, 则把 labels 和 predictions 中的最大值 + 1, 作为num_classes 参数值.

tf.confusion_matrix 的 weights 参数和 sklearn.metrics.confusion_matrix 的 sample_weight 参数的含义相同, 都是对预测值进行加权, 在此基础上, 计算混淆矩阵单元的值.

使用示例

#!/usr/bin/env python
# -*- coding: utf8 -*-
"""
Author: klchang
Description: 
A simple example for tf.confusion_matrix and sklearn.metrics.confusion_matrix.
Date: 2018.9.8
"""
from __future__ import print_function
import tensorflow as tf
import sklearn.metrics
 
y_true = [1, 2, 4]
y_pred = [2, 2, 4]
 
# Build graph with tf.confusion_matrix operation
sess = tf.InteractiveSession()
op = tf.confusion_matrix(y_true, y_pred)
op2 = tf.confusion_matrix(y_true, y_pred, num_classes=6, dtype=tf.float32, weights=tf.constant([0.3, 0.4, 0.3]))
# Execute the graph
print ("confusion matrix in tensorflow: ")
print ("1. default: \n", op.eval())
print ("2. customed: \n", sess.run(op2))
sess.close()
 
# Use sklearn.metrics.confusion_matrix function
print ("\nconfusion matrix in scikit-learn: ")
print ("1. default: \n", sklearn.metrics.confusion_matrix(y_true, y_pred))
print ("2. customed: \n", sklearn.metrics.confusion_matrix(y_true, y_pred, labels=range(6), sample_weight=[0.3, 0.4, 0.3]))

以上这篇利用python中的matplotlib打印混淆矩阵实例就是小编分享给大家的全部内容了，希望能给大家一个参考，也希望大家多多支持三水点靠木。

利用python中的matplotlib打印混淆矩阵实例

- Author -

Kun Li

声明：登载此文出于传递更多信息之目的，并不意味着赞同其观点或证实其描述。

Python 相关文章推荐

Python下singleton模式的实现方法

Jul 16 Python

进一步探究Python中的正则表达式

Apr 28 Python

Python实现telnet服务器的方法

Jul 10 Python

Django的session中对于用户验证的支持

Jul 23 Python

Python中列表和元组的相关语句和方法讲解

Aug 20 Python

Python获取某一天是星期几的方法示例

Jan 17 Python

python PyTorch参数初始化和Finetune

Feb 11 Python

python儿童学游戏编程知识点总结

Jun 03 Python

Django框架模板用法入门教程

Nov 04 Python

Python如何实现小程序无限求和平均

Feb 18 Python

python打包多类型文件的操作方法

Sep 21 Python

用Python爬取某乎手机APP数据

Jun 15 Python

Python SMTP配置参数并发送邮件

Jun 16 #Python

基于matplotlib中ion()和ioff()的使用详解

Jun 16 #Python

Python数据相关系数矩阵和热力图轻松实现教程

Jun 16 #Python

matplotlib.pyplot.matshow 矩阵可视化实例

Jun 16 #Python

使用python matploblib库绘制准确率,损失率折线图

Jun 16 #Python

为什么称python为胶水语言

Jun 16 #Python

在Keras中利用np.random.shuffle()打乱数据集实例

Jun 15 #Python

You might like

php 保留小数点

2009/04/21 PHP

tp5 实现列表数据根据状态排序

2019/10/18 PHP

jQuery控制图片的hover效果（smartRollover.js）

2012/03/18 Javascript

js原生appendChild的bug解决心得分享

2013/07/01 Javascript

一个封装js代码-----展开收起效果示例

2013/07/03 Javascript

window.open 以post方式传递参数示例代码

2014/02/27 Javascript

JS 打印功能代码可实现打印预览、打印设置等

2014/10/31 Javascript

javascript实现动态改变层大小的方法

2015/05/14 Javascript

JavaScript和HTML DOM的区别与联系及Javascript和DOM的关系

2015/11/15 Javascript

ES6下React组件的写法示例代码

2017/05/04 Javascript

webpack 单独打包指定JS文件的方法

2018/02/22 Javascript

vue-router传参用法详解

2019/01/19 Javascript

vue实现点击按钮“查看详情”弹窗展示详情列表操作

2020/09/09 Javascript

[38:31]完美世界DOTA2联赛PWL S3 Magma vs GXR 第一场 12.13

2020/12/17 DOTA

python实现在无须过多援引的情况下创建字典的方法

2014/09/25 Python

python实现复制整个目录的方法

2015/05/12 Python

Python3.X 线程中信号量的使用方法示例

2017/07/24 Python

Python网络爬虫神器PyQuery的基本使用教程

2018/02/03 Python

python如何停止递归

2020/09/09 Python

Python3读写ini配置文件的示例

2020/11/06 Python

BeautifulSoup中find和find_all的使用详解

2020/12/07 Python

selenium与xpath之获取指定位置的元素的实现

2021/01/26 Python

css3实现一个div设置多张背景图片及background-image属性实例演示

2017/08/10 HTML / CSS

linux比较文件内容的命令是什么

2015/09/23 面试题

新入职员工的自我介绍演讲稿

2014/01/02 职场文书

项目合作计划书

2014/01/09 职场文书

DIY手工制作经营店创业计划书

2014/02/01 职场文书

六年级语文下册教学计划

2015/01/22 职场文书

事业单位工作人员年度考核个人总结

2015/02/12 职场文书

学校计划生育责任书

2015/05/09 职场文书

2015年学校办公室主任工作总结

2015/07/20 职场文书

公司车辆管理制度

2015/08/04 职场文书

2016年学生会感恩节活动总结

2016/04/01 职场文书

html5调用摄像头截图功能

2022/01/18 Javascript

java代码实现空间切割

2022/01/18 Java/Android

国际最新研究在陨石中发现DNA主要成分或由陨石带来地球

2022/04/29 数码科技