编程 Python

pytorch 把MNIST数据集转换成图片和txt的方法

Posted in Python onMay 20, 2018

本文介绍了pytorch 把MNIST数据集转换成图片和txt的方法，分享给大家，具体如下：

1.下载Mnist 数据集

import os
# third-party library
import torch
import torch.nn as nn
from torch.autograd import Variable
import torch.utils.data as Data
import torchvision
import matplotlib.pyplot as plt 
# torch.manual_seed(1)  # reproducible
DOWNLOAD_MNIST = False
 
# Mnist digits dataset
if not(os.path.exists('./mnist/')) or not os.listdir('./mnist/'):
  # not mnist dir or mnist is empyt dir
  DOWNLOAD_MNIST = True
 
train_data = torchvision.datasets.MNIST(
  root='./mnist/',
  train=True,                   # this is training data
  transform=torchvision.transforms.ToTensor(),  # Converts a PIL.Image or numpy.ndarray to
                          # torch.FloatTensor of shape (C x H x W) and normalize in the range [0.0, 1.0]
  download=DOWNLOAD_MNIST,
)

下载下来的其实可以直接用了，但是我们这边想把它们转换成图片和txt，这样好看些，为后面用自己的图片和txt作为准备

2. 保存为图片和txt

import os
from skimage import io
import torchvision.datasets.mnist as mnist
import numpy 
root = "./mnist/raw/"
train_set = (
  mnist.read_image_file(os.path.join(root, 'train-images-idx3-ubyte')),
  mnist.read_label_file(os.path.join(root, 'train-labels-idx1-ubyte'))
)
 
test_set = (
  mnist.read_image_file(os.path.join(root,'t10k-images-idx3-ubyte')),
  mnist.read_label_file(os.path.join(root,'t10k-labels-idx1-ubyte'))
)
 
print("train set:", train_set[0].size())
print("test set:", test_set[0].size())
 
def convert_to_img(train=True):
  if(train):
    f = open(root + 'train.txt', 'w')
    data_path = root + '/train/'
    if(not os.path.exists(data_path)):
      os.makedirs(data_path)
    for i, (img, label) in enumerate(zip(train_set[0], train_set[1])):
      img_path = data_path + str(i) + '.jpg'
      io.imsave(img_path, img.numpy())
      int_label = str(label).replace('tensor(', '')
      int_label = int_label.replace(')', '')
      f.write(img_path + ' ' + str(int_label) + '\n')
    f.close()
  else:
    f = open(root + 'test.txt', 'w')
    data_path = root + '/test/'
    if (not os.path.exists(data_path)):
      os.makedirs(data_path)
    for i, (img, label) in enumerate(zip(test_set[0], test_set[1])):
      img_path = data_path + str(i) + '.jpg'
      io.imsave(img_path, img.numpy())
      int_label = str(label).replace('tensor(', '')
      int_label = int_label.replace(')', '')
      f.write(img_path + ' ' + str(int_label) + '\n')
    f.close()
 
convert_to_img(True)
convert_to_img(False)

以上就是本文的全部内容，希望对大家的学习有所帮助，也希望大家多多支持三水点靠木。

pytorch 把MNIST数据集转换成图片和txt的方法

- Author -

瓦力冫

声明：登载此文出于传递更多信息之目的，并不意味着赞同其观点或证实其描述。

Python 相关文章推荐

深入理解Python中装饰器的用法

Jun 28 Python

python 文件操作api(文件操作函数)

Aug 28 Python

Python基础练习之用户登录实现代码分享

Nov 08 Python

Python向MySQL批量插数据的实例讲解

Mar 31 Python

matlab中实现矩阵删除一行或一列的方法

Apr 04 Python

利用python实现在微信群刷屏的方法

Feb 21 Python

python 爬虫百度地图的信息界面的实现方法

Oct 27 Python

TensorFlow实现checkpoint文件转换为pb文件

Feb 10 Python

django rest framework serializer返回时间自动格式化方法

Mar 31 Python

python 判断一组数据是否符合正态分布

Sep 23 Python

python 逐步回归算法

Apr 06 Python

使用Python通过企业微信应用给企业成员发消息

Apr 18 Python

Python安装lz4-0.10.1遇到的坑

May 20 #Python

Python requests发送post请求的一些疑点

May 20 #Python

python中virtualenvwrapper安装与使用

May 20 #Python

django静态文件加载的方法

May 20 #Python

django中静态文件配置static的方法

May 20 #Python

Python中跳台阶、变态跳台阶与矩形覆盖问题的解决方法

May 19 #Python

Python利用公共键如何对字典列表进行排序详解

May 19 #Python