编程 Python

Tensorflow使用tfrecord输入数据格式

Posted in Python onJune 19, 2018

Tensorflow 提供了一种统一的格式来存储数据，这个格式就是TFRecord,上一篇文章中所提到的方法当数据的来源更复杂，每个样例中的信息更丰富的时候就很难有效的记录输入数据中的信息了，于是Tensorflow提供了TFRecord来统一存储数据，接下来我们就来介绍如何使用TFRecord来同意输入数据的格式。

1. TFRecord格式介绍

TFRecord文件中的数据是通过tf.train.Example Protocol Buffer的格式存储的，下面是tf.train.Example的定义

message Example {
 Features features = 1;
};

message Features{
 map<string,Feature> featrue = 1;
};

message Feature{
  oneof kind{
    BytesList bytes_list = 1;
    FloatList float_list = 2;
    Int64List int64_list = 3;
  }
};

从上述代码可以看到，ft.train.Example 的数据结构相对简洁。tf.train.Example中包含了一个从属性名称到取值的字典，其中属性名称为一个字符串，属性的取值可以为字符串（BytesList ），实数列表（FloatList ）或整数列表（Int64List ）。例如我们可以将解码前的图片作为字符串，图像对应的类别标号作为整数列表。

2. 将自己的数据转化为TFRecord格式

准备数据

在上一篇中，我们为了像伟大的MNIST致敬，所以选择图像的前缀来进行不同类别的分类依据，但是大多数的情况下，在进行分类任务的过程中，不同的类别都会放在不同的文件夹下，而且类别的个数往往浮动性又很大，所以针对这样的情况，我们现在利用不同类别在不同文件夹中的图像来生成TFRecord.

我们在Iris&Contact这个文件夹下有两个文件夹，分别为iris,contact。对于每个文件夹中存放的是对应的图片

转换数据

数据准备好以后，就开始准备生成TFRecord,具体代码如下：

import os 
import tensorflow as tf 
from PIL import Image 
import matplotlib.pyplot as plt 

cwd='/home/ruyiwei/Documents/Iris&Contact/'
classes={'iris','contact'} 
writer= tf.python_io.TFRecordWriter("iris_contact.tfrecords") 

for index,name in enumerate(classes):
  class_path=cwd+name+'/'
  for img_name in os.listdir(class_path): 
    img_path=class_path+img_name 
    img=Image.open(img_path)
    img= img.resize((512,80))
    img_raw=img.tobytes()
    #plt.imshow(img) # if you want to check you image,please delete '#'
    #plt.show()
    example = tf.train.Example(features=tf.train.Features(feature={
      "label": tf.train.Feature(int64_list=tf.train.Int64List(value=[index])),
      'img_raw': tf.train.Feature(bytes_list=tf.train.BytesList(value=[img_raw]))
    })) 
    writer.write(example.SerializeToString()) 

writer.close()

3. Tensorflow从TFRecord中读取数据

def read_and_decode(filename): # read iris_contact.tfrecords
  filename_queue = tf.train.string_input_producer([filename])# create a queue

  reader = tf.TFRecordReader()
  _, serialized_example = reader.read(filename_queue)#return file_name and file
  features = tf.parse_single_example(serialized_example,
                    features={
                      'label': tf.FixedLenFeature([], tf.int64),
                      'img_raw' : tf.FixedLenFeature([], tf.string),
                    })#return image and label

  img = tf.decode_raw(features['img_raw'], tf.uint8)
  img = tf.reshape(img, [512, 80, 3]) #reshape image to 512*80*3
  img = tf.cast(img, tf.float32) * (1. / 255) - 0.5 #throw img tensor
  label = tf.cast(features['label'], tf.int32) #throw label tensor
  return img, label

4. 将TFRecord中的数据保存为图片

filename_queue = tf.train.string_input_producer(["iris_contact.tfrecords"]) 
reader = tf.TFRecordReader()
_, serialized_example = reader.read(filename_queue)  #return file and file_name
features = tf.parse_single_example(serialized_example,
                  features={
                    'label': tf.FixedLenFeature([], tf.int64),
                    'img_raw' : tf.FixedLenFeature([], tf.string),
                  }) 
image = tf.decode_raw(features['img_raw'], tf.uint8)
image = tf.reshape(image, [512, 80, 3])
label = tf.cast(features['label'], tf.int32)
with tf.Session() as sess: 
  init_op = tf.initialize_all_variables()
  sess.run(init_op)
  coord=tf.train.Coordinator()
  threads= tf.train.start_queue_runners(coord=coord)
  for i in range(20):
    example, l = sess.run([image,label])#take out image and label
    img=Image.fromarray(example, 'RGB')
    img.save(cwd+str(i)+'_''Label_'+str(l)+'.jpg')#save image
    print(example, l)
  coord.request_stop()
  coord.join(threads)

以上就是本文的全部内容，希望对大家的学习有所帮助，也希望大家多多支持三水点靠木。

Tensorflow使用tfrecord输入数据格式

- Author -

ruyiweicas

声明：登载此文出于传递更多信息之目的，并不意味着赞同其观点或证实其描述。

Python 相关文章推荐

pycharm 使用心得（一）安装和首次使用

Jun 05 Python

基于python代码实现简易滤除数字的方法

Jul 17 Python

python使用scrapy发送post请求的坑

Sep 04 Python

Python实现分段线性插值

Dec 17 Python

浅谈PySpark SQL 相关知识介绍

Jun 14 Python

使用Python将字符串转换为格式化的日期时间字符串

Sep 01 Python

6行Python代码实现进度条效果（Progress、tqdm、alive-progress和PySimpleGUI库）

Jan 06 Python

解决TensorFlow模型恢复报错的问题

Feb 06 Python

在Keras中利用np.random.shuffle()打乱数据集实例

Jun 15 Python

Python JSON常用编解码方法代码实例

Sep 05 Python

mac系统下安装pycharm、永久激活、中文汉化详细教程

Nov 24 Python

Python多线程 Queue 模块常见用法

Jul 04 Python

Tensorflow 训练自己的数据集将数据直接导入到内存

Jun 19 #Python

python如何爬取个性签名

Jun 19 #Python

详解TensorFlow查看ckpt中变量的几种方法

Jun 19 #Python

TensorFlow 滑动平均的示例代码

Jun 19 #Python

python3个性签名设计实现代码

Jun 19 #Python

TensorFlow 模型载入方法汇总(小结)

Jun 19 #Python

python3爬虫之设计签名小程序

Jun 19 #Python

You might like

PHP输出日历表代码实例

2015/03/27 PHP

PHP获取数组最大值下标的方法

2015/05/12 PHP

PHP与以太坊交互详解

2018/08/24 PHP

javascript学习笔记(二十) 获得和设置元素的特性（属性）

2012/06/20 Javascript

检查输入的是否是数字使用keyCode配合onkeypress事件

2014/01/23 Javascript

js中自定义方法实现停留几秒sleep

2014/07/11 Javascript

Javascript学习指南

2014/12/01 Javascript

浅谈JavaScript变量的自动转换和语句

2016/06/12 Javascript

Node.js连接postgreSQL并进行数据操作

2016/12/18 Javascript

ES6新特性一： let和const命令详解

2017/04/20 Javascript

Node.js 8 中的 util.promisify的详解

2017/06/12 Javascript

vue router学习之动态路由和嵌套路由详解

2017/09/21 Javascript

react-native中ListView组件点击跳转的方法示例

2017/09/30 Javascript

深入浅析vue组件间事件传递

2017/12/29 Javascript

解决vue移动端适配问题

2018/12/12 Javascript

微信小程序学习总结（四）事件与冒泡实例分析

2020/06/04 Javascript

利用Python的Twisted框架实现webshell密码扫描器的教程

2015/04/16 Python

Python中urllib+urllib2+cookielib模块编写爬虫实战

2016/01/20 Python

windows下python安装paramiko模块和pycrypto模块（简单三步）

2017/07/06 Python

Python实现的凯撒密码算法示例

2018/04/12 Python

用django-allauth实现第三方登录的示例代码

2019/06/24 Python

python实现mean-shift聚类算法

2020/06/10 Python

简单掌握CSS3中resize属性的用法

2016/04/01 HTML / CSS

奢华时尚的独特视角：La Garçonne

2018/06/07 全球购物

工商管理专业实习大学生自我鉴定

2013/09/19 职场文书

人力资源管理毕业生自荐信

2013/11/21 职场文书

物流管理专业职业生涯规划书

2014/01/06 职场文书

抗洪救灾先进集体事迹材料

2014/05/26 职场文书

企业安全标语

2014/06/07 职场文书

董事长秘书工作职责

2014/06/10 职场文书

出国签证在职证明

2014/09/20 职场文书

领导班子三严三实对照检查材料

2014/09/25 职场文书

婚宴邀请函

2015/01/30 职场文书

土建技术员岗位职责

2015/04/11 职场文书

MySQL update set 和 and的区别

2021/05/08 MySQL

SQL Server实现分页方法介绍

2022/03/16 SQL Server