编程 Python

对tensorflow中tf.nn.conv1d和layers.conv1d的区别详解

Posted in Python onFebruary 11, 2020

在用tensorflow做一维的卷积神经网络的时候会遇到tf.nn.conv1d和layers.conv1d这两个函数，但是这两个函数有什么区别呢，通过计算得到一些规律。

1.关于tf.nn.conv1d的解释，以下是Tensor Flow中关于tf.nn.conv1d的API注解：

Computes a 1-D convolution given 3-D input and filter tensors.

Given an input tensor of shape [batch, in_width, in_channels] if data_format is "NHWC", or [batch, in_channels, in_width] if data_format is "NCHW", and a filter / kernel tensor of shape [filter_width, in_channels, out_channels], this op reshapes the arguments to pass them to conv2d to perform the equivalent convolution operation.

Internally, this op reshapes the input tensors and invokes `tf.nn.conv2d`. For example, if `data_format` does not start with "NC", a tensor of shape [batch, in_width, in_channels] is reshaped to [batch, 1, in_width, in_channels], and the filter is reshaped to [1, filter_width, in_channels, out_channels]. The result is then reshaped back to [batch, out_width, out_channels] whereoutwidthisafunctionofthestrideandpaddingasinconv2dwhereoutwidthisafunctionofthestrideandpaddingasinconv2d and returned to the caller.

Args: value: A 3D `Tensor`. Must be of type `float32` or `float64`. filters: A 3D `Tensor`. Must have the same type as `input`. stride: An `integer`. The number of entries by which the filter is moved right at each step. padding: 'SAME' or 'VALID' use_cudnn_on_gpu: An optional `bool`. Defaults to `True`. data_format: An optional `string` from `"NHWC", "NCHW"`. Defaults to `"NHWC"`, the data is stored in the order of [batch, in_width, in_channels]. The `"NCHW"` format stores data as [batch, in_channels, in_width]. name: A name for the operation (optional).

Returns:

A `Tensor`. Has the same type as input.

Raises:

ValueError: if `data_format` is invalid.

什么意思呢？就是说conv1d的参数含义：(以NHWC格式为例，即，通道维在最后)

1、value：在注释中，value的格式为：[batch, in_width, in_channels]，batch为样本维，表示多少个样本，in_width为宽度维，表示样本的宽度，in_channels维通道维，表示样本有多少个通道。事实上，也可以把格式看作如下:[batch, 行数, 列数]，把每一个样本看作一个平铺开的二维数组。这样的话可以方便理解。

2、filters：在注释中，filters的格式为：[filter_width, in_channels, out_channels]。按照value的第二种看法，filter_width可以看作每次与value进行卷积的行数，in_channels表示value一共有多少列（与value中的in_channels相对应）。out_channels表示输出通道，可以理解为一共有多少个卷积核，即卷积核的数目。

3、stride：一个整数，表示步长，每次（向下）移动的距离（TensorFlow中解释是向右移动的距离，这里可以看作向下移动的距离）。

4、padding：同conv2d，value是否需要在下方填补0。

5、name：名称。可省略。

首先从参数列表可以看出value指的输入的数据，stride就是卷积的步长，这里我们最有疑问的就是filters这个参数，那么我们对filter进行简单的说明。从上面可以看到filters的格式为：[filter_width, in_channels, out_channels],这是一个数组的维度，对应的是卷积核的大小，输入的channel的格式，和卷积核的个数，下面我们用例子说明问题：

import tensorflow as tf
import numpy as np
 
 
if __name__ == '__main__':
  inputs = tf.constant(np.arange(1, 6, dtype=np.float32), shape=[1, 5, 1])
  w = np.array([1, 2], dtype=np.float32).reshape([2, 1, 1])
  # filter width, filter channels and out channels(number of kernels)
  cov1 = tf.nn.conv1d(inputs, w, stride=1, padding='VALID')
  with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    out = sess.run(cov1)
    print(out)

其输出为：

[[[ 5.],
    [ 8.],
    [11.],
    [14.]]]

我们分析一下，输入的数据为[[[1],[2],[3],[4],[5]]],有5个特征，分别对应的数值为1，2，3，4，5，那么经过卷积的结果为5，8，11，14，那么这个结果是怎么来的呢，我们根据卷积的计算，可以得到5 = 1*1 + 2*2， 8=2*1+ 3*2， 11 = 3*1+4*2， 14=4*1+5*2，也就是W1=1， W2=2，正好和我们先面filters设置的数值相等，

w = np.array([1, 2], dtype=np.float32).reshape([2, 1, 1])

所以可以看到这个filtes设置的是是卷积核矩阵的，换句话说，卷积核矩阵我们是可以设置的。

2. 1.关于tf.layers.conv1d,函数的定义如下

tf.layers.conv1d(
 
inputs,
 
filters,
 
kernel_size,
 
strides=1,
 
padding='valid',
 
data_format='channels_last',
 
dilation_rate=1,
 
activation=None,
 
use_bias=True,
 
kernel_initializer=None,
 
bias_initializer=tf.zeros_initializer(),
 
kernel_regularizer=None,
 
bias_regularizer=None,
 
activity_regularizer=None,
 
kernel_constraint=None,
 
bias_constraint=None,
 
trainable=True,
 
name=None,
 
reuse=None
 
)

比较重要的几个参数是inputs, filters, kernel_size，下面分别说明

inputs : 输入tensor，维度(None, a, b) 是一个三维的tensor

None ：一般是填充样本的个数，batch_size

a ：句子中的词数或者字数

b : 字或者词的向量维度

filters : 过滤器的个数

kernel_size : 卷积核的大小，卷积核其实应该是一个二维的，这里只需要指定一维，是因为卷积核的第二维与输入的词向量维度是一致的，因为对于句子而言，卷积的移动方向只能是沿着词的方向，即只能在列维度移动。一个例子：

import tensorflow as tf
import numpy as np
 
 
if __name__ == '__main__':
  inputs = tf.constant(np.arange(1, 6, dtype=np.float32), shape=[1, 5, 1])
  cov2 = tf.layers.conv1d(inputs, filters=1, kernel_size=2, strides=1, padding='VALID')
  with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    out = sess.run(cov2)
    print(out)

输出结果：

[[[-1.9953331]
 [-3.5520997]
 [-5.108866 ]
 [-6.6656327]]]

也许你得到的结果和我得到的结果不同，因为在这个函数里面只是设置了卷积核的尺寸和步长，没有设置具体的卷积核矩阵，所以这个卷积核矩阵是随机生成的，就会出现可能运行上面的程序出现不同结果的情况。

以上这篇对tensorflow中tf.nn.conv1d和layers.conv1d的区别详解就是小编分享给大家的全部内容了，希望能给大家一个参考，也希望大家多多支持三水点靠木。

对tensorflow中tf.nn.conv1d和layers.conv1d的区别详解

- Author -

孤独的猿行客

声明：登载此文出于传递更多信息之目的，并不意味着赞同其观点或证实其描述。

Python 相关文章推荐

paramiko模块安装和使用(远程登录服务器)

Jan 27 Python

初步解析Python中的yield函数的用法

Apr 03 Python

Python文件及目录操作实例详解

Jun 04 Python

Python JSON格式数据的提取和保存的实现

Mar 22 Python

Numpy之reshape()使用详解

Dec 26 Python

Pytorch Tensor 输出为txt和mat格式方式

Jan 03 Python

Python Numpy库常见用法入门教程

Jan 16 Python

Python实现遗传算法(二进制编码)求函数最优值方式

Feb 11 Python

最新PyCharm从安装到PyCharm永久激活再到PyCharm官方中文汉化详细教程

Nov 17 Python

使用numpy nonzero 找出非0元素

May 14 Python

python编写五子棋游戏

May 25 Python

常用的Python代码调试工具总结

Jun 23 Python

python中文分词库jieba使用方法详解

Feb 11 #Python

Transpose 数组行列转置的限制方式

Feb 11 #Python

Tensorflow:转置函数 transpose的使用详解

Feb 11 #Python

tensorflow多维张量计算实例

Feb 11 #Python

python误差棒图errorbar()函数实例解析

Feb 11 #Python

解决Python3.8用pip安装turtle-0.0.2出现错误问题

Feb 11 #Python

python scatter函数用法实例详解

Feb 11 #Python

You might like

hessian 在PHP中的使用介绍

2010/12/13 PHP

php使用curl出现Expect:100-continue解决方法

2015/03/03 PHP

php分页原理分页代码分页类制作教程

2016/09/23 PHP

php简单计算权重的方法示例【适合抽奖类应用】

2019/06/10 PHP

详解Laravel服务容器的绑定与解析

2019/11/05 PHP

javascript实现div的拖动并调整大小类似qq空间个性编辑模块

2012/12/12 Javascript

JQuery中form验证出错信息的查看方法

2013/10/08 Javascript

jQuery $命名冲突解决方案汇总

2014/11/13 Javascript

jQuery可见性过滤选择器用法示例

2016/09/09 Javascript

Vue.js 2.0 和 React、Augular等其他前端框架大比拼

2016/10/08 Javascript

jQuery Validate表单验证插件的基本使用方法及功能拓展

2017/01/04 Javascript

你不知道的 javascript【推荐】

2017/01/08 Javascript

QRCode.js：基于JQuery的生成二维码JS库的使用

2017/06/23 jQuery

vue自动化路由的实现代码

2019/09/30 Javascript

JavaScript canvas动画实现时钟效果

2020/02/10 Javascript

Vue中computed和watch有哪些区别

2020/12/19 Vue.js

python实现爬虫统计学校BBS男女比例之多线程爬虫（二）

2015/12/31 Python

Python3.x对JSON的一些操作示例

2017/09/01 Python

详解Python核心编程中的浅拷贝与深拷贝

2018/01/07 Python

Python寻找路径和查找文件路径的示例

2019/07/10 Python

PyCharm汉化安装及永久激活详细教程(靠谱)

2020/01/16 Python

python 元组和列表的区别

2020/12/30 Python

CSS3字体效果的设置方法小结

2016/06/13 HTML / CSS

CSS中的字体大小设置属性总结

2016/05/24 HTML / CSS

用HTML5的canvas实现一个炫酷时钟效果

2016/05/20 HTML / CSS

ECCO爱步美国官网：来自丹麦的鞋履品牌

2016/11/23 全球购物

Net-A-Porter美国官网：全球时尚奢侈品名站

2017/02/11 全球购物

Supersmart英国：欧洲市场首批食品补充剂供应商之一

2018/05/05 全球购物

PatPat阿根廷：妈妈们的购物平台

2019/05/30 全球购物

数字漫画：comiXology

2020/06/13 全球购物

可以使用抽象函数重写基类中的虚函数吗

2013/06/02 面试题

中学生秋季运动会广播稿

2014/09/21 职场文书

求职信格式范文

2015/03/19 职场文书

Android开发EditText禁止输入监听及InputFilter字符过滤

2022/06/10 Java/Android

使用HBuilder制作一个简单的HTML5网页

2022/07/07 HTML / CSS