阅读(12k) 书签赞(0)

TensorFlow函数：tf.sparse_to_dense

2020-10-31 14:53 更新

tf.sparse_to_dense函数

sparse_to_dense ( 
    sparse_indices , 
    output_shape , 
    sparse_values , 
    default_value = 0 , 
    validate_indices = True , 
    name = None 
)

定义在：tensorflow/python/ops/sparse_ops.py.

请参阅指南：稀疏张量>转变

将稀疏表示形式转换为稠密张量.

构建一个 dense 形状 output_shape 的数组,如下述所示：

# If sparse_indices is scalar
dense[i] = (i == sparse_indices ? sparse_values : default_value)

# If sparse_indices is a vector, then for each i
dense[sparse_indices[i]] = sparse_values[i]

# If sparse_indices is an n by d matrix, then for each i in [0, n)
dense[sparse_indices[i][0], ..., sparse_indices[i][d-1]] = sparse_values[i]

所有其他值的 dense 都设置为 default_value.如果 sparse_values 是标量,则所有稀疏索引都设置为该单个值.

索引应按字典顺序排序,索引不得有任何重复.如果 validate_indices 为 True,则在执行期间检查这些属性.

函数参数：

sparse_indices：表示类型为 int32 或 int64 的 0-d、1-d 或 2- d Tensor；sparse_indices[i] 包含完整索引,这是 sparse_values[i] 将放置的位置.
output_shape：与 sparse_indices 具有相同类型的 1-D Tensor；密集输出张量的形状.
sparse_values：0-D 或1-D Tensor；对应于每行的 sparse_indices 值,或者将用于所有稀疏索引的标量值.
default_value：与 sparse_values 具有相同类型的 0-D Tensor；为未在 sparse_indices 中指定的索引设置值；默认为零.
validate_indices：一个布尔值；如果为 True,则检查索引以确保按照词典顺序排序并且没有重复.
name：操作的名称(可选).

函数返回值：

tf.sparse_to_dense函数返回形状为 output_shape 的密集 Tensor,它与 sparse_values 具有相同的类型.

介绍几个方法的用法：

tf.sparse_to_dense(sparse_indices, output_shape, sparse_values, default_value, name=None)

除去name参数用以指定该操作的 name，与方法有关的一共四个参数：

第一个参数 sparse_indices：稀疏矩阵中那些个别元素对应的索引值。

有三种情况：
sparse_indices 是个数，那么它只能指定一维矩阵的某一个元素
sparse_indices 是个向量，那么它可以指定一维矩阵的多个元素
sparse_indices 是个矩阵，那么它可以指定二维矩阵的多个元素
第二个参数 output_shape：输出的稀疏矩阵的 shape
第三个参数 sparse_values：个别元素的值。
分为两种情况：
sparse_values 是个数：所有索引指定的位置都用这个数
sparse_values 是个向量：输出矩阵的某一行向量里某一行对应的数（所以这里向量的长度应该和输出矩阵的行数对应，不然报错）
第四个参数 default_value：未指定元素的默认值，一般如果是稀疏矩阵的话就是0了举一个例子：
在 mnist 里面有一个把数字标签转化成 onehot 标签的操作，所谓 onehot 标签就是：
如果标签是6那么对应 onehot 就是[ 0. 0. 0. 0. 0. 0. 1. 0. 0. 0.]
如果标签是1那么对应 onehot 就是[ 0. 1. 0. 0. 0. 0. 0. 0. 0. 0.]
如果标签是0那么对应 onehot 就是[ 1. 0. 0. 0. 0. 0. 0. 0. 0. 0.]
就是把标签变为适用于神经网络输出的形式。

BATCHSIZE=6
label=tf.expand_dims(tf.constant([0,2,3,6,7,9]),1)

假设一个 batch 有6个样本，每个样本的 label 是0，2，3，6，7，9

index=tf.expand_dims(tf.range(0, BATCHSIZE),1)生成一个index表明一个batch里面每个样本对应的序号

concated = tf.concat(1, [index, label])

最后把他们两个矩阵进行连接，连接以后的矩阵是这样的

[[0 0][1 2]

[2 3]

[3 6]

[4 7]

[5 9]]

onehot_labels = tf.sparse_to_dense(concated, tf.pack([BATCHSIZE,10]), 1.0, 0.0)

最后一步，调用 tf.sparse_to_dense 输出一个 onehot 标签的矩阵，输出的 shape 就是行数为 BATCHSIZE，列数为10的矩阵，指定元素值为1.0，其余元素值为0.0

程序源码：

import tensorflow as tf
import numpy
BATCHSIZE=6
label=tf.expand_dims(tf.constant([0,2,3,6,7,9]),1)
index=tf.expand_dims(tf.range(0, BATCHSIZE),1)
#use a matrix
concated = tf.concat(1, [index, label])
onehot_labels = tf.sparse_to_dense(concated, tf.pack([BATCHSIZE,10]), 1.0, 0.0)
#use a vector
concated2=tf.constant([1,3,4])
#onehot_labels2 = tf.sparse_to_dense(concated2, tf.pack([BATCHSIZE,10]), 1.0, 0.0)#cant use ,because output_shape is not a vector
onehot_labels2 = tf.sparse_to_dense(concated2, tf.pack([10]), 1.0, 0.0)#can use
#use a scalar
concated3=tf.constant(5)
onehot_labels3 = tf.sparse_to_dense(concated3, tf.pack([10]), 1.0, 0.0)
with tf.Session() as sess:
result1=sess.run(onehot_labels)
result2 = sess.run(onehot_labels2)
result3 = sess.run(onehot_labels3)
print ("This is result1:")
print (result1)
print ("This is result2:")
print (result2)
print ("This is result3:")
print (result3)

输出结果：

 This is result1: 
[[ 1. 0. 0. 0. 0. 0. 0. 0. 0. 0.]
 [ 0. 0. 1. 0. 0. 0. 0. 0. 0. 0.]
 [ 0. 0. 0. 1. 0. 0. 0. 0. 0. 0.] 
[ 0. 0. 0. 0. 0. 0. 1. 0. 0. 0.]
 [ 0. 0. 0. 0. 0. 0. 0. 1. 0. 0.]
 [ 0. 0. 0. 0. 0. 0. 0. 0. 0. 1.]] 
This is result2:
 [ 0. 1. 0. 1. 1. 0. 0. 0. 0. 0.]
 This is result3: 
[ 0. 0. 0. 0. 0. 1. 0. 0. 0. 0.]

← TensorFlow函数：tf.sparse_tensor_to_dense

TensorFlow函数：tf.sparse_to_indicator →