softmax¶

paddle.sparse.nn.functional. softmax ( x, axis=- 1, name=None ) [源代码] ¶

稀疏 softmax 激活函数，要求输入 x 为 SparseCooTensor 或 SparseCsrTensor 。

当输入 x 为 SparseCsrTensor 时，仅支持 axis=-1，是由于 Csr 稀疏存储格式，更适合按行读取数据。

如果将 x 从稀疏矩阵转换为稠密矩阵， \(i\) 代表行数， \(j\) 代表列数，且 axis=-1 时有如下公式：

\[softmax_ij = \frac{\exp(x_ij - max_j(x_ij))}{\sum_j(exp(x_ij - max_j(x_ij))}\]

其中，\(x\) 为输入的 Tensor。

参数¶

x (Tensor) - 输入的稀疏 Tensor，可以是 SparseCooTensor 或 SparseCsrTensor，数据类型为 float32、float64。

axis (int, 可选) - 指定对输入 SparseTensor 计算 softmax 的轴。对于 SparseCsrTensor，仅支持 axis=-1。默认值：-1。

name (str，可选) - 具体用法请参见 Name，一般无需设置，默认值为 None。

返回¶

多维稀疏 Tensor, 数据类型和稀疏格式与 x 相同。

代码示例¶

import paddle
import numpy as np
paddle.seed(100)

mask = np.random.rand(3, 4) < 0.5
np_x = np.random.rand(3, 4) * mask
# [[0.         0.         0.96823406 0.19722934]
#  [0.94373937 0.         0.02060066 0.71456372]
#  [0.         0.         0.         0.98275049]]

csr = paddle.to_tensor(np_x).to_sparse_csr()
# Tensor(shape=[3, 4], dtype=paddle.float64, place=Place(gpu:0), stop_gradient=True,
#        crows=[0, 2, 5, 6],
#        cols=[2, 3, 0, 2, 3, 3],
#        values=[0.96823406, 0.19722934, 0.94373937, 0.02060066, 0.71456372,
#                0.98275049])

out = paddle.sparse.nn.functional.softmax(csr)
# Tensor(shape=[3, 4], dtype=paddle.float64, place=Place(gpu:0), stop_gradient=True,
#        crows=[0, 2, 5, 6],
#        cols=[2, 3, 0, 2, 3, 3],
#        values=[0.68373820, 0.31626180, 0.45610887, 0.18119845, 0.36269269,
#                1.        ])