psroi_pool¶

paddle.fluid.layers. psroi_pool ( input, rois, output_channels, spatial_scale, pooled_height, pooled_width, name=None ) [源代码] ¶

注意 rois必须为2维LoDTensor，lod_level为1

该OP执行PSROIPooling运算，是位置敏感的感兴趣区域池化方法（Position sensitive region of interest pooling，也称为PSROIPooling）。输入input是位置敏感的评分图，输入rois是感兴趣区域的位置坐标。PSROIPooling不同于普通ROIPooling的地方在于，输入input特征图的不同通道会跟输出特征图上的位置区域相关联，该方法是在R-FCN模型中首次提出来的，更多详细信息请参阅 https://arxiv.org/abs/1605.06409。

样例：

Given:
  input.shape = [2, 490, 28, 28]
  rois.shape = [5, 4], rois.lod = [[3, 2]]
  output_channels = 10
  pooled_height = 7
  pooled_width = 7

Return:
  out.shape = [5, 10, 7, 7], out.lod = [[3, 2]]

参数¶

input (Variable) - 输入特征图，4-D Tensor，格式是NCHW。其中N是batch_size，C是输入通道的数量，H是输入特征图的高度，W是特征图宽度。数据类型是float32或者float64

rois (Variable) - 感兴趣区域，2-D LoDTensor，形状为(num_rois, 4)，lod_level为1。形式如[x1, y1, x2, y2], ...]，其中(x1, y1)为左上角坐标，(x2, y2)为右下角坐标。数据类型与input相同

output_channels (int) - 输出特征图的通道数。对于共C个种类的图像分类任务，output_channels应该是 (C + 1)，其中1代表背景

spatial_scale (float) - 空间跨度因子，用于将 rois 中的坐标从其输入尺寸按比例映射到 input 特征图的尺寸

pooled_height (int) - 池化输出的高度

pooled_width (int) - 池化输出的宽度

name (str，可选) - 具体用法请参见 Name，一般无需设置，默认值为 None。

返回¶

经过PSROIPooling之后的结果，形状为(num_rois，output_channels，pooled_height，pooled_width) 的4维LoDTensor，lod_level为1，数据类型与input相同，与rois具有相同的lod信息。

返回类型¶

Variable

代码示例¶

import paddle.fluid as fluid
import paddle
paddle.enable_static()
x = fluid.data(name='x', shape=[100, 490, 28, 28], dtype='float32')
rois = fluid.data(name='rois', shape=[None, 4], lod_level=1, dtype='float32')
pool_out = fluid.layers.psroi_pool(x, rois, 10, 1.0, 7, 7)