RoIAlign

class paddle.vision.ops.RoIAlign(output_size: Size2, spatial_scale: float = 1.0)

This interface constructs a callable object of the RoIAlign class. For details of the operation, refer to roi_align.

Parameters

output_size (int or tuple[int, int]) – The pooled output size (h, w), data type is int32. If an int is given, h and w are both equal to output_size.
spatial_scale (float, optional) – Multiplicative spatial scale factor that translates RoI coordinates from their input scale to the scale used when pooling. Default: 1.0.
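
To illustrate the role of spatial_scale, here is a minimal sketch in plain Python. The helper scale_box is hypothetical (it is not part of Paddle) and shows only the multiplication step; the actual operator also applies the aligned offset and bilinear sampling. The stride of 16 is an assumed example value.

```python
def scale_box(box, spatial_scale):
    """Map an (x1, y1, x2, y2) box from image scale to feature-map scale.

    Hypothetical helper for illustration; not a Paddle API.
    """
    x1, y1, x2, y2 = box
    return (x1 * spatial_scale, y1 * spatial_scale,
            x2 * spatial_scale, y2 * spatial_scale)


# Assumed setup: a feature map downsampled by a stride of 16
# relative to the input image, so spatial_scale = 1 / 16.
image_box = (64.0, 32.0, 256.0, 128.0)
feature_box = scale_box(image_box, spatial_scale=1.0 / 16)
print(feature_box)  # (4.0, 2.0, 16.0, 8.0)
```

With the default spatial_scale of 1.0, the boxes are taken to be already expressed in feature-map coordinates, as in the example further down this page.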

Returns

The output of the RoIAlign operator is a 4-D tensor with shape (num_boxes, channels, pooled_h, pooled_w).
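
As a rough sketch of how output_size determines the result shape, the plain-Python helper below computes the expected shape for both the int and the tuple form. The name roi_align_output_shape is hypothetical and not part of Paddle; it only mirrors the shape rule stated above.

```python
def roi_align_output_shape(num_boxes, channels, output_size):
    """Return the expected [num_boxes, channels, pooled_h, pooled_w] shape.

    Hypothetical helper for illustration; not a Paddle API.
    """
    if isinstance(output_size, int):
        # A single int is used for both height and width.
        pooled_h = pooled_w = output_size
    else:
        pooled_h, pooled_w = output_size
    return [num_boxes, channels, pooled_h, pooled_w]


print(roi_align_output_shape(3, 256, (4, 3)))  # [3, 256, 4, 3]
print(roi_align_output_shape(3, 256, 7))       # [3, 256, 7, 7]
```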

Examples

>>> import paddle
>>> from paddle.vision.ops import RoIAlign

>>> data = paddle.rand([1, 256, 32, 32])
>>> boxes = paddle.rand([3, 4])
>>> boxes[:, 2] += boxes[:, 0] + 3
>>> boxes[:, 3] += boxes[:, 1] + 4
>>> boxes_num = paddle.to_tensor([3]).astype('int32')
>>> roi_align = RoIAlign(output_size=(4, 3))
>>> align_out = roi_align(data, boxes, boxes_num)
>>> print(align_out.shape)
[3, 256, 4, 3]

forward(x: Tensor, boxes: Tensor, boxes_num: Tensor, aligned: bool = True) → Tensor

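Defines the computation performed at every call. Applies roi_align to x using the output_size and spatial_scale given at construction.

Parameters

x (Tensor) – The input feature map, a 4-D tensor with shape (N, C, H, W).
boxes (Tensor) – The boxes (RoIs) to pool over, a 2-D tensor with shape (num_boxes, 4). Each box is given as [x1, y1, x2, y2].
boxes_num (Tensor) – The number of boxes for each image in the batch, data type is int32.
aligned (bool, optional) – If True, the box coordinates are shifted by -0.5 pixel for better alignment with the neighboring pixel indices. If False, the legacy implementation is used. Default: True.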
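
The forward signature takes boxes_num alongside boxes. As a minimal sketch of how the two relate, assuming boxes_num holds the number of boxes for each image in the batch and boxes lists them image by image, the plain-Python helper below recovers which image each box belongs to. The name box_image_indices is hypothetical and not part of Paddle.

```python
def box_image_indices(boxes_num):
    """Return, for each box, the index of the batch image it belongs to.

    Hypothetical helper for illustration; not a Paddle API.
    Assumes boxes are stored image by image, in batch order.
    """
    indices = []
    for image_index, count in enumerate(boxes_num):
        indices.extend([image_index] * count)
    return indices


# Assumed example: a batch of 2 images with 2 and 1 boxes respectively,
# so `boxes` would have shape [3, 4] and `x` a batch dimension of 2.
print(box_image_indices([2, 1]))  # [0, 0, 1]
# The example on this page uses a single image with 3 boxes.
print(box_image_indices([3]))     # [0, 0, 0]
```

Under this assumption, the entries of boxes_num sum to the number of rows in boxes.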