matrix_nms¶
- paddle.vision.ops. matrix_nms ( bboxes, scores, score_threshold, post_threshold, nms_top_k, keep_top_k, use_gaussian=False, gaussian_sigma=2.0, background_label=0, normalized=True, return_index=False, return_rois_num=True, name=None ) [source]
-
This operator does matrix non maximum suppression (NMS). First selects a subset of candidate bounding boxes that have higher scores than score_threshold (if provided), then the top k candidate is selected if nms_top_k is larger than -1. Score of the remaining candidate are then decayed according to the Matrix NMS scheme. After NMS step, at most keep_top_k number of total bboxes are to be kept per image if keep_top_k is larger than -1.
- Parameters
-
bboxes (Tensor) – A 3-D Tensor with shape [N, M, 4] represents the predicted locations of M bounding bboxes, N is the batch size. Each bounding box has four coordinate values and the layout is [xmin, ymin, xmax, ymax], when box size equals to 4. The data type is float32 or float64.
scores (Tensor) – A 3-D Tensor with shape [N, C, M] represents the predicted confidence predictions. N is the batch size, C is the class number, M is number of bounding boxes. For each category there are total M scores which corresponding M bounding boxes. Please note, M is equal to the 2nd dimension of BBoxes. The data type is float32 or float64.
score_threshold (float) – Threshold to filter out bounding boxes with low confidence score.
post_threshold (float) – Threshold to filter out bounding boxes with low confidence score AFTER decaying.
nms_top_k (int) – Maximum number of detections to be kept according to the confidences after the filtering detections based on score_threshold.
keep_top_k (int) – Number of total bboxes to be kept per image after NMS step. -1 means keeping all bboxes after NMS step.
use_gaussian (bool, optional) – Use Gaussian as the decay function. Default: False
gaussian_sigma (float, optional) – Sigma for Gaussian decay function. Default: 2.0
background_label (int, optional) – The index of background label, the background label will be ignored. If set to -1, then all categories will be considered. Default: 0
normalized (bool, optional) – Whether detections are normalized. Default: True
return_index (bool, optional) – Whether return selected index. Default: False
return_rois_num (bool, optional) – whether return rois_num. Default: True
name (str, optional) – Name of the matrix nms op. Default: None.
- Returns
-
A tuple with three Tensor, (Out, Index, RoisNum) if return_index is True, otherwise, a tuple with two Tensor (Out, RoisNum) is returned.
Out (Tensor), A 2-D Tensor with shape [No, 6] containing the detection results. Each row has 6 values, [label, confidence, xmin, ymin, xmax, ymax]
Index (Tensor), A 2-D Tensor with shape [No, 1] containing the selected indices, which are absolute values cross batches.
rois_num (Tensor), A 1-D Tensor with shape [N] containing the number of detected boxes in each image.
Examples
>>> import paddle >>> from paddle.vision.ops import matrix_nms >>> boxes = paddle.rand([4, 1, 4]) >>> boxes[..., 2] = boxes[..., 0] + boxes[..., 2] >>> boxes[..., 3] = boxes[..., 1] + boxes[..., 3] >>> scores = paddle.rand([4, 80, 1]) >>> out = matrix_nms(bboxes=boxes, scores=scores, background_label=0, ... score_threshold=0.5, post_threshold=0.1, ... nms_top_k=400, keep_top_k=200, normalized=False)