tensorflow:: ops:: CombinedNonMaxSuppression
#include <image_ops.h>
Greedily selects a subset of bounding boxes in descending order of score,.
Summary
This operation performs non_max_suppression on the inputs per batch, across all classes. Prunes away boxes that have high intersection-over-union (IOU) overlap with previously selected boxes. Bounding boxes are supplied as [y1, x1, y2, x2], where (y1, x1) and (y2, x2) are the coordinates of any diagonal pair of box corners and the coordinates can be provided as normalized (i.e., lying in the interval [0, 1]) or absolute. Note that this algorithm is agnostic to where the origin is in the coordinate system. Also note that this algorithm is invariant to orthogonal transformations and translations of the coordinate system; thus translating or reflections of the coordinate system result in the same boxes being selected by the algorithm. The output of this operation is the final boxes, scores and classes tensor returned after performing non_max_suppression.
Args:
- scope: A Scope object
- boxes: A 4-D float tensor of shape
[batch_size, num_boxes, q, 4]
. Ifq
is 1 then same boxes are used for all classes otherwise, ifq
is equal to number of classes, class-specific boxes are used. - scores: A 3-D float tensor of shape
[batch_size, num_boxes, num_classes]
representing a single score corresponding to each box (each row of boxes). - max_output_size_per_class: A scalar integer tensor representing the maximum number of boxes to be selected by non max suppression per class
- max_total_size: An int32 scalar representing the maximum number of boxes retained over all classes. Note that setting this value to a large number may result in OOM error depending on the system workload.
- iou_threshold: A 0-D float tensor representing the threshold for deciding whether boxes overlap too much with respect to IOU.
- score_threshold: A 0-D float tensor representing the threshold for deciding when to remove boxes based on score.
Optional attributes (see Attrs
):
- pad_per_class: If false, the output nmsed boxes, scores and classes are padded/clipped to
max_total_size
. If true, the output nmsed boxes, scores and classes are padded to be of lengthmax_size_per_class
*num_classes
, unless it exceedsmax_total_size
in which case it is clipped tomax_total_size
. Defaults to false. - clip_boxes: If true, assume the box coordinates are between [0, 1] and clip the output boxes if they fall beyond [0, 1]. If false, do not do clipping and output the box coordinates as it is.
Returns:
Output
nmsed_boxes: A [batch_size, max_detections, 4] float32 tensor containing the non-max suppressed boxes.Output
nmsed_scores: A [batch_size, max_detections] float32 tensor containing the scores for the boxes.Output
nmsed_classes: A [batch_size, max_detections] float32 tensor containing the classes for the boxes.Output
valid_detections: A [batch_size] int32 tensor indicating the number of valid detections per batch item. Only the top num_detections[i] entries in nms_boxes[i], nms_scores[i] and nms_class[i] are valid. The rest of the entries are zero paddings.
Constructors and Destructors |
|
---|---|
CombinedNonMaxSuppression(const ::tensorflow::Scope & scope, ::tensorflow::Input boxes, ::tensorflow::Input scores, ::tensorflow::Input max_output_size_per_class, ::tensorflow::Input max_total_size, ::tensorflow::Input iou_threshold, ::tensorflow::Input score_threshold)
|
|
CombinedNonMaxSuppression(const ::tensorflow::Scope & scope, ::tensorflow::Input boxes, ::tensorflow::Input scores, ::tensorflow::Input max_output_size_per_class, ::tensorflow::Input max_total_size, ::tensorflow::Input iou_threshold, ::tensorflow::Input score_threshold, const CombinedNonMaxSuppression::Attrs & attrs)
|
Public attributes |
|
---|---|
nmsed_boxes
|
|
nmsed_classes
|
|
nmsed_scores
|
|
operation
|
|
valid_detections
|
Public static functions |
|
---|---|
ClipBoxes(bool x)
|
|
PadPerClass(bool x)
|
Structs |
|
---|---|
tensorflow:: |
Optional attribute setters for CombinedNonMaxSuppression. |
Public attributes
nmsed_boxes
::tensorflow::Output nmsed_boxes
nmsed_classes
::tensorflow::Output nmsed_classes
nmsed_scores
::tensorflow::Output nmsed_scores
operation
Operation operation
valid_detections
::tensorflow::Output valid_detections
Public functions
CombinedNonMaxSuppression
CombinedNonMaxSuppression( const ::tensorflow::Scope & scope, ::tensorflow::Input boxes, ::tensorflow::Input scores, ::tensorflow::Input max_output_size_per_class, ::tensorflow::Input max_total_size, ::tensorflow::Input iou_threshold, ::tensorflow::Input score_threshold )
CombinedNonMaxSuppression
CombinedNonMaxSuppression( const ::tensorflow::Scope & scope, ::tensorflow::Input boxes, ::tensorflow::Input scores, ::tensorflow::Input max_output_size_per_class, ::tensorflow::Input max_total_size, ::tensorflow::Input iou_threshold, ::tensorflow::Input score_threshold, const CombinedNonMaxSuppression::Attrs & attrs )
Public static functions
ClipBoxes
Attrs ClipBoxes( bool x )
PadPerClass
Attrs PadPerClass( bool x )