unifying 2D and 3D messages #2

procopiostein · 2017-05-22T12:42:10Z

IMO I think we gain in simplicity by reducing the number of messages by merging, when possible, 2D and 3D msgs.
A pose can be interpreted as 2D or 3D according to the value on the z field.
For example, in many cases, wheeled robots in ROS publish their pose as a geometry_msgs/Pose, (inherently 3D) although they are always at the ground level.

tfoote

Overall this looks like a good suggestion. We generally recommend using generic 3D datatypes even in mostly 2D processing pipelines to keep the pipelines compatible. 2D processors should either have appropriate methods to either collapse or ignore the Z elements.

The above is on the caveat that a 2D bounding box in pixel space is different as it's a completely different unit and if this is expected to represent bounding boxes in pixel space that should be a different datatype.

tfoote · 2017-05-22T18:16:11Z

msg/BoundingBox.msg

Why change this to a Point?

This should be documented how to resolve this. What is it relative to? The pose of the pose?

I reverted to the original Vector type, and added a better description.

tfoote · 2017-05-22T18:24:56Z

msg/BoundingBox.msg

@@ -1,7 +1,7 @@
 # Note: This message plans to move to geometry_msgs in the final implementation.

-# position and rotation of the bounding box
+# position and orientation of the bounding box center
 geometry_msgs/Pose pose


Not in this PR but related to the other comment. This should probably be more considered an 'origin' or something instead of echoing the datatype.

I improved the description.

Kukanani · 2017-05-22T22:26:35Z

+1 on combining ObjectHypothesis. The DetectionXD messages will also have to be updated to point to the new message type.

Thanks for the PR!

This reverts commit 12f0451.

also changed datatype to uint32 as we are in the pixel space for 2D BB

procopiostein · 2017-05-23T07:39:18Z

Agreed @tfoote , I have not thought about that. I reverted the related commit and tried to improved the description and fields. Also changed the datatype for the size of a 2D BB to uint32, as it should be in pixels.

@Kukanani I updated the DetectionXD to use the generic ObjectHypothesisWithPose

mintar · 2017-05-23T15:34:11Z

msg/BoundingBox2D.msg

-# The x/y position and (optional) rotation of the bounding box center.
-geometry_msgs/Pose2D pose
+# The 2D position (in pixels) and orientation of the bounding box origin.
+geometry_msgs/Pose2D origin


Honest question: Is it common to have rotated bounding boxes in 2D classification? I've only seen axis-aligned bounding boxes so far. If so, we should replace the Pose2D with a "Point2D" (which doesn't exist yet), preferably with uint32 coordinates instead of float. If (almost) nobody uses the rotation, many people won't implement it correctly.

Rotated 2D bounding boxes often is used for grasp generators and human annotations. Since the bounding boxes will probably move to geometry_msgs, I think that a generic rotated version should be best.

mintar · 2017-05-23T15:38:32Z

msg/BoundingBox2D.msg

+# to the pose of its origin
+uint32 size_x
+uint32 size_y


Related to my previous comment: If we allow the bounding box origin to be given as floats with rotation, we should also allow floats for the size. The unit should still be pixels. However, I would prefer to remove the rotation and use uint32 everywhere (bounding box origin + size).

I agree with both your comments.

Sub-pixel bounding boxes are possible when using machine learning-based object detectors, but it's worth asking if that information is worth storing.

I'll also note that without floats for origin, it becomes difficult to accurately position an even-sized bounding box (if the BB is 2x2, where is its center?)

I think that by origin, @procopiostein meant something like the top-left corner, not the center. This is why a 2x2 BB is not a problem: it's the box between (origin.x, origin.y) and (origin.x + size_x, origin.y + size_y).

mintar · 2017-05-24T14:35:31Z

Let's continue the discussion in #5.

Procópio Stein added 2 commits May 22, 2017 14:36

unified BoundingBox3D and BoundingBox2D msgs

12f0451

unified ObjectHypothesis 2D and 3D in ObjectHypothesisWithPose

c3cdee6

tfoote reviewed May 22, 2017

View reviewed changes

Procópio Stein added 3 commits May 23, 2017 09:14

Revert "unified BoundingBox3D and BoundingBox2D msgs"

3f12c1d

This reverts commit 12f0451.

better description and attempt in normalizing fields

2a8bd77

also changed datatype to uint32 as we are in the pixel space for 2D BB

changed DetectionXD.msg to use generic ObjectHypothesisWithPose

3e13194

mintar reviewed May 23, 2017

View reviewed changes

Kukanani merged commit 1929d27 into ros-perception:master May 23, 2017

mintar mentioned this pull request May 24, 2017

[WIP] simpler BoundingBox2D #5

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

unifying 2D and 3D messages #2

unifying 2D and 3D messages #2

procopiostein commented May 22, 2017

tfoote left a comment

tfoote May 22, 2017

procopiostein May 23, 2017

tfoote May 22, 2017

procopiostein May 23, 2017

Kukanani commented May 22, 2017

procopiostein commented May 23, 2017

mintar May 23, 2017

Kukanani May 23, 2017

mintar May 23, 2017

procopiostein May 23, 2017

Kukanani May 23, 2017

mintar May 24, 2017

mintar commented May 24, 2017

unifying 2D and 3D messages #2

unifying 2D and 3D messages #2

Conversation

procopiostein commented May 22, 2017

tfoote left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Kukanani commented May 22, 2017

procopiostein commented May 23, 2017

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mintar commented May 24, 2017