@bernardev254
Summary

This PR implements the preprocessing() function for the object detection pipeline in hackathon/objectdetection.py.
Closes #1 (Complete 'preprocessing' function for bounding boxes).

Changes

  • Extracts image, bounding boxes, and labels from VOC-style TFDS samples.
  • Resizes images using prepare_image.
  • Converts bounding boxes from absolute coordinates to normalized format [ymin, xmin, ymax, xmax].
  • Flattens the "objects" dictionary into separate "bboxes" and "labels" fields.
  • Returns a structure compatible with downstream augmentation steps.
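
The steps listed above can be illustrated with a minimal sketch. It is not the code from this PR. It assumes the sample keys "image", "objects", "bbox", and "label", assumes that input boxes are absolute pixel values ordered [ymin, xmin, ymax, xmax], and represents the image as a nested list of rows so that it runs without TensorFlow. The names normalize_bbox and preprocessing_sketch are hypothetical, and the prepare_image argument is a stand-in, since the signature of the real prepare_image is not shown here.

```python
from typing import Any, Callable, Dict, List, Sequence


def normalize_bbox(box: Sequence[float], height: int, width: int) -> List[float]:
    """Scale an absolute [ymin, xmin, ymax, xmax] box into the [0, 1] range."""
    ymin, xmin, ymax, xmax = box
    # y coordinates are divided by the image height, x coordinates by the width.
    return [ymin / height, xmin / width, ymax / height, xmax / width]


def preprocessing_sketch(
    sample: Dict[str, Any],
    prepare_image: Callable[[Any], Any] = lambda image: image,
) -> Dict[str, Any]:
    """Flatten a VOC-style sample into "image", "bboxes", and "labels" fields.

    `prepare_image` is a placeholder for the pipeline's resize helper;
    the default returns the image unchanged (an assumption for illustration).
    """
    image = sample["image"]
    # Assumed layout: image is a list of rows, so len(image) is the height.
    height, width = len(image), len(image[0])

    objects = sample["objects"]
    bboxes = [normalize_bbox(box, height, width) for box in objects["bbox"]]

    return {
        "image": prepare_image(image),
        "bboxes": bboxes,
        "labels": list(objects["label"]),
    }


# Usage: a 4 x 8 dummy image with one object.
sample = {
    "image": [[0] * 8 for _ in range(4)],
    "objects": {"bbox": [[1, 2, 3, 6]], "label": [7]},
}
out = preprocessing_sketch(sample)
```

Normalizing against the original image size before resizing keeps the boxes valid after prepare_image changes the resolution, provided the resize stretches the whole image rather than cropping or padding it.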

Motivation

This change lets the object detection pipeline prepare samples correctly for augmentation and model input. It aligns the preprocessing output with the format that the augmentations() function expects.
