Add a unified resize operation for detection and a resize op for recognition inference #295

SamitHuang · 2023-05-18T06:51:23Z

Thank you for your contribution to the MindOCR repo.
Before submitting this PR, please make sure:

You have read the Contributing Guidelines on pull requests
Your code builds clean without any errors or warnings
You are using approved terminology
You have added unit tests

Motivation

(Write your motivation for proposed changes here.)

Test Plan

(How should this PR be tested? Do you require special setup to run the test or repro the fixed bug?)

Related Issues and PRs

(Is this PR part of a group of changes? Link the other relevant PRs and Issues here. Use https://help.github.com/en/articles/closing-issues-using-keywords for help on GitHub syntax)

hadipash · 2023-05-19T01:24:33Z

mindocr/data/transforms/det_transforms.py

+                padded_img = np.zeros((tar_h, tar_w, 3), dtype=np.uint8)
+                padded_img[:resize_h, :resize_w, :] = resized_img
+                data['image'] = padded_img


Suggested change

padded_img = np.zeros((tar_h, tar_w, 3), dtype=np.uint8)

padded_img[:resize_h, :resize_w, :] = resized_img

data['image'] = padded_img

data['image'] = np.pad(data['image'], ((0, tar_h - resize_h), (0, tar_w - resize_w), (0, 0)))

hadipash · 2023-05-19T01:30:01Z

mindocr/data/transforms/det_transforms.py

+            data['polys'][:, :, 0] = data['polys'][:, :, 0] * scale_w
+            data['polys'][:, :, 1] = data['polys'][:, :, 1] * scale_h


Suggested change

data['polys'][:, :, 0] = data['polys'][:, :, 0] * scale_w

data['polys'][:, :, 1] = data['polys'][:, :, 1] * scale_h

data['polys'] = data['polys'] * [scale_w, scale_h]

hadipash · 2023-05-19T01:37:40Z

mindocr/data/transforms/rec_transforms.py

+        resize_h = self.tar_h
+
+        if self.keep_ratio==False:
+            assert self.tar_w is not None, 'Must specify target_width if keep_ratio is False'


Move assert inside __init__?

hadipash · 2023-05-19T01:39:59Z

mindocr/data/transforms/rec_transforms.py

+            padded_img = np.zeros((self.tar_h, self.tar_w, 3), dtype=np.uint8)
+            padded_img[:, :resize_w, :] = resized_img
+            data['image'] = padded_img


Suggested change

padded_img = np.zeros((self.tar_h, self.tar_w, 3), dtype=np.uint8)

padded_img[:, :resize_w, :] = resized_img

data['image'] = padded_img

data['image'] = np.pad(data['image'], ((0, 0), (0, self.tar_w - resize_w), (0, 0)))

hadipash · 2023-05-19T01:44:51Z

configs/det/dbnet/db++_r50_icdar15.yaml

+      - DetResize: 
+          target_size: [ 1152, 2048]
+          keep_ratio: True
+          limit_type: auto


I feel this is a bit confusing. When I want an image to have a specific resolution (set by target_size) and keep its ratio, I must need to set limit_type to auto. Otherwise, I will get completely unexpected output.

Don't have to set limit_type auto in this case. You can just use the default "min". Here auto is just to make the same as ScalePadImage.

If I don't set limit_type to auto, the output will be of size 736 by the shortest side.

SamitHuang · 2023-05-23T03:20:56Z

under further improvment.

SamitHuang requested review from hadipash, zhtmike, HaoyangLee and Songyuanwei and removed request for hadipash May 18, 2023 06:51

HaoyangLee requested a review from liangxhao May 18, 2023 07:00

SamitHuang changed the title ~~Add a unified resize operation for detection and a resize for recognition inference~~ Add a unified resize operation for detection and a resize op for recognition inference May 18, 2023

hadipash approved these changes May 19, 2023

View reviewed changes

HaoyangLee approved these changes May 19, 2023

View reviewed changes

hadipash self-requested a review May 19, 2023 06:59

SamitHuang changed the title ~~Add a unified resize operation for detection and a resize op for recognition inference~~ Add a unified resize operation for detection and a resize op for recognition inference (don't merge) May 23, 2023

SamitHuang force-pushed the base branch 2 times, most recently from 32ebb72 to d1cd911 Compare May 24, 2023 04:22

Update det and rec resize operation for evaluation/inference

a974e36

SamitHuang force-pushed the base branch from 49d94b5 to a974e36 Compare May 24, 2023 04:52

SamitHuang changed the title ~~Add a unified resize operation for detection and a resize op for recognition inference (don't merge)~~ Add a unified resize operation for detection and a resize op for recognition inference May 24, 2023

Songyuanwei approved these changes May 24, 2023

View reviewed changes

SamitHuang merged commit 4afbea7 into mindspore-lab:main May 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add a unified resize operation for detection and a resize op for recognition inference #295

Add a unified resize operation for detection and a resize op for recognition inference #295

Uh oh!

SamitHuang commented May 18, 2023 •

edited

Loading

Uh oh!

hadipash May 19, 2023

Uh oh!

hadipash May 19, 2023

Uh oh!

hadipash May 19, 2023

Uh oh!

hadipash May 19, 2023

Uh oh!

hadipash May 19, 2023

Uh oh!

SamitHuang May 19, 2023

Uh oh!

hadipash May 19, 2023

Uh oh!

SamitHuang commented May 23, 2023

Uh oh!

Uh oh!

		data['polys'][:, :, 0] = data['polys'][:, :, 0] * scale_w
		data['polys'][:, :, 1] = data['polys'][:, :, 1] * scale_h

	data['polys'][:, :, 0] = data['polys'][:, :, 0] * scale_w
	data['polys'][:, :, 1] = data['polys'][:, :, 1] * scale_h
	data['polys'] = data['polys'] * [scale_w, scale_h]

Add a unified resize operation for detection and a resize op for recognition inference #295

Add a unified resize operation for detection and a resize op for recognition inference #295

Uh oh!

Conversation

SamitHuang commented May 18, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Test Plan

Related Issues and PRs

Uh oh!

hadipash May 19, 2023

Choose a reason for hiding this comment

Uh oh!

hadipash May 19, 2023

Choose a reason for hiding this comment

Uh oh!

hadipash May 19, 2023

Choose a reason for hiding this comment

Uh oh!

hadipash May 19, 2023

Choose a reason for hiding this comment

Uh oh!

hadipash May 19, 2023

Choose a reason for hiding this comment

Uh oh!

SamitHuang May 19, 2023

Choose a reason for hiding this comment

Uh oh!

hadipash May 19, 2023

Choose a reason for hiding this comment

Uh oh!

SamitHuang commented May 23, 2023

Uh oh!

Uh oh!

SamitHuang commented May 18, 2023 •

edited

Loading