add dbnet yaml for synthtext dataset and td500 dataset #257

Songyuanwei · 2023-05-05T07:27:11Z

Thank you for your contribution to the MindOCR repo.
Before submitting this PR, please make sure:

You have read the Contributing Guidelines on pull requests
Your code builds clean without any errors or warnings
You are using approved terminology
You have added unit tests

Motivation

Add DBNet YAML configs for the SynthText and TD500 datasets.

Test Plan

(How should this PR be tested? Do you require special setup to run the test or repro the fixed bug?)

Related Issues and PRs

Related issue: #222
(Is this PR part of a group of changes? Link the other relevant PRs and Issues here. Use https://help.github.com/en/articles/closing-issues-using-keywords for help on GitHub syntax)

SamitHuang · 2023-05-05T11:05:08Z

configs/det/dbnet/README.md

+
+| **Model**         | **Context**    | **Backbone** | **Pretrained** | **Recall** | **Precision** | **F-score** | **Train T.** | **Throughput** | **Recipe**                  | **Download**                                                                                                                                                                                         |
+|-------------------|----------------|--------------|----------------|------------|---------------|-------------|--------------|----------------|-----------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
+| DBNet (ours)      | D910x1-MS2.0-G | ResNet-50    | SynthText       | 82.47%     | 87.75%        | 85.03%      | 24.4 s/epoch  | 27.9 img/s      | [yaml](db_r50_td500.yaml) | [ckpt](https://download.mindspore.cn/toolkits/mindocr/dbnet/dbnet_resnet50_td500-0d12b5e8.ckpt)  |


Why is the Throughput for TD500 so different from that on SynthText (27.9 vs 82.02 FPS), considering the architecture and data processing pipeline should be the same?

The batch_size is different. After adjusting num_workers, the current throughput for TD500 is 51.1 img/s.
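As a rough sanity check on the two README rows discussed here, the sketch below multiplies the reported seconds per epoch by the reported throughput to estimate how many images each run processes per epoch. It assumes the throughput figures are epoch-wide averages; the function name `implied_images_per_epoch` is hypothetical and not part of MindOCR.

```python
# Figures copied from the README tables in this PR.
RUNS = {
    "SynthText": {"sec_per_epoch": 10470.0, "img_per_sec": 82.02},
    "TD500": {"sec_per_epoch": 24.4, "img_per_sec": 27.9},
}


def implied_images_per_epoch(sec_per_epoch: float, img_per_sec: float) -> float:
    """Images processed in one epoch, assuming throughput is an epoch-wide average."""
    return sec_per_epoch * img_per_sec


for name, run in RUNS.items():
    images = implied_images_per_epoch(run["sec_per_epoch"], run["img_per_sec"])
    print(f"{name}: ~{images:.0f} images per epoch")
```

Under that assumption the SynthText run covers several hundred thousand images per epoch while the TD500 run covers only several hundred, so the two throughput numbers come from very different epoch lengths as well as different batch sizes.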

SamitHuang · 2023-05-06T05:09:38Z

configs/det/dbnet/README.md

The model README needs to include data preparation instructions for SynthText and TD500.

SamitHuang · 2023-05-07T11:53:15Z

configs/det/dbnet/db_r50_synthtext.yaml

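The weight decay here is 5e-4, which does not match the 1e-4 used in the original paper. What is the reason? (For example, is the final loss lower after the change?)

Also, net_columns_to_net is an old parameter name; it must be aligned with the latest ones by setting net_input_column_index and label_column_index.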

The original DBNet paper doesn't share details on pretraining the network with SynthText; it only mentions this:

For all the models, we first pre-train them with the SynthText dataset for 100k iterations.

Nor does the original repo have a config for SynthText. The "Synthetic Data for Text Localisation in Natural Images" paper gives pretraining hyperparameters, including a weight decay of 5e-4 (actually written as 5^-4, which I find odd); however, those hyperparameters were used to pretrain FCRN. So I guess it is up to us to find the best combination of hyperparameters for pretraining DBNet. We must be careful not to overfit the model, though, since there is no validation set for SynthText.
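To make the notation question concrete, here is a small arithmetic sketch comparing the two readings of the weight decay (5e-4 versus a literal 5^-4) against the 1e-4 that the reviewer cites for the DBNet paper. The variable names are illustrative only.

```python
import math

wd_scientific = 5e-4    # 5 x 10^-4, the value used in the YAML under review
wd_power = 5 ** -4      # a literal reading of "5^-4", i.e. 1 / 625
wd_dbnet_paper = 1e-4   # value the reviewer cites for the original DBNet paper

print(f"5e-4  = {wd_scientific}")
print(f"5^-4  = {wd_power}")
print(f"5^-4 / 5e-4 = {wd_power / wd_scientific:.1f}")
print(f"5e-4 / 1e-4 = {wd_scientific / wd_dbnet_paper:.1f}")

# The two readings differ by a factor of about 3.2.
assert math.isclose(wd_power, 1 / 625)
```

The literal reading is about three times larger than 5e-4, so which one the SynthText paper intended does matter if its hyperparameters are reused.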

SamitHuang · 2023-05-07T13:07:53Z

configs/det/dbnet/README.md

+
+| **Model**         | **Context**    | **Backbone** | **Pretrained** |  **Train T.** | **Throughput** | **Recipe**                  | **Download**                 |
+|-------------------|----------------|--------------|----------------|------------|---------------|-------------|--------------|
+| DBNet (ours)      | D910x1-MS2.0-G | ResNet-50    | ImageNet       |  10470 s/epoch  | 82.02 img/s      | [yaml](db_r50_synthtext.yaml) | [ckpt](https://download.mindspore.cn/toolkits/mindocr/dbnet/dbnet_resnet50_synthtext-40655acb.ckpt)  |


In the yaml, distribute is True, but the context here, D910x1-MS2.0-G, indicates a single card. Please double-check standalone/distributed mode and the number of cards used.

Please add the training loss result for SynthText.
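The standalone/distributed mismatch raised above could be caught mechanically. The sketch below is only an illustration: it assumes the README "Context" field follows the layout {device}x{pieces}-MS{version}-{mode}, and it assumes that distribute set to True should go with more than one device. Both helper functions are hypothetical and are not part of MindOCR.

```python
import re

# Assumed layout of the README "Context" field: {device}x{pieces}-MS{version}-{mode}.
_CONTEXT_RE = re.compile(
    r"(?P<device>[A-Za-z]+\d+)x(?P<pieces>\d+)-MS(?P<version>[\d.]+)-(?P<mode>[A-Z])"
)


def parse_context(context: str) -> dict:
    """Split a context string such as 'D910x1-MS2.0-G' into its parts."""
    match = _CONTEXT_RE.fullmatch(context)
    if match is None:
        raise ValueError(f"unrecognised context string: {context!r}")
    parts = match.groupdict()
    parts["pieces"] = int(parts["pieces"])
    return parts


def is_consistent(context: str, distribute: bool) -> bool:
    """Hypothetical rule: distribute=True should go with more than one device."""
    return (parse_context(context)["pieces"] > 1) == distribute


# The combination flagged in the review: single-card context, distribute: True.
print(is_consistent("D910x1-MS2.0-G", distribute=True))
```

For the combination flagged in the review the check reports a mismatch; either the context string or the distribute flag would need to change for it to pass.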

Songyuanwei force-pushed the branch_1 branch from 248b950 to 8313f44 Compare May 5, 2023 07:30

Songyuanwei requested review from SamitHuang, hadipash and HaoyangLee May 5, 2023 09:34

Songyuanwei force-pushed the branch_1 branch from 8313f44 to 1b53ba9 Compare May 5, 2023 09:57

SamitHuang reviewed May 5, 2023

SamitHuang reviewed May 6, 2023

SamitHuang reviewed May 7, 2023

hadipash approved these changes May 8, 2023

Songyuanwei force-pushed the branch_1 branch 9 times, most recently from cfa13c5 to fdb3366 Compare May 10, 2023 03:34

add dbnet yaml for synthtext dataset and td500 dataset

ca27ec2

Songyuanwei force-pushed the branch_1 branch from fdb3366 to ca27ec2 Compare May 10, 2023 03:35

Merge branch 'main' into branch_1

b6f977b

SamitHuang approved these changes May 10, 2023

SamitHuang merged commit 8038db3 into mindspore-lab:main May 10, 2023

Songyuanwei deleted the branch_1 branch May 10, 2023 06:20
