Update SVTR Tiny Model #486

zhtmike · 2023-07-06T02:13:34Z

Update SVTR training data, to align with official training dataset.
Update LMDB dataset generator, support 1. dropping instance with zero text; 2. dropping instance which text length is larger than the maximum number the model can handle (especially for CTC alignment); 3. label standardization (NFKD)
Fix the SVTR augmentations, now all random variables should be randomized in __call__ instead of __init__
Update SVTR Tiny model accuracy: 89.02% -> 90.23%, FPS: 2968 -> 4560
Clear lot of warnings when model is running on Mindspore 2.0, including legacy warning of API change of nn.Dropout and ms_function
Fix SVTR convolutional kernel and support dropping positional encoding in SVTR backbone

Thank you for your contribution to the MindOCR repo.
Before submitting this PR, please make sure:

You have read the Contributing Guidelines on pull requests
Your code builds clean without any errors or warnings
You are using approved terminology
You have added unit tests

Motivation

(Write your motivation for proposed changes here.)

Test Plan

(How should this PR be tested? Do you require special setup to run the test or repro the fixed bug?)

Related Issues and PRs

(Is this PR part of a group of changes? Link the other relevant PRs and Issues here. Use https://help.github.com/en/articles/closing-issues-using-keywords for help on GitHub syntax)

mindocr/data/transforms/svtr_transform.py

HaoyangLee · 2023-07-10T06:55:32Z

configs/rec/svtr/README.md

@@ -38,7 +38,7 @@ According to our experiments, the evaluation results on public benchmark dataset

 | **Model** | **Context** | **Avg Accuracy** | **Train T.** | **FPS** | **Recipe** | **Download** |
 | :-----: | :-----------: | :--------------: | :----------: | :--------: | :--------: |:----------: |
-| SVTR-Tiny      | D910x4-MS1.10-G | 89.02%    | 4866 s/epoch       | 2968 | [yaml](https://github.com/mindspore-lab/mindocr/blob/main/configs/rec/svtr/svtr_tiny.yaml) | [ckpt](https://download.mindspore.cn/toolkits/mindocr/svtr/svtr_tiny-8542b3bb.ckpt) \| [mindir](https://download.mindspore.cn/toolkits/mindocr/svtr/svtr_tiny-8542b3bb-5cf5a130.mindir) |
+| SVTR-Tiny      | D910x4-MS1.10-G | 90.23%    | 3638 s/epoch       | 4560 | [yaml](https://github.com/mindspore-lab/mindocr/blob/main/configs/rec/svtr/svtr_tiny.yaml) | [ckpt](https://download.mindspore.cn/toolkits/mindocr/svtr/svtr_tiny-950be1c3.ckpt) \| [mindir](https://download.mindspore.cn/toolkits/mindocr/svtr/svtr_tiny-950be1c3-86ece8c8.mindir) |


After the PR is merging, ask @jianyunchao to delete the former ckpt/mindir files in https://download.mindspore.cn/toolkits/mindocr/svtr/

HaoyangLee · 2023-07-10T07:01:10Z

configs/rec/svtr/README.md

 │       ├── data.mdb
 │       └── lock.mdb
 └── validation
    ├── data.mdb
    └── lock.mdb
 ```

-#### 3.1.3 Dataset Usage
-
 Here we used the datasets under `training/` folders for training, and the union dataset `validation/` for validation. After training, we used the datasets under `evaluation/` to evaluate model accuracy.

 **Training:** (total 14,442,049 samples)


Since the ST dataset size is changed, the number of total training samples should also change?

Right, fixed.

SamitHuang · 2023-07-11T05:06:30Z

Nice work. What leads to the FPS improved from 2968 to 4560?

zhtmike · 2023-07-11T05:27:50Z

Nice work. What leads to the FPS improved from 2968 to 4560?

Not very sure. Seems removing some long text sample will be helpful. (CTC loss has max. length limit, too.long image will not contributed to the loss value but still cost some time in the previous setting)

HaoyangLee · 2023-07-11T06:47:52Z

mindocr/data/transforms/svtr_transform.py

            scale_img = cv2.pyrDown(scale_img)
        scale_img = cv2.resize(scale_img, (src_w, src_h), interpolation=get_interpolation())
        return scale_img


 class CVGaussianNoise(object):
-    def __init__(self, mean=0, var=20):
+    def __init__(self, mean=0, varience=20):


Suggested change

def __init__(self, mean=0, varience=20):

def __init__(self, mean=0, variance=20):

Do you mean variance? Same in class SVTRDeterioration

Update SVTR Model

351373f

zhtmike requested review from SamitHuang, hadipash, HaoyangLee and hqkate July 6, 2023 02:13

Merge branch 'main' into svtr_update

994e5d2

hadipash reviewed Jul 6, 2023

View reviewed changes

mindocr/data/transforms/svtr_transform.py Show resolved Hide resolved

Update url and accu

cf52b66

zhtmike marked this pull request as ready for review July 7, 2023 03:09

fix typo

6931e67

hadipash approved these changes Jul 10, 2023

View reviewed changes

HaoyangLee reviewed Jul 11, 2023

View reviewed changes

Fix the total number

2460d69

HaoyangLee approved these changes Jul 12, 2023

View reviewed changes

Fix typo

313fa23

zhtmike merged commit 7d20699 into mindspore-lab:main Jul 12, 2023

zhtmike deleted the svtr_update branch July 19, 2023 02:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update SVTR Tiny Model #486

Update SVTR Tiny Model #486

Uh oh!

zhtmike commented Jul 6, 2023 •

edited

Loading

Uh oh!

Uh oh!

HaoyangLee Jul 10, 2023

Uh oh!

HaoyangLee Jul 10, 2023

Uh oh!

zhtmike Jul 11, 2023

Uh oh!

SamitHuang commented Jul 11, 2023

Uh oh!

zhtmike commented Jul 11, 2023

Uh oh!

HaoyangLee Jul 11, 2023

Uh oh!

zhtmike Jul 12, 2023

Uh oh!

Uh oh!

	def __init__(self, mean=0, varience=20):
	def __init__(self, mean=0, variance=20):

Update SVTR Tiny Model #486

Update SVTR Tiny Model #486

Uh oh!

Conversation

zhtmike commented Jul 6, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Test Plan

Related Issues and PRs

Uh oh!

Uh oh!

HaoyangLee Jul 10, 2023

Choose a reason for hiding this comment

Uh oh!

HaoyangLee Jul 10, 2023

Choose a reason for hiding this comment

Uh oh!

zhtmike Jul 11, 2023

Choose a reason for hiding this comment

Uh oh!

SamitHuang commented Jul 11, 2023

Uh oh!

zhtmike commented Jul 11, 2023

Uh oh!

HaoyangLee Jul 11, 2023

Choose a reason for hiding this comment

Uh oh!

zhtmike Jul 12, 2023

Choose a reason for hiding this comment

Uh oh!

Uh oh!

zhtmike commented Jul 6, 2023 •

edited

Loading