
callbacks revision: allow group lr and correct the logging for resuming training #386


Merged

1 commit merged into mindspore-lab:main from the callback branch on Jun 23, 2023

Conversation

@wtomin (Collaborator) commented Jun 13, 2023

Thank you for your contribution to the MindOCR repo.

Motivation

When resuming training, the epoch progress should be logged as [cur_epoch/(training_epochs + start_epoch)], so that the displayed total includes the epochs completed before resuming and avoids confusion.
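
A minimal sketch (not the actual MindOCR callback code) of the intended formatting, using a hypothetical helper with the cur_epoch, train_epochs, and start_epoch values available in the training loop:

def format_epoch_progress(cur_epoch, train_epochs, start_epoch=0):
    # The displayed total is train_epochs + start_epoch, so the progress
    # accounts for the epochs completed before the resume.
    total_epochs = train_epochs + start_epoch
    return f"[{cur_epoch}/{total_epochs}]"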

It is possible that opt.get_lr() returns a list of scalars rather than a single scalar, for example when the user passes a list of parameter-group dictionaries to the optimizer:

[{'params': group_base_params, 'lr': base_lr}, {'params': group_small_params, 'lr': small_lr}]

In that case, the visualization should print each group's distinct lr (see the sketch below).
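
A minimal sketch of how the printing could handle both cases, assuming opt.get_lr() returns either a single value or a list/tuple of per-group values (not the actual MindOCR callback code):

lr_value = opt.get_lr()
if isinstance(lr_value, (list, tuple)):
    # one learning rate per parameter group
    lr_str = ", ".join(f"{float(lr):.6f}" for lr in lr_value)
else:
    lr_str = f"{float(lr_value):.6f}"
print(f"lr: {lr_str}")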

Test Plan

  1. Resume training with start_epoch > 1 and check that the logged epoch progress is correct.
  2. In mindocr/optim/param_grouping.py, create a new grouping function that returns a list of dictionaries containing different lrs, for example:
def grouping_params(params, filter_name, learning_rate):
    """Split params into two groups and assign a smaller lr to the second group."""
    base_params, small_params = [], []
    for param in params:
        if filter_name in param.name:
            base_params.append(param)
        else:
            small_params.append(param)
    # Parameters whose name matches filter_name use the base lr;
    # all remaining parameters use one tenth of it.
    return [
        {'params': base_params, 'lr': learning_rate},
        {'params': small_params, 'lr': learning_rate * 0.1},
    ]
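
For reference, a hypothetical usage sketch showing how the grouped parameters could be passed to a MindSpore optimizer; the network object net, the filter name "backbone", and the lr values are illustrative assumptions:

import mindspore.nn as nn

# Group the trainable parameters: names containing "backbone" keep the base lr,
# all other parameters get 0.1x of it (illustrative values).
grouped = grouping_params(net.trainable_params(), filter_name="backbone", learning_rate=0.01)
# MindSpore optimizers accept a list of parameter-group dicts; a group's 'lr'
# overrides the optimizer-level learning_rate for that group.
optimizer = nn.Momentum(grouped, learning_rate=0.01, momentum=0.9)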

Related Issues and PRs

N/A

@wtomin wtomin force-pushed the callback branch 2 times, most recently from ab50f17 to 0525cd3 on June 23, 2023 at 08:28
@wtomin wtomin merged commit b193f84 into mindspore-lab:main Jun 23, 2023
@wtomin wtomin deleted the callback branch July 12, 2023 03:43