[FIX][CI] hotfix check_grad perf regression #8581

altanh · 2021-07-28T23:41:44Z

Lifted the interpreter executor evaluation out of a hot loop in check_grad. Additionally, modified the interpreter executor to only create one Interpreter per expression evaluation (rather than each time the evaluated expression closure is invoked). Also fixed a bug in the grouped_conv2d default config on ARM CPU.

cc @tqchen

junrushao

Thanks for the fix!

altanh · 2021-07-29T05:57:01Z

It turns out the problem with the Interpreter executor goes a bit deeper. Please wait to merge until we settle on a complete solution.

altanh · 2021-07-29T07:08:00Z

cc @Wheest, author of #6137, regarding the fix I added for the grouped_conv2d default config on ARM CPU. Please check that it's correct.

tqchen · 2021-07-29T14:23:17Z

@altanh your change might have triggered other errors. Let us skip the interpreter in the gradient checkers and focus on running VM instead

altanh · 2021-07-29T15:14:58Z

> @altanh your change might have triggered other errors. Let us skip the interpreter in the gradient checkers and focus on running VM instead

I saw the errors. I'm trying one more quick fix, and if it doesn't work I will open an alternative PR that disables the interpreter in check_grad.

leandron · 2021-07-30T08:10:34Z

This is merged now. Thanks @altanh, @tqchen, @junrushao1994, @jcf94, @mbrookhart and @jroesch!

Wheest · 2021-08-01T14:54:01Z

Thanks @altanh, the ARM grouped conv change LGTM. How did you discover it, if it wasn't caught by the tests in CI?

altanh · 2021-08-03T15:52:13Z

> Thanks @altanh, the ARM grouped conv change LGTM. How did you discover it, if it wasn't caught by the tests in CI?

Good question... I was debugging the test locally and encountered the error with a basic pytest invocation. Perhaps the ARM test wasn't being run, or the default config wasn't being used on CI? Definitely weird.

* hotfix check_grad perf regression: lift compile out of hot loop
* hoist interpreter creation out of python closure, fix weird conv2d bug on arm cpu
* lint
* try one more fix

hotfix check_grad perf regression: lift compile out of hot loop

9f997ba

altanh requested review from anijain2305, jroesch, junrushao, jwfromm, MarisaKirisame, mbrookhart, slyubomirsky, vinx13, wweic, yzhliu, zhiics and ZihengJiang as code owners July 28, 2021 23:41

tqchen approved these changes Jul 28, 2021

junrushao approved these changes Jul 28, 2021

junrushao linked an issue Jul 29, 2021 that may be closed by this pull request

[TEST] Conv2dGrad Takes long time to finish #8579

Closed

mbrookhart approved these changes Jul 29, 2021

jcf94 approved these changes Jul 29, 2021

jcf94 added the status: accepted label Jul 29, 2021

hoist interpreter creation out of python closure, fix weird conv2d bug on arm cpu

23d2351

altanh requested review from Huyuwei, kevinthesun, Laurawly and masahi as code owners July 29, 2021 07:03

lint

0505a08

try one more fix

fb7ce15

altanh changed the title ~~[FIX][CI] hotfix check_grad perf regression: lift compile out of hot loop~~ [FIX][CI] hotfix check_grad perf regression Jul 29, 2021

jroesch approved these changes Jul 29, 2021

leandron merged commit 8148028 into apache:main Jul 30, 2021
