-
Notifications
You must be signed in to change notification settings - Fork 769
[SYCL] Don't remove the command from leaves list when adding empty command #2542
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Closed
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>
Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>
Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>
Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>
Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>
…mmand Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>
9eac4b6
to
4a76c3a
Compare
Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>
Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>
Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>
Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>
Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>
In this patch we start to store not only leaves for memory record. Hence, I prefer #2543 instead of this workaround. |
Too much of a workaround here. Won't really work if leaf limit is exceeded. |
kbenzie
pushed a commit
to kbenzie/intel-llvm
that referenced
this pull request
Feb 17, 2025
[L0 v2] implement urKernelGetSuggestedLocalWorkSize
Chenyang-L
pushed a commit
that referenced
this pull request
Feb 18, 2025
[L0 v2] implement urKernelGetSuggestedLocalWorkSize
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Let's have this part of dependency graph:
EC1
andEC2
are of helper typeEmptyCommand
. Both are blocked.C1
andC2
are of any other type which represents some meaningful operation i.e. the one that actually does something useful, likeExecCGCommand
with a host task orUpdateHostReqCommand
for host accessor.This part of graph is result of two submissions: the first one adds
EC1 -> C1
and the second addsEC2 -> C2 -> EC1
.The submission process in both cases adds the corresponding
Cx
command in first place and then adds the empty commandECx
. WhenCx
is added to the graph it's also added to list of leaves for corresponding memory record. Now, whenECx
is added to graph it removes correspondingCx
from list of leaves.ECx
is added to the list instead ofCx
. Blocked state ofECx
prevents it from being enqueued.Upon the second submission,
C2
isn't going to be enqueued right away as it depends on blockedEC1
.When
C1
finishes (whether it is a host task or host accessor's update command) SYCL RT will unblockEC1
and trigger enqueue process for leaves of memory recordsC1
was depending by. Eventually, RT will enqueue theEC1
which will set its event into complete state. Though,EC2
isn't going to be enqueued due to it's blocked state. Neither does its dependencies.PR with test: #2540
PR with comment fix: #2541
A more convenient way is available at #2543