
sycl: Add more debug prints #13640


Open

wants to merge 5 commits into base: master

Conversation

Rbiessy
Collaborator

@Rbiessy Rbiessy commented May 19, 2025

This adds more debug prints describing the operations being run. scope_op_debug_print is introduced so that a single line of code prints both the "call ..." and "call ... done" logs.
For every operation, this adds more information about its destination and input tensors: the tensor type, number of elements, stride sizes, and whether the tensor is strided (i.e. non-contiguous) or permuted.
For mul_mat this also adds debug prints when conversion kernels are called, from and to fp16, fp32 and quantized types.
This also adds debug prints for a few operations that were missing them.
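
The "call ..." / "call ... done" pairing fits naturally as an RAII helper. Below is a minimal sketch of the idea, not the PR's actual implementation: names and log format are illustrative, and it appends to a string (rather than the real debug stream) so the behavior is easy to check.

```cpp
#include <string>
#include <string_view>

std::string g_log;  // stand-in for the real debug output stream

// Logs "call <name>" on construction and "call <name> done" on
// destruction, so one object at the top of a function covers both
// lines, including on early return. Illustrative sketch only.
struct scope_op_debug_print {
    std::string_view func;  // string_view avoids a std::string allocation
    explicit scope_op_debug_print(std::string_view f) : func(f) {
        g_log.append("[SYCL][OP] call ").append(func).append("\n");
    }
    ~scope_op_debug_print() {
        g_log.append("[SYCL][OP] call ").append(func).append(" done\n");
    }
};

void ggml_sycl_op_example() {
    scope_op_debug_print dbg{__func__};
    // ... enqueue SYCL kernels here ...
}
```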

The output for llama-2-7b.Q4_0 PP and TG is added here as an example:

@github-actions github-actions bot added ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language labels May 19, 2025
Collaborator

@qnixsynapse qnixsynapse left a comment


Nice.
Is it possible to extend this to the backend interface functions as well, such as init, set, memset, clear, reset, etc.?

@Rbiessy
Collaborator Author

Rbiessy commented May 22, 2025

Nice. Is it possible to extend this to the backend interface functions as well, such as init, set, memset, clear, reset, etc.?

I added some more logs and updated the example in the PR description. Note that I am not reusing scope_op_debug_print everywhere, as I wanted to differentiate between logs coming from the llama backend (like the functions you mentioned) and logs coming from the model (like the operations that are run). I added an [OP] tag in the log to distinguish between the two.

Collaborator

@Alcpz Alcpz left a comment


LGTM. If the logs get too noisy we can add a couple of levels to the debug variable in the future, to print either ops or API calls.

Collaborator

@qnixsynapse qnixsynapse left a comment


LGTM

@Rbiessy
Collaborator Author

Rbiessy commented May 23, 2025

I switched from std::string to std::string_view to avoid any cost associated with std::string. My measurements showed the PR was not enough to impact performance, but a small reproducer showed that more instructions were generated. With std::string_view that is no longer the case: f1 (using the macro) and f2 (using scope_dbg_print) compile to identical code, see https://gcc.godbolt.org/z/5T84n5rzs
I'll wait until Monday to merge this if there are no other comments.
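
The zero-overhead argument can be sketched in miniature. This is a hypothetical reconstruction of the f1/f2 comparison, not the code from the godbolt link: debug_log and the call counter are stand-ins. The key point is that a std::string_view over a string literal is just a pointer and a length, so the scope-object variant needs no heap allocation, unlike a std::string temporary.

```cpp
#include <string_view>

// Stand-in for the real logging call; counts invocations for checking.
static int g_calls = 0;
inline void debug_log(std::string_view msg) { ++g_calls; (void)msg; }

// f1: macro-style direct call, as in the original debug prints.
#define DBG_PRINT(name) debug_log(name)
void f1() { DBG_PRINT("mul_mat"); }

// f2: scope object taking std::string_view; constructing the view from
// a literal allocates nothing, so f2 can compile to the same code as f1.
struct scope_dbg_print {
    explicit scope_dbg_print(std::string_view name) { debug_log(name); }
};
void f2() { scope_dbg_print dbg{"mul_mat"}; }
```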
