[GPU] Logging cleanup #2446

echeresh · 2025-01-18T01:10:26Z

Jira: https://jira.devtools.intel.com/browse/MFDNN-11400.

List of changes:

- Moved logging functionality to src/gpu/intel/logging.hpp with some slight refactoring (constant -> enum class)
  - Renamed ir_{warning,perf,trace,...} to gpu-prefixed versions
- Reworked logging format: see src/gpu/intel/logging.hpp
  - Example: [TRACE][ codegen.cpp:95] codegen:bind h_34 -> r18 - r19
  - Added level tag and file name/line number information
- Simplified logging implementation, mostly minor things:
  - Removed less important ir_check-related features such as base_logger_t, fatal and dynamic log levels, and the IR check guard
  - Added automatic new-line after every log message
- Dropped ir_assert (and similar macros) and switched their usages to the existing gpu_assert, etc.
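
As a rough illustration of the reworked format, here is a minimal sketch that reproduces the example line above. Every name in it (`log_level_t`, `to_tag`, `format_log_line`, `SKETCH_LOG_TRACE`) is hypothetical, and the 15-character right-aligned location field is only a guess inferred from the example; the actual definitions are in src/gpu/intel/logging.hpp and may differ.

```cpp
#include <cassert>
#include <iomanip>
#include <sstream>
#include <string>

// Hypothetical names throughout; the real code is in
// src/gpu/intel/logging.hpp and may look different.
enum class log_level_t { warning, info, perf, trace };

inline const char *to_tag(log_level_t level) {
    switch (level) {
        case log_level_t::warning: return "WARNING";
        case log_level_t::info: return "INFO";
        case log_level_t::perf: return "PERF";
        case log_level_t::trace: return "TRACE";
    }
    return "UNKNOWN";
}

// Strips directory components so only the file name is printed.
inline std::string base_name(const std::string &path) {
    auto pos = path.find_last_of("/\\");
    return pos == std::string::npos ? path : path.substr(pos + 1);
}

// Builds "[LEVEL][ file:line] message\n". The trailing newline is always
// appended, so callers never add one themselves.
inline std::string format_log_line(log_level_t level, const std::string &file,
        int line, const std::string &msg) {
    assert(line > 0); // __LINE__ is always positive
    std::ostringstream loc;
    loc << base_name(file) << ":" << line;
    std::ostringstream oss;
    // Assumption: the location is right-aligned in a 15-character field.
    oss << "[" << to_tag(level) << "][" << std::setw(15) << loc.str() << "] "
        << msg << "\n";
    return oss.str();
}

// Hypothetical convenience macro that captures the call site.
#define SKETCH_LOG_TRACE(msg) \
    format_log_line(log_level_t::trace, __FILE__, __LINE__, (msg))
```

Calling `format_log_line(log_level_t::trace, "codegen.cpp", 95, "codegen:bind h_34 -> r18 - r19")` yields the example line followed by a newline.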

Things left for the future:

- Documentation
- New log level: debug (or maybe a refactored info/trace combination). Currently "trace" is used for both very detailed tracing and some useful, more high-level logging. There should be more granularity, e.g.:
  - Trace: max detail level, logging after every IR pass and even from inside some passes
  - Debug/info: less detail but enough to understand what the kernel looks like, e.g. kernel parameters + final IR
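
The proposed granularity could be sketched as an ordered level enum with a threshold check. This is only an illustration of the idea: the `debug` level does not exist yet, and `log_level_t`, `is_enabled`, and `collect_messages` are hypothetical names, not the API in logging.hpp.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical ordering: a higher value means more detail. "debug" is the
// level proposed above; nothing here is copied from logging.hpp.
enum class log_level_t : int { off = 0, warning, info, debug, trace };

// A message is printed when its level does not exceed the verbosity.
inline bool is_enabled(log_level_t message_level, log_level_t verbosity) {
    // "off" is a threshold only, never the level of a message.
    assert(message_level != log_level_t::off);
    return static_cast<int>(message_level) <= static_cast<int>(verbosity);
}

// Returns what a kernel build would log for the given verbosity:
// debug prints kernel parameters and the final IR, trace additionally
// prints the IR after every pass.
inline std::vector<std::string> collect_messages(
        log_level_t verbosity, const std::vector<std::string> &passes) {
    std::vector<std::string> out;
    if (is_enabled(log_level_t::debug, verbosity))
        out.push_back("kernel parameters");
    for (const auto &pass : passes) {
        if (is_enabled(log_level_t::trace, verbosity))
            out.push_back("IR after " + pass);
    }
    if (is_enabled(log_level_t::debug, verbosity)) out.push_back("final IR");
    return out;
}
```

With two passes, debug verbosity produces two messages (parameters and final IR), while trace verbosity produces four.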

rjoursler · 2025-01-18T01:29:52Z

> Simplifying log levels, e.g. drop perf-specific levels as they are not really used AFAIK (?)

I have often used this when trying to optimize IR creation time. The main issue is that the trace level introduces overhead (~30% from what I remember) and this overhead was not evenly distributed. As a result, performance in trace mode only loosely corresponds to performance in release mode, which matters when prioritizing optimizations. Could we add perf information (perhaps in a more compressed format) to the info level, analogous to how oneDNN verbose maps into spdlog levels?
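
One possible shape for such compressed perf output is sketched below: stage timings are accumulated and emitted as a single summary line, instead of logging every event at trace level. This is an assumption about what "compressed" could mean, not an existing oneDNN facility; `perf_summary_t` and `scoped_timer_t` are hypothetical names.

```cpp
#include <cassert>
#include <chrono>
#include <map>
#include <sstream>
#include <string>
#include <utility>

// Hypothetical helper, not part of oneDNN: accumulates time per stage so
// that one compact line can be logged at the info level.
class perf_summary_t {
public:
    void add(const std::string &stage, long long usec) {
        assert(usec >= 0);
        stages_[stage] += usec;
        total_ += usec;
    }

    // std::map keeps stages sorted by name, so the output is deterministic.
    std::string str() const {
        std::ostringstream oss;
        oss << "perf: total=" << total_ << "us";
        for (const auto &kv : stages_)
            oss << " " << kv.first << "=" << kv.second << "us";
        return oss.str();
    }

private:
    std::map<std::string, long long> stages_;
    long long total_ = 0;
};

// RAII timer: measures its own lifetime and reports it on destruction.
class scoped_timer_t {
public:
    scoped_timer_t(perf_summary_t &summary, std::string stage)
        : summary_(summary)
        , stage_(std::move(stage))
        , start_(std::chrono::steady_clock::now()) {}

    ~scoped_timer_t() {
        auto elapsed = std::chrono::duration_cast<std::chrono::microseconds>(
                std::chrono::steady_clock::now() - start_);
        summary_.add(stage_, elapsed.count());
    }

private:
    perf_summary_t &summary_;
    std::string stage_;
    std::chrono::steady_clock::time_point start_;
};
```

The only work on the hot path is one clock read at the start and end of each stage, which should keep the measurement closer to release-mode behavior than full tracing does.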

echeresh · 2025-01-21T21:47:01Z

Thanks. Yes, combining it with info might be a good option. Let's keep it as is for now. I plan to open one more PR and will think about this later.

src/gpu/intel/jit/utils/utils.hpp

src/gpu/intel/logging.hpp

src/gpu/intel/jit/reorder/gen_reorder.cpp

src/gpu/intel/jit/pooling/gen_pooling.cpp

echeresh · 2025-01-30T19:10:31Z

make test
set test_scope=NIGHTLY
disable test_device_cpu
enable arch_gpu_xe-hpc
enable arch_gpu_xe-hpg-atsm
enable arch_gpu_xe-hpg-dg2
enable arch_gpu_xe-lp
enable arch_gpu_xe-lpg
enable arch_gpu_xe-lpg+
enable arch_gpu_xe2-hpg-bmg
enable arch_gpu_xe2-lpg
enable arch_gpu_xe3-lpg

echeresh added the platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel label Jan 18, 2025

echeresh requested a review from a team as a code owner January 18, 2025 01:10

echeresh force-pushed the echeresh/log-cleanup branch 2 times, most recently from 3fe409d to 8b00fb1 Compare January 21, 2025 21:33

echeresh changed the title ~~WIP: [GPU] Logging cleanup~~ [GPU] Logging cleanup Jan 21, 2025

rjoursler reviewed Jan 22, 2025

echeresh force-pushed the echeresh/log-cleanup branch from 8b00fb1 to f08650a Compare January 23, 2025 20:27

rjoursler approved these changes Jan 23, 2025

atkassen approved these changes Jan 29, 2025

echeresh force-pushed the echeresh/log-cleanup branch from f08650a to 34ae4c7 Compare January 30, 2025 01:14

echeresh added 5 commits January 30, 2025 11:08

xe: ir: make log output deterministic

6f9577b

xe: jit: utils: simplify and remove unused code

486d134

xe: jit: logging: always add newline

096bf35

xe: consolidate logging functionality

a2cda6b

xe: conv: clean up log messages

4c4a607

echeresh force-pushed the echeresh/log-cleanup branch from 34ae4c7 to 4c4a607 Compare January 30, 2025 19:10

echeresh merged commit 4727d3c into main Jan 30, 2025
4 of 5 checks passed

echeresh deleted the echeresh/log-cleanup branch January 30, 2025 20:14
