Benchmarks: Micro benchmark - add ncu profile support in cublaslt-gemm #740

yukirora · 2025-09-16T09:34:55Z

Description
This PR adds NCU (NVIDIA Nsight Compute) profiling support to the cublaslt-gemm micro benchmark, enabling detailed kernel analysis including DRAM throughput, compute throughput, and launch arguments.

Major Revision

Adds the --enable_ncu_profiling and --profiling_metrics arguments for NCU profiling
Modifies command execution to use NCU when profiling is enabled
Updates result parsing to handle both standard and NCU profiled output formats
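The first two changes above can be sketched as follows. This is a minimal illustration, not the code from the PR: only the two argument names come from the description, while the argument types, the helper names `build_parser` and `wrap_with_ncu`, and the example benchmark command line are assumptions. The `ncu` options used (`--csv`, `--metrics`) are standard Nsight Compute CLI options.

```python
import argparse


def build_parser():
    """Build a parser with the two new arguments (types and defaults are assumed)."""
    parser = argparse.ArgumentParser()
    parser.add_argument(
        '--enable_ncu_profiling',
        action='store_true',
        default=False,
        help='Run the benchmark under ncu (Nsight Compute).',
    )
    parser.add_argument(
        '--profiling_metrics',
        type=str,
        nargs='+',
        default=None,
        help='Metric names to collect when profiling is enabled.',
    )
    return parser


def wrap_with_ncu(command, args):
    """Prefix the benchmark command with ncu when profiling is enabled."""
    if not args.enable_ncu_profiling:
        # Profiling disabled: run the benchmark command unchanged.
        return command
    ncu_parts = ['ncu', '--csv']
    if args.profiling_metrics:
        # ncu accepts a comma-separated metric list.
        ncu_parts += ['--metrics', ','.join(args.profiling_metrics)]
    return ' '.join(ncu_parts) + ' ' + command


if __name__ == '__main__':
    args = build_parser().parse_args([
        '--enable_ncu_profiling',
        '--profiling_metrics',
        'dram__throughput.avg.pct_of_peak_sustained_elapsed',
        'sm__throughput.avg.pct_of_peak_sustained_elapsed',
    ])
    # The benchmark command line here is illustrative only.
    print(wrap_with_ncu('cublaslt_gemm -m 4096 -n 4096 -k 4096', args))
```

Keeping the wrapping in one helper means the unprofiled path is untouched when the flag is off, which makes both branches easy to cover in unit tests.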

superbench/benchmarks/micro_benchmarks/cublaslt_function.py

abuccts

Please fix the errors in the unit tests.

tests/benchmarks/micro_benchmarks/test_cublaslt_function.py

Copilot

Pull Request Overview

This PR adds NCU (NVIDIA Nsight Compute) profiling support to the cublaslt-gemm micro benchmark, enabling detailed kernel analysis including DRAM throughput, compute throughput, and launch arguments.

Adds two new command-line arguments: --enable_ncu_profiling and --profiling_metrics
Modifies command execution to use NCU when profiling is enabled
Updates result parsing to handle both standard and NCU profiled output formats

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

File	Description
superbench/benchmarks/micro_benchmarks/cublaslt_function.py	Adds NCU profiling arguments, command wrapping, and CSV output parsing
tests/benchmarks/micro_benchmarks/test_cublaslt_function.py	Updates test cases to include new profiling arguments and adds NCU output parsing test
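The CSV output parsing mentioned for cublaslt_function.py could look roughly like the sketch below. It is not the PR's implementation: the function name `parse_ncu_csv`, the sample output, and the kernel name are hypothetical, and the column names ("Metric Name", "Metric Value") are assumed to match what `ncu --csv` prints. The sketch relies on ncu status lines starting with `==` and CSV rows starting with a double quote.

```python
import csv
import io

# Hypothetical sample of profiled output; real ncu output has more columns.
SAMPLE_OUTPUT = '''==PROF== Connected to process 1234
"ID","Kernel Name","Metric Name","Metric Unit","Metric Value"
"0","gemm_kernel","dram__throughput.avg.pct_of_peak_sustained_elapsed","%","12.50"
"0","gemm_kernel","sm__throughput.avg.pct_of_peak_sustained_elapsed","%","87.25"
'''


def parse_ncu_csv(raw_output):
    """Return {metric name: value} from the CSV part of profiled output."""
    # Keep only quoted CSV rows; this drops '==PROF==' status lines and
    # any plain-text lines printed by the benchmark itself.
    csv_lines = [line for line in raw_output.splitlines() if line.startswith('"')]
    reader = csv.DictReader(io.StringIO('\n'.join(csv_lines)))
    metrics = {}
    for row in reader:
        value = row['Metric Value'].replace(',', '')  # strip thousands separators
        try:
            metrics[row['Metric Name']] = float(value)
        except ValueError:
            # Keep non-numeric values (for example launch arguments) as text.
            metrics[row['Metric Name']] = value
    return metrics


if __name__ == '__main__':
    for name, value in parse_ncu_csv(SAMPLE_OUTPUT).items():
        print(name, value)
```

If several kernels are profiled in one run, later rows overwrite earlier ones in this sketch; a real parser would likely key results by kernel name as well.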

superbench/benchmarks/micro_benchmarks/cublaslt_function.py

codecov · 2025-09-28T09:40:15Z

Codecov Report

❌ Patch coverage is 81.81818% with 8 lines in your changes missing coverage. Please review.
✅ Project coverage is 85.71%. Comparing base (fe23426) to head (638ba7f).
⚠️ Report is 1 commit behind head on main.

Files with missing lines	Patch %	Lines
...h/benchmarks/micro_benchmarks/cublaslt_function.py	81.81%	8 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #740      +/-   ##
==========================================
- Coverage   85.74%   85.71%   -0.04%     
==========================================
  Files         102      102              
  Lines        7640     7678      +38     
==========================================
+ Hits         6551     6581      +30     
- Misses       1089     1097       +8
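The percentages in the diff above follow from the hit and line counts. A small check, assuming percentages are rounded down to two decimal places (an assumption that is consistent with the reported figures, not something the report states):

```python
import math


def pct_floor(hits, lines):
    """Coverage percentage, rounded down to two decimal places."""
    return math.floor(hits / lines * 10000) / 100


def delta_floor(base, head):
    """Change in coverage percentage, rounded down to two decimal places."""
    base_hits, base_lines = base
    head_hits, head_lines = head
    diff = (head_hits / head_lines - base_hits / base_lines) * 100
    return math.floor(diff * 100) / 100


BASE = (6551, 7640)  # hits, lines on main
HEAD = (6581, 7678)  # hits, lines on #740

if __name__ == '__main__':
    print(pct_floor(*BASE))         # 85.74
    print(pct_floor(*HEAD))         # 85.71
    print(delta_floor(BASE, HEAD))  # -0.04
    print(7640 - 6551, 7678 - 6581)  # misses: 1089 1097
```

Under round-to-nearest the base figure would be 85.75% and the change -0.03%, which is why the rounding mode matters when reproducing the table.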

Flag	Coverage Δ
cpu-python3.10-unit-test	`70.94% <81.81%> (+0.04%)`	⬆️
cpu-python3.12-unit-test	`70.94% <81.81%> (+0.04%)`	⬆️
cpu-python3.7-unit-test	`70.39% <81.81%> (+0.04%)`	⬆️
cuda-unit-test	`83.61% <81.81%> (-0.03%)`	⬇️

yukirora added 6 commits September 10, 2025 18:10

add ncu profile in cublalt-gemm

9ebc19f

fix bug

c88f41c

fix bug

629be83

fixbug

c354b78

update metric name

20e22f7

update

0a6254a

yukirora requested a review from a team as a code owner September 16, 2025 09:34

yukirora added the benchmarks SuperBench Benchmarks label Sep 16, 2025

yukirora assigned abuccts Sep 18, 2025

cp5555 approved these changes Sep 18, 2025

cp5555 added the micro-benchmarks Micro Benchmark Test for SuperBench Benchmarks label Sep 18, 2025

guoshzhao reviewed Sep 18, 2025

superbench/benchmarks/micro_benchmarks/cublaslt_function.py (outdated, resolved)

guoshzhao approved these changes Sep 19, 2025

abuccts reviewed Sep 22, 2025

tests/benchmarks/micro_benchmarks/test_cublaslt_function.py (outdated, resolved)

abuccts requested a review from Copilot September 22, 2025 07:26

Copilot AI reviewed Sep 22, 2025

superbench/benchmarks/micro_benchmarks/cublaslt_function.py (outdated, resolved)

superbench/benchmarks/micro_benchmarks/cublaslt_function.py (outdated, resolved)

superbench/benchmarks/micro_benchmarks/cublaslt_function.py (outdated, resolved)

yukirora added 2 commits September 28, 2025 17:05

fix test issue

7158752

fix lint issue

cbf2804

yukirora added 2 commits September 29, 2025 15:25

Merge branch 'main' into yutji/cublaslt-profile

c5dfb5e

Merge branch 'main' into yutji/cublaslt-profile

7a7ba34

guoshzhao mentioned this pull request Oct 2, 2025

V0.13.0 Release Plan #743

Open

30 tasks

Merge branch 'main' into yutji/cublaslt-profile

638ba7f

yukirora merged commit f6e65a9 into main Oct 23, 2025
26 of 27 checks passed

yukirora deleted the yutji/cublaslt-profile branch October 23, 2025 06:38

5 participants
