feat[gpu]: scalar encodings #6109
Merged
CodSpeed HQ / CodSpeed Performance Analysis
failed
Jan 23, 2026 in 0s
Performance Regression: -18.15%
⚠️ Unknown Walltime execution environment detected
Using the Walltime instrument on standard Hosted Runners will lead to inconsistent data.
For the most accurate results, we recommend using CodSpeed Macro Runners: bare-metal machines fine-tuned for performance measurement consistency.
⚡ 8 improved benchmarks
❌ 3 regressed benchmarks
✅ 1263 untouched benchmarks
⏩ 1254 skipped benchmarks [1]
⚠️ Please fix the performance issues or acknowledge them on CodSpeed.
Performance Changes
| | Mode | Benchmark | BASE | HEAD | Efficiency |
|---|---|---|---|---|---|
| ⚡ | WallTime | u8_FoR[1K] | 14.4 µs | 6.2 µs | ×2.3 |
| ❌ | WallTime | u16_FoR[1M] | 6.1 µs | 7.4 µs | -18.15% |
| ⚡ | Simulation | canonical_into_non_nullable[(10000, 100, 0.01)] | 2.9 ms | 2.1 ms | +37.72% |
| ⚡ | Simulation | canonical_into_non_nullable[(10000, 100, 0.0)] | 2.7 ms | 1.9 ms | +42.32% |
| ⚡ | Simulation | canonical_into_non_nullable[(10000, 100, 0.1)] | 4.5 ms | 3.7 ms | +22.17% |
| ❌ | Simulation | canonical_into_nullable[(10000, 10, 0.0)] | 444.5 µs | 529.1 µs | -15.99% |
| ❌ | Simulation | canonical_into_nullable[(10000, 100, 0.0)] | 4.1 ms | 4.9 ms | -16.51% |
| ⚡ | Simulation | into_canonical_non_nullable[(10000, 100, 0.01)] | 3 ms | 2.2 ms | +36.64% |
| ⚡ | Simulation | into_canonical_non_nullable[(10000, 100, 0.1)] | 4.6 ms | 3.8 ms | +21.44% |
| ⚡ | Simulation | into_canonical_non_nullable[(10000, 100, 0.0)] | 2.7 ms | 1.9 ms | +41.68% |
| ⚡ | Simulation | into_canonical_nullable[(10000, 100, 0.0)] | 5.2 ms | 4.4 ms | +18.47% |
Comparing ji/scalar-gpu (f3a7bf3) with develop (03f0140)
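The Efficiency percentages in the table appear consistent with `base / head - 1`, so a slower HEAD yields a negative value. This is an inference from the reported numbers, not documented CodSpeed behavior, and the function names below are hypothetical. A minimal sketch, checked against the `canonical_into_nullable[(10000, 10, 0.0)]` row:

```python
def efficiency(base: float, head: float) -> float:
    """Relative change, ASSUMING efficiency = base / head - 1.

    Both timings must be in the same unit. Positive means HEAD is faster.
    """
    return base / head - 1.0


def format_change(base: float, head: float) -> str:
    """Render the assumed efficiency as a signed percentage string."""
    return f"{efficiency(base, head) * 100:+.2f}%"


# Row: canonical_into_nullable[(10000, 10, 0.0)], BASE 444.5 µs, HEAD 529.1 µs
print(format_change(444.5, 529.1))  # -15.99%
```

The BASE and HEAD columns are rounded for display, so rows shown with only one decimal place (for example 6.1 µs against 7.4 µs) reproduce the reported percentage only approximately.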
Footnotes
1. 1254 benchmarks were skipped, so the baseline results were used instead. If they were deleted from the codebase, archive them on CodSpeed to remove them from the performance reports.