[FA][Upstream PT] XPU out of memory raised by FA kernel with upstream pytorch #2042

Open

ESI-SYD opened this issue Aug 29, 2024 · 2 comments

ESI-SYD (Contributor) commented Aug 29, 2024

The flash attention benchmark fails after switching to upstream PyTorch: the torch reference path raises an XPU out-of-memory error. This appears to be a torch issue rather than a Triton one.

```
Traceback (most recent call last):
  File "/runner/_work/intel-xpu-backend-for-triton/intel-xpu-backend-for-triton/benchmarks/key_benchmarks/flash_attention_fwd_benchmark.py", line 245, in <module>
    benchmark.run(show_plots=False, print_data=True)
  File "/runner/_work/intel-xpu-backend-for-triton/intel-xpu-backend-for-triton/benchmarks/key_benchmarks/triton_kernels_benchmark/benchmark_testing.py", line 249, in run
    result_dfs.append(self._run(bench, save_path, show_plots, print_data, **kwargs))
  File "/runner/_work/intel-xpu-backend-for-triton/intel-xpu-backend-for-triton/benchmarks/key_benchmarks/triton_kernels_benchmark/benchmark_testing.py", line 179, in _run
    ret = self.fn(**x_args, **{bench.line_arg: y}, **bench.args, **kwrags)
  File "/runner/_work/intel-xpu-backend-for-triton/intel-xpu-backend-for-triton/benchmarks/key_benchmarks/flash_attention_fwd_benchmark.py", line 228, in benchmark
    benchmark_suit.assert_close(triton_fn(), torch_fn(), atol=atol, rtol=1e-3, err_msg="triton to torch")
  File "/runner/_work/intel-xpu-backend-for-triton/intel-xpu-backend-for-triton/benchmarks/key_benchmarks/flash_attention_fwd_benchmark.py", line 225, in <lambda>
    torch_fn = lambda: torch.nn.functional.scaled_dot_product_attention(
RuntimeError: XPU out of memory, please use `empty_cache` to release all unoccupied cached memory.
```
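
For reference, a minimal sketch of the failing torch reference path. The tensor shapes below are illustrative assumptions, not the benchmark's actual configuration (that lives in flash_attention_fwd_benchmark.py):

```python
import torch

# Illustrative shapes only (batch, heads, seq_len, head_dim); the real
# values come from the benchmark's parameter grid.
q, k, v = (torch.randn(4, 48, 4096, 64, dtype=torch.float16, device="xpu")
           for _ in range(3))

# This is the torch_fn side of the triton-vs-torch comparison; with
# upstream pytorch it raises "RuntimeError: XPU out of memory".
out = torch.nn.functional.scaled_dot_product_attention(q, k, v)
```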

CI:
https://github.com/intel/intel-xpu-backend-for-triton/actions/runs/10609254853/job/29404643614

Repro: use the PoC branch feature/deprecate_benchmark_ipex:

```shell
scripts/compile-triton.sh --venv
source .venv/bin/activate
scripts/test-triton.sh --attention
```

Related:
#1905

ESI-SYD (Contributor, Author) commented Sep 6, 2024

torch.xpu.empty_cache does not help; tracked in pytorch/pytorch#135085.
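
For context, a sketch of that attempted workaround, with the same illustrative (assumed) shapes as above:

```python
import torch

q, k, v = (torch.randn(4, 48, 4096, 64, dtype=torch.float16, device="xpu")
           for _ in range(3))

# Workaround suggested by the error message: release unoccupied cached
# blocks held by the XPU caching allocator before the reference call.
torch.xpu.empty_cache()

# Per this comment, the OOM still reproduces afterwards
# (tracked in pytorch/pytorch#135085).
out = torch.nn.functional.scaled_dot_product_attention(q, k, v)
```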

vlad-penkin assigned riverliuintel and unassigned ESI-SYD on Sep 9, 2024

anmyachev (Contributor) commented

Probably pytorch/pytorch#135818 is related to this issue.
