Sporadic llvm-test-suite failures on CUDA back-end. #6463

Open
@bader

Description

@jchlanda, do you think that this failure on NVIDIA GPU https://github.com/intel/llvm/runs/7391421134?check_suite_focus=true is related to this change? This change is not expected to change the default behavior.

It doesn't look related. Even if we ignore the assert messages, which are a bit flaky, the PI runtime should generate error 710 upon hitting the assert:

PI CUDA ERROR:
        Value:           710
        Name:            CUDA_ERROR_ASSERT

but this run reports -999 - Unknown PI error (https://github.com/intel/llvm/runs/7391421134?check_suite_focus=true#step:8:1168)

I can't reproduce this issue locally, so I restarted the CI jobs and it failed again, but with a different error:

error: command failed with exit status: 255

https://github.com/intel/llvm/runs/7450227286?check_suite_focus=true

These look like flaky llvm-test-suite issues not caused by this patch. I'll file a report to investigate them and merge this PR.

Originally posted by @bader in #6378 (comment)

Update: archives with logs: logs_46239.zip, logs_46467.zip
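
When going through the attached logs, it may help to separate the three failure signatures quoted in this issue (the expected CUDA_ERROR_ASSERT, the -999 Unknown PI error, and the exit status 255). The following is only a rough sketch, not part of the project: the function name `classify_log` and the category names are hypothetical, and the patterns assume each message appears on a single line in the same form as the excerpts quoted above.

```python
import re

# Hypothetical category names, one per failure signature quoted in this issue.
# The patterns assume the log text matches the quoted excerpts.
SIGNATURES = [
    ("cuda_assert", re.compile(r"CUDA_ERROR_ASSERT")),
    ("unknown_pi_error", re.compile(r"-999\b.*Unknown PI error")),
    ("exit_255", re.compile(r"command failed with exit status:\s*255")),
]


def classify_log(text):
    """Return the sorted names of all signatures found in a log's text."""
    found = set()
    for line in text.splitlines():
        for name, pattern in SIGNATURES:
            if pattern.search(line):
                found.add(name)
    return sorted(found)
```

To use it, extract the archives and pass the contents of each log file to `classify_log`; an empty result means none of the three signatures was seen in that file.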

Metadata

Assignees

No one assigned

    Labels

    cuda (CUDA back-end), runtime (Runtime library related issue)
