-
Notifications
You must be signed in to change notification settings - Fork 247
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Native API failed. Native API returns: -2 (PI_ERROR_DEVICE_NOT_AVAILABLE) -2 (PI_ERROR_DEVICE_NOT_AVAILABLE) #643
Comments
Hi @MaksimDanilov, thanks for reporting this issue. Could you please help provide the Traceback error from your terminal using Please also verify the driver installation version from here: |
Update issue description 'cause it's only reproduce if I post data to model. |
Is it possible to provide us with a minimum code reproducer for this issue as well as the test image? Thanks. |
Okey. I'll try to make minimal example on weekends. |
Added code to description (it's not production code, however error occurs in it). Also, pasted log with time measures when the exception was thrown. |
Thank you for providing the sample code. I could not reproduce the same error at my end. Could we test if the Intel® oneAPI Base Toolkit 2024.1.0 is installed and sourced correctly? In your collected environment information, I noticed that the DPCPP and MKL versions are listed as N/A.
|
Okey. I didn't invoke these two bat's like that. I use setvars bat in root directory of OneApi. |
@nazneenn Maybe I can search for debug logs if exists to help you understand this problem a bit more? |
@MaksimDanilov, you may try any debugger like gdb or python debugger to get more details on the issue and report back what you see when the core dump happens: |
|
@nazneenn, I can always reproduce this problem now. step by step guide:
|
@nazneenn any help here? |
Hi @MaksimDanilov , |
Hi :-) Still have problem after pushing tensor to GPU after 5-6 minutes (first try pass successfully). When I launch second process and my gpu memory is running low, catch similar error. Maybe problem in swapping or something like that, because the loading freezes after some time. |
Hi @MaksimDanilov ,
|
@yinghu5 Hi, thanks for reply. |
@MaksimDanilov , Thank you for raising the issue to Driver team. I will check with them about the problem internally. Press Any Key To Continue... |
Describe the bug
I made program that waits to input image to be segmentated though U2NET model. After some inputs (can't determine frequency as it is always random), I start to get DEVICE_NOT_AVAILABLE_ERROR when I post tensor to xpu. Can I somehow get debug logs to post here?
* I noticed that with the same program I can't reproduce this error on cpu)
** It happens only if I have transferred data to model. Simple forward data to xpu works file.
Versions
Collecting environment information...
PyTorch version: 2.1.0.post2+cxx11.abi
PyTorch CXX11 ABI: No
IPEX version: 2.1.30+xpu
IPEX commit: 474a6b3
Build type: Release
OS: Microsoft Windows 11 Home
GCC version: N/A
Clang version: N/A
IGC version: 2024.1.0 (2024.1.0.20240308)
CMake version: version 3.28.0-msvc1
Libc version: N/A
Python version: 3.11.9 | packaged by Anaconda, Inc. | (main, Apr 19 2024, 16:40:41) [MSC v.1916 64 bit (AMD64)] (64-bit runtime)
Python platform: Windows-10-10.0.22631-SP0
Is XPU available: True
DPCPP runtime version: N/A
MKL version: N/A
GPU models and configuration:
[0] _DeviceProperties(name='Intel(R) Iris(R) Xe Graphics', platform_name='Intel(R) Level-Zero', dev_type='gpu', driver_version='1.3.29283', has_fp64=0, total_memory=7167MB, max_compute_units=96, gpu_eu_count=96)
Intel OpenCL ICD version: N/A
Level Zero version: N/A
CPU:
Revision=
Versions of relevant libraries:
[pip3] intel-extension-for-pytorch==2.1.30+xpu
[pip3] numpy==1.26.1
[pip3] torch==2.1.0.post2+cxx11.abi
[pip3] torchaudio==2.1.0.post2+cxx11.abi
[pip3] torchvision==0.16.0.post2+cxx11.abi
[conda] intel-extension-for-pytorch 2.1.30+xpu pypi_0 pypi
[conda] numpy 1.26.1 pypi_0 pypi
[conda] torch 2.1.0.post2+cxx11.abi pypi_0 pypi
[conda] torchaudio 2.1.0.post2+cxx11.abi pypi_0 pypi
[conda] torchvision 0.16.0.post2+cxx11.abi pypi_0 pypi
Code
The text was updated successfully, but these errors were encountered: