-
Notifications
You must be signed in to change notification settings - Fork 797
Description
Describe the bug
A clear and concise description of what the bug is.
I get an error when running the program below.
PI CUDA ERROR:
Value: 719
Name: CUDA_ERROR_LAUNCH_FAILED
Description: unspecified launch failure
Function: wait
Source Location: /root/intel-llvm-mirror/sycl/plugins/cuda/pi_cuda.cpp:653
terminate called after throwing an instance of 'sycl::_V1::runtime_error'
what(): Native API failed. Native API returns: -999 (Unknown PI error) -999 (Unknown PI error)
To Reproduce
Please describe the steps to reproduce the behavior:
- Include code snippet as short as possible
#include <sycl/sycl.hpp>
int main() {
int i = 0;
sycl::queue q{sycl::gpu_selector_v};
sycl::buffer<int> buf_mod{&i, 1};
q.submit([&](sycl::handler &h) {
sycl::accessor mod{buf_mod, h, sycl::write_only};
h.single_task([=] {
sycl::atomic_ref<int, sycl::memory_order::seq_cst,
sycl::memory_scope::device, sycl::access::address_space::global_space> a{mod[0]};
++a;
});
}).wait_and_throw();
std::cout << i << std::endl;
}
- Specify the command which should be used to compile the program
clang++ -O3 -DNDEBUG main.cpp -fsycl -fsycl-targets=nvptx64-nvidia-cuda
- Specify the comment which should be used to launch the program
./a.out
- Indicate what is wrong and what was expected
program should run without errors.
Environment (please complete the following information):
-
OS: [e.g Windows/Linux]
Linux -
Target device and vendor: [e.g. Intel GPU]
Nvidia GPU -
DPC++ version: [e.g. commit hash or output of
clang++ --version
]
Intel(R) oneAPI DPC++/C++ Compiler 2023.2.0 (2023.2.0.20230622)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/intel/oneapi/compiler/2023.2.0/linux/bin-llvm -
Dependencies version: [e.g. low-level runtime versions (like NEO 20.04)]
Additional context
Add any other context about the problem here.
If the sycl::atomic_ref
is using sycl::memory_order::acq_rel
or sycl::memory_order::relaxed
, then it runs without errors.