[SYCL] Gracefully handle unknown device #11254

al42and · 2023-09-21T14:59:11Z

Don't throw ICE when an unknown device is specified explicitly via -Xsycl-target-backend --offload-arch=. We don't enable macros or other niceties from sycl_ext_oneapi_device_architecture, but at least the code compiles.

Fixes #8112, #11203, #12010

Don't throw ICE when an unknown device is specified explicitly via `-Xsycl-target-backend --offload-arch=`. We don't enable macros or other niceties from sycl_ext_oneapi_device_architecture, but at least the code compiles. Fixes intel#8112, intel#11203

al42and · 2023-09-22T15:38:57Z

I am not sure how the CI failures could be related to this change. Are they known to be flaky?

mdtoguchi · 2023-09-25T22:38:57Z

clang/lib/Driver/ToolChains/Clang.cpp

          Triple.isNVPTX() || Triple.isAMDGCN()) {
        StringRef Device = JA.getOffloadingArch();
-        if (!Device.empty()) {
+        if (!Device.empty() && !SYCL::gen::getGenDeviceMacro(Device).empty()) {


Should we add a test for this circumstance? Also, would it be reasonable to emit some kind of diagnostic that the macro isn't being generated due to an issue with the device value?

Should we add a test for this circumstance?

Could you advise on the best place to put a test? clang/test/Driver/sycl-oneapi-gpu.cpp?

Also, would it be reasonable to emit some kind of diagnostic that the macro isn't being generated due to an issue with the device value?

I don't think macro alone warrants this. It's prone to false positives (what if the code don't use these macros at all?), and occurs at the wrong place (it's not the problem of the user that they have gfx1036, it's the problem of the developer who wrote code relying a macro __SYCL_TARGET_AMD_GPU_GFX1036__). Warning about unsupported macros in the code (__SYCL_TARGET_* will never be set) independently of the target architecture could will be better: it will be emitted if and only if there is an actual bug in the code, but this is already covered by -Wundef (since, if the macro is supported, it will be defined to 0, https://github.com/intel/llvm/blob/de92299c2c09d626dcbc633c83e259d828220d03/sycl/doc/design/DeviceIf.md).

We could, more generally, warn about "unsupported" devices, the way Clang warns about "unsupported" CUDA versions (-Wunknown-cuda-version). I don't think it fits the scope of this PR, though.

I agree with @al42and . If you read the DeviceIf.md, it is quite clear IMO:

"These macros are an internal implementation detail, so they should not be documented to users, and user code should not make use of them."

@al42and I think the answer to your question is that the appropriate place to place such a test is in clang/test/Driver: I guess you just want something similar to this: https://github.com/JackAKirk/llvm/blob/2b4b45af5a1ae0d23bcb632cc4588faecedc1956/clang/test/Driver/sycl-fno-libspirv-warn.cpp

except you want to check there are no warnings or errors when compiling for a device that e.g doesn't have a macro assigned but is supported by the driver.

except you want to check there are no warnings or errors when compiling for a device that e.g doesn't have a macro assigned but is supported by the driver.

That's going to be an unreliable test and in need of regular updates. Hopefully, such architectures will eventually get assigned a flag :)

I think a better test is to check that when compiling for gfx0000 or sm_00 we get a reasonable error instead of an ICE.

I think a better test is to check that when compiling for gfx0000 or sm_00 we get a reasonable error instead of an ICE.

Unfortunately, the validity of the architecture is checked before the broken compiler pass is invoked, so this idea will not work.

Instead, I'm using an old architecture (e.g., sm_30), that is valid but is not and will not be supported.

Looks like NVIDIA device libraries are not as reliable as I thought they are. Changed the test to use an old AMD architecture (gfx600, supported by LLVM, not listed in clang/lib/Driver/ToolChains/SYCL.cpp) instead.

These tests are harder than I expected 😅 Hopefully I'll get "Build + LIT" to pass this time with 081d821.

The timeout of Basic/vector/int-convert.cpp in e2e / Intel GEN12 Graphics with Level Zero is unrelated: #12011

Seems more stable

al42and requested a review from a team as a code owner September 21, 2023 14:59

al42and mentioned this pull request Sep 21, 2023

ICE when compiling for gfx1036 #11203

Closed

al42and temporarily deployed to WindowsCILock September 21, 2023 19:57 — with GitHub Actions Inactive

al42and had a problem deploying to WindowsCILock September 21, 2023 20:29 — with GitHub Actions Failure

mdtoguchi reviewed Sep 25, 2023

View reviewed changes

tom91136 mentioned this pull request Oct 4, 2023

[SYCL][HIP] Poor Memory Bandwidth due to Unnecessary Memory Write Traffic, 50% slower than OpenSYCL #10624

Closed

al42and added 2 commits November 27, 2023 19:19

Merge remote-tracking branch 'origin/sycl' into fix-8112

2f197d8

Add test

4dcf5b6

bader requested review from JackAKirk and mdtoguchi November 27, 2023 19:05

mdtoguchi approved these changes Nov 27, 2023

View reviewed changes

al42and had a problem deploying to WindowsCILock November 28, 2023 00:31 — with GitHub Actions Failure

al42and temporarily deployed to WindowsCILock November 28, 2023 07:35 — with GitHub Actions Inactive

Use AMDGCN target instead of NVPTX for testing

3d16390

Seems more stable

al42and had a problem deploying to WindowsCILock November 28, 2023 11:42 — with GitHub Actions Failure

al42and had a problem deploying to WindowsCILock November 28, 2023 12:15 — with GitHub Actions Failure

Add nogpulib flag to tests

081d821

al42and temporarily deployed to WindowsCILock November 28, 2023 13:29 — with GitHub Actions Inactive

al42and temporarily deployed to WindowsCILock November 28, 2023 14:11 — with GitHub Actions Inactive

againull merged commit 5a51c0a into intel:sycl Nov 28, 2023

al42and deleted the fix-8112 branch November 29, 2023 14:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SYCL] Gracefully handle unknown device #11254

[SYCL] Gracefully handle unknown device #11254

Uh oh!

al42and commented Sep 21, 2023 •

edited

Loading

Uh oh!

al42and commented Sep 22, 2023

Uh oh!

mdtoguchi Sep 25, 2023

Uh oh!

al42and Sep 26, 2023 •

edited

Loading

Uh oh!

JackAKirk Oct 10, 2023

Uh oh!

al42and Oct 21, 2023

Uh oh!

al42and Nov 27, 2023

Uh oh!

al42and Nov 28, 2023 •

edited

Loading

Uh oh!

al42and Nov 28, 2023 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[SYCL] Gracefully handle unknown device #11254

[SYCL] Gracefully handle unknown device #11254

Uh oh!

Conversation

al42and commented Sep 21, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

al42and commented Sep 22, 2023

Uh oh!

mdtoguchi Sep 25, 2023

Choose a reason for hiding this comment

Uh oh!

al42and Sep 26, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

JackAKirk Oct 10, 2023

Choose a reason for hiding this comment

Uh oh!

al42and Oct 21, 2023

Choose a reason for hiding this comment

Uh oh!

al42and Nov 27, 2023

Choose a reason for hiding this comment

Uh oh!

al42and Nov 28, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

al42and Nov 28, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

al42and commented Sep 21, 2023 •

edited

Loading

al42and Sep 26, 2023 •

edited

Loading

al42and Nov 28, 2023 •

edited

Loading

al42and Nov 28, 2023 •

edited

Loading