[SYCL][HIP][libclc] Wire up AMD half support #11626

MartinWehking · 2023-10-23T14:44:23Z

Enables fp16 support for AMD GPUs.
Based on Zheming's previous work: #8488
Some test cases, e.g. those for images, were disabled because they aren't supported.

Draft PR for enabling fp16 support in the unified-runtime: oneapi-src/unified-runtime#988

jinz2014 · 2023-10-23T15:00:26Z

Thank you for your extensions and support!

JackAKirk · 2023-10-23T15:03:09Z

Looks good, I'll wait for the CI testing to complete though.

ldrumm · 2023-10-23T15:14:33Z

Our current squash policy is going to mangle the authorship of this patchset. What are we going to do about it?

JackAKirk · 2023-10-24T08:45:12Z

You can see, e.g. from the log of the unexpectedly passing test reduction_nd_ext_half.cpp, that the CI run was skipping all the tests requiring aspect::fp16. I think you either need to get oneapi-src/unified-runtime#988 merged first or, if there is a way of pointing the CI to that unified-runtime PR branch, do that. Otherwise we can't see whether any of the affected tests are passing on the CI.

JackAKirk · 2023-10-24T10:13:11Z

sycl/test-e2e/Reduction/reduction_nd_ext_half.cpp

@@ -6,9 +6,6 @@
 // work group size not bigger than 1`.
 // XFAIL: hip_nvidia

-// Incorrect result on AMD.


I don't think it makes sense to remove the XFAIL: the test was only unexpectedly passing because it was skipped, since the CI device didn't have the fp16 aspect. It will have the aspect once you merge the unified-runtime patch.

Ah yes, I managed to confuse myself there.
We still need to assume, though, that a user might use an older version of the UR repo after this PR is merged, so it's better to make the test always require fp16 support (i.e. by adding the comment "// REQUIRES: aspect-fp16") and also keep the "XFAIL", since the result is always wrong on AMD GPUs.

Yeah that makes sense
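The outcome of this thread can be sketched as a test header. This is only an illustration, not the actual contents of reduction_nd_ext_half.cpp: the "REQUIRES" line is the one quoted above, while the feature name `hip_amd` on the XFAIL line is an assumption modelled on the existing `hip_nvidia` line, and the test body is replaced by a placeholder.

```cpp
// Sketch of the agreed lit header (not the real test file).
//
// Skip the test entirely on devices that do not report the fp16 aspect,
// e.g. when an older unified-runtime without fp16 support for HIP is used.
// REQUIRES: aspect-fp16
//
// Incorrect result on AMD.
// The feature name on the next line is assumed, by analogy with hip_nvidia.
// XFAIL: hip_amd

// Placeholder body: the actual reduction test code is not reproduced here.
int main() { return 0; }
```

With both directives present, a device without the fp16 aspect reports the test as unsupported instead of as an unexpected pass, and a device with the aspect still records the known AMD failure.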

JackAKirk · 2023-10-24T17:43:35Z

libclc/amdgcn-amdhsa/libspirv/math/lgamma.cl

 #define __CLC_BUILTIN_F __CLC_XCONCAT(__CLC_BUILTIN, _f32)
+
+#ifdef cl_khr_fp64
+#pragma OPENCL EXTENSION cl_khr_fp64 : enable


Probably a nit, but I don't understand this: why enable an OpenCL extension on a non-OpenCL backend? Presumably cl_khr_fp64 was already working before this patch, so I don't imagine this changes any behaviour.

This has already been resolved in internal discussions, but I'd still like to post it here as it might be interesting for others as well.
The fact that AMDGPU is not an OpenCL target doesn't matter here.
We enable the OpenCL fp64 extension because libclc is compiled by an OpenCL C compiler (i.e. the sources need to comply with the OpenCL C language specification).
Without the extension enabled, an fp64 literal declared inside this .cl file could be compiled as a float.
This is where vanilla OpenCL C deviates from the C standard.
That could lead to unexpected behaviour in the future, so it's safer to enable the extension.

For the fp16 extension, which this patch also enables in several .cl files, it's a different story.
Not enabling it (i.e. removing the pragmas) leads to compile-time errors saying that the half type is not allowed.
The extension is needed to make half usable as a type at all:
https://registry.khronos.org/OpenCL/specs/3.0-unified/html/OpenCL_C.html#reinterpreting-types-using-as_type-and-as_typen
(footnote 15)

This is also why @jinz2014 added these pragmas and wrapped them inside the ifdef macro checks.
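To make the fp64 point above concrete, here is a minimal sketch of what is lost when a double-precision literal ends up being treated as a float. It is written in standard C++ rather than OpenCL C, so the narrowing has to be spelled out explicitly with an `f` suffix; the names `kAsDouble`, `kViaFloat` and `absDiff` are made up for this illustration and do not appear in libclc.

```cpp
// Illustration only: standard C++ always types an unsuffixed floating
// literal as double, so the narrowing described above is emulated here
// by writing the literal with an 'f' suffix and widening it again.

// The literal kept at full double precision.
constexpr double kAsDouble = 0.1;

// The same literal rounded to float first, then widened back to double.
constexpr double kViaFloat = static_cast<double>(0.1f);

// Absolute difference, usable in constant expressions.
constexpr double absDiff(double a, double b) { return a > b ? a - b : b - a; }

// The two values differ: rounding through float discards mantissa bits.
static_assert(kAsDouble != kViaFloat,
              "a literal narrowed to float loses precision");

// The error is on the order of 1e-9, far above double rounding error.
static_assert(absDiff(kAsDouble, kViaFloat) > 1e-9,
              "error is visible at double precision");
```

A math routine that relies on double-precision constants would silently pick up an error of this size, which is the kind of unexpected behaviour the pragma guards against.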

JackAKirk

LGTM

MartinWehking · 2023-10-26T15:11:21Z

ping @intel/llvm-gatekeepers to get this merged

JackAKirk · 2023-10-26T15:34:11Z

ping @intel/llvm-gatekeepers to get this merged

I think this needs a review from @intel/dpcpp-l0-pi-reviewers first

smaslov-intel · 2023-10-26T15:52:51Z

I don't see any changes to the files owned by @intel/dpcpp-l0-pi-reviewers, but I gave my provisional approval (without a review).

smaslov-intel

LGTM

MartinWehking · 2023-10-26T15:56:37Z

@smaslov-intel I had to change the UR vars in sycl/plugins/unified_runtime/CMakeLists.txt at some point to make the CI use my UR patch. This automatically requested a review from you.
I reverted those changes in 568ea41.

Jin Z and others added 13 commits October 23, 2023 10:39

Add half-precision math functions in the libclc for AMD devices

a2fe7c4

cl_khr_fp16 also needs to be enabled when building libclc

6bd901d

Update half-precision math functions in the libclc for AMD devices

d9f7b77

Update ilogb.cl for the function redefinition error

9a9fcdb

Add implementations for group and subgroup builtins to extend fp16 support for AMD

d0ff9e1

Fence off fp16 functions for sub groups to reduce extension reliance

423a6c6

Reformat code

37a110a

Enable more tests

315d52c

Add more support for halves

6440a8f

Rename half type names to fix linking issues

df3f104

Fix naming

a89ce59

Remove unused file + fence off half symbol

ef25467

Mark failing test as xfail for AMD GPU

1859961

MartinWehking requested review from a team as code owners October 23, 2023 14:44

MartinWehking requested review from dm-vodopyanov and JackAKirk October 23, 2023 14:44

MartinWehking temporarily deployed to WindowsCILock October 23, 2023 15:17 — with GitHub Actions Inactive

MartinWehking mentioned this pull request Oct 23, 2023

Enable fp16 runtime support for hip oneapi-src/unified-runtime#988

Merged

Remove redundant pragmas for fp16 extension switch

4d52d28

MartinWehking temporarily deployed to WindowsCILock October 23, 2023 16:22 — with GitHub Actions Inactive

MartinWehking temporarily deployed to WindowsCILock October 23, 2023 17:30 — with GitHub Actions Inactive

dm-vodopyanov approved these changes Oct 23, 2023

Remove xfail from passing test

ba3fb0c

MartinWehking temporarily deployed to WindowsCILock October 24, 2023 10:11 — with GitHub Actions Inactive

JackAKirk reviewed Oct 24, 2023

MartinWehking temporarily deployed to WindowsCILock October 24, 2023 10:32 — with GitHub Actions Inactive

Reintroduce xfail and require fp16 support for test

ea4cb7a

MartinWehking temporarily deployed to WindowsCILock October 24, 2023 10:46 — with GitHub Actions Inactive

MartinWehking temporarily deployed to WindowsCILock October 24, 2023 11:05 — with GitHub Actions Inactive

Make CI use modified UR repo

624495b

MartinWehking requested a review from a team as a code owner October 24, 2023 15:09

MartinWehking marked this pull request as draft October 24, 2023 15:15

MartinWehking had a problem deploying to WindowsCILock October 24, 2023 15:26 — with GitHub Actions Failure

Fix cmake var and set it to correct 'tag'

51fe931

MartinWehking temporarily deployed to WindowsCILock October 24, 2023 15:42 — with GitHub Actions Inactive

MartinWehking temporarily deployed to WindowsCILock October 24, 2023 16:16 — with GitHub Actions Inactive

JackAKirk reviewed Oct 24, 2023

JackAKirk approved these changes Oct 25, 2023

Revert 51fe931

568ea41

MartinWehking temporarily deployed to WindowsCILock October 25, 2023 16:08 — with GitHub Actions Inactive

MartinWehking temporarily deployed to WindowsCILock October 25, 2023 16:46 — with GitHub Actions Inactive

MartinWehking marked this pull request as ready for review October 26, 2023 15:12

smaslov-intel approved these changes Oct 26, 2023

againull merged commit 8987549 into intel:sycl Oct 26, 2023

npmiller mentioned this pull request Nov 27, 2023

[SYCL][HIP] Required aspect fp16 is not supported on the device #8330

Closed
