-
Notifications
You must be signed in to change notification settings - Fork 797
Description
Describe the bug
In #12838 it seems like the submission time on HIP is giving weird values. I did a bit of digging and it seems to me like HIP is a little different from CUDA when checking timing-differences between events. Of particular interest here is the following line for hipEventElapsedTime():
Events which are recorded in a NULL stream will block until all commands on all other streams complete execution, and then record the timestamp.
While what we expect here is to get an event with the current time, hence using an otherwise unused stream. Fixing it might be outside the scope of this PR, but a possible solution could be to lazily have a stream specifically for recording submission time of events, tied to the context. Similar could be used in the CUDA backend to avoid the assumption noted above.
Originally posted by @steffenlarsen in oneapi-src/unified-runtime#1400 (comment)
To reproduce
No response
Environment
No response
Additional context
This affects #12838, but should be reproducible on normal profiling queues. When this is fixed, sycl/test-e2e/ProfilingTag/ tests should be enabled for HIP.