@Hardcode84
Contributor

@Hardcode84 Hardcode84 commented Jun 27, 2023

  • Use 2D dispatch in the kernel implementations instead of a huge sequential inner loop.
  • Use nested prange in the numba_mlir_p implementation; numba and numba-dpex don't support nested pranges, but numba-mlir does.
  • Remove the workaround in numba_mlir_n.
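
For readers without the diff at hand, the difference between the two dispatch shapes can be sketched roughly as below. This is a plain-Python emulation, not the code from this PR: it assumes a pairwise-distance workload (a guess based on the branch name), and `distance`, `kernel_1d`, `kernel_2d`, `launch_1d`, `launch_2d`, and `pairwise_nested` are hypothetical names made up for illustration. `prange` is aliased to `range` so the sketch runs without numba installed.

```python
import math

# Stand-in so the sketch runs without numba; the real implementations
# would use numba's prange here (assumption for illustration only).
prange = range


def distance(a, b):
    """Euclidean distance between two points given as equal-length sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def kernel_1d(i, X1, X2, D):
    """1D dispatch: one work item per row, with a sequential inner loop over j."""
    for j in range(len(X2)):
        D[i][j] = distance(X1[i], X2[j])


def kernel_2d(i, j, X1, X2, D):
    """2D dispatch: one work item per (i, j) pair, no inner loop over j."""
    D[i][j] = distance(X1[i], X2[j])


def launch_1d(kernel, n, *args):
    """Emulate a 1D launch by visiting every index in range(n)."""
    for i in range(n):
        kernel(i, *args)


def launch_2d(kernel, n, m, *args):
    """Emulate a 2D launch by visiting every (i, j) in the n x m index space."""
    for i in range(n):
        for j in range(m):
            kernel(i, j, *args)


def pairwise_nested(X1, X2):
    """Nested-prange formulation: both loops are marked as parallel."""
    D = [[0.0] * len(X2) for _ in range(len(X1))]
    for i in prange(len(X1)):
        for j in prange(len(X2)):
            D[i][j] = distance(X1[i], X2[j])
    return D


X1 = [(0.0, 0.0), (3.0, 4.0)]
X2 = [(0.0, 0.0), (6.0, 8.0), (3.0, 0.0)]

D1 = [[0.0] * len(X2) for _ in X1]
D2 = [[0.0] * len(X2) for _ in X1]
launch_1d(kernel_1d, len(X1), X1, X2, D1)
launch_2d(kernel_2d, len(X1), len(X2), X1, X2, D2)
D3 = pairwise_nested(X1, X2)
```

All three variants fill the same matrix. The 2D form exposes one independent work item per output element, so a device can spread the whole `n x m` index space across its threads instead of running each row's inner loop sequentially.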

@Hardcode84
Contributor Author

@diptorupd ping

@Hardcode84
Contributor Author

@diptorupd @ZzEeKkAa ping

Contributor

@ZzEeKkAa ZzEeKkAa left a comment

The code design looks good to me, but I don't have enough context to review the logic.

@diptorupd diptorupd left a comment

Good changes. Thank you!

@diptorupd diptorupd merged commit ca5a710 into IntelPython:main Jul 31, 2023
@Hardcode84 Hardcode84 deleted the pairwise-fixes branch August 1, 2023 18:32

3 participants