@Intron7 Intron7 commented Jun 26, 2024

Benchmark on a 500k x 5k AnnData object with an AMD 5950X:

axis=0
Intel_kernel = 223 ms
new = 223 ms

axis=1
Intel_kernel = 3.35 s
new = 203 ms

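The new implementation has a better memory access pattern for axis=1, where it is roughly 16x faster than the Intel kernel (203 ms vs. 3.35 s). The axis=0 timing is unchanged at 223 ms.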
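
To illustrate what a memory-friendly access pattern can look like, here is a minimal sketch. It is not the kernel from this PR: the comment does not show the code or name the operation that was benchmarked. The sketch assumes a plain sum over a CSR-style sparse matrix, and the function name `csr_axis_sums` is hypothetical. The point is only that a reduction along either axis can be computed while walking the stored values sequentially, rather than jumping around in memory.

```python
def csr_axis_sums(data, indices, indptr, n_cols):
    """Sum a CSR matrix along both axes in one sequential pass over `data`.

    Hypothetical helper for illustration only; assumes standard CSR arrays
    (`data`, `indices`, `indptr`) as used by common sparse libraries.
    """
    n_rows = len(indptr) - 1
    col_sums = [0.0] * n_cols  # equivalent to summing over axis 0
    row_sums = [0.0] * n_rows  # equivalent to summing over axis 1
    for row in range(n_rows):
        # The stored values of one row sit next to each other in `data`,
        # so this inner loop reads memory contiguously.
        for k in range(indptr[row], indptr[row + 1]):
            value = data[k]
            row_sums[row] += value
            col_sums[indices[k]] += value
    return col_sums, row_sums


# 3 x 4 example matrix:
# [[1, 0, 2, 0],
#  [0, 0, 3, 0],
#  [4, 5, 0, 6]]
data = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
indices = [0, 2, 2, 0, 1, 3]
indptr = [0, 2, 3, 6]

col_sums, row_sums = csr_axis_sums(data, indices, indptr, n_cols=4)
print(col_sums)  # [5.0, 5.0, 5.0, 6.0]
print(row_sums)  # [3.0, 3.0, 15.0]
```

In a compiled or JIT-compiled kernel, the same traversal order keeps reads of `data` and `indices` sequential for both reductions. Whether that matches the approach taken in this PR is an assumption.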
