Skip to content

Conversation

@mcfi
Copy link
Contributor

@mcfi mcfi commented Nov 14, 2025

Summary:

  1. Tweaked permute_mask_v so that we can remove pair-wise duplication by _mm256_moveldup_ps
  2. Removed max_v since _mm256_min_epu8(xyzw_packed_v, max_v) is just xyzw_packed_v.

Differential Revision: D86371930

Summary:
1. Tweaked permute_mask_v so that we can remove pair-wise duplication by _mm256_moveldup_ps
2. Removed max_v since _mm256_min_epu8(xyzw_packed_v, max_v) is just xyzw_packed_v.

Differential Revision: D86371930
@meta-cla meta-cla bot added the cla signed label Nov 14, 2025
@meta-codesync
Copy link
Contributor

meta-codesync bot commented Nov 14, 2025

@mcfi has exported this pull request. If you are a Meta employee, you can view the originating Diff in D86371930.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant