[DiskBBQ] Speed up OSQ bit transformations

### Description

Through benchmarking, I noticed that much of our time for quantizing query vectors (almost half?!?) we spend just transforming from flat `int4` to the striped pattern of int4. 

The code seems like it SHOULD be able to be vectorizable. 

Or, maybe the solution is to transpose while getting the sum in sum-byte? 

I am not sure, but its hilarious that spend almost as much time doing this loop as we do actually calculating the intervals and applying them to the vector.

<img width="1211" height="160" alt="Image" src="https://github.com/user-attachments/assets/509af8ff-d8e4-4859-827b-c4cb1618a080" />

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[DiskBBQ] Speed up OSQ bit transformations #132761

Description

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[DiskBBQ] Speed up OSQ bit transformations #132761

Description

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions