-
Notifications
You must be signed in to change notification settings - Fork 25.6k
Closed
Labels
:Search Relevance/VectorsVector searchVector search>enhancementTeam:Search RelevanceMeta label for the Search Relevance team in ElasticsearchMeta label for the Search Relevance team in Elasticsearch
Description
Description
Through benchmarking, I noticed that much of our time for quantizing query vectors (almost half?!?) we spend just transforming from flat int4 to the striped pattern of int4.
The code seems like it SHOULD be able to be vectorizable.
Or, maybe the solution is to transpose while getting the sum in sum-byte?
I am not sure, but its hilarious that spend almost as much time doing this loop as we do actually calculating the intervals and applying them to the vector.

Metadata
Metadata
Assignees
Labels
:Search Relevance/VectorsVector searchVector search>enhancementTeam:Search RelevanceMeta label for the Search Relevance team in ElasticsearchMeta label for the Search Relevance team in Elasticsearch