Skip to content

Consider increasing the dimension limit for vector fields. #40492

@jtibshirani

Description

@jtibshirani

The dense_vector and sparse_vector fields place a hard limit of 500 on the number of dimensions per vector. However, many of the common pretrained text embeddings like BERT, ELMo, and Universal Sentence Encoder produce vectors of larger dimensions, typically ranging from 512 to 1024.

Currently users must truncate the vectors, or perform an additional dimensionality reduction step. Perhaps we could make the dimension limit configurable, or at least increase it to a larger value?

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions