Skip to content

[FEATURE REQUEST] add support for random prefixes (MURMUR3 S3 hash) #113

@TomerHeber

Description

@TomerHeber

Is your feature request related to a problem? Please describe.

AWS S3 has hard limits for each prefix index.
The best way to scale with S3 is to create a random prefix for each object/file.

This is described here:
https://aws.amazon.com/blogs/big-data/improve-operational-efficiencies-of-apache-iceberg-tables-built-on-amazon-s3-data-lakes/ (write.object-storage.enabled'=true).

Setting this option in Polaris, returns a "temporarily unsupported" error.

Describe the solution you'd like

Below is an example:

bucket//catalog/namespace/table/file1
bucket//catalog/namespace/table/file2
etc...

This strategy removes any hard limits concerns with S3 buckets.

Describe alternatives you've considered

For each catalog generate set random location prefix:
bucket/<random_location_prefix>/namespace/table

Additional context

The current limitation may be due to RBAC concerns.
However, I could be wrong, but the desired solution may still be able support RBAC as required.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions