Partition writes not creating expected directory hierarchy on S3 (MinIO) #891

@hackintoshrao

Hey folks,

I opened PR #890 to validate this issue. Please let me know whether the assumptions I made in the integration test are correct.

Summary

I wrote the integration test to check that partitioned data is written into a partition-specific directory (e.g., id=100/). However, after the partitioned DataFile is committed to the table, no objects appear under the expected id=100/ prefix in MinIO. Instead, the data file is written directly under the data directory (e.g., data/test-00000.parquet), with no partition directory.

Here is the error from the test:

No objects found under prefix demo/iceberg/rust/t1/data/id=100/ - partition layout may not be correct.

Based on the table metadata, it looks like the data file is located at s3://icebergdata/demo/iceberg/rust/t1/data/test-00000.parquet, but no partition subdirectory (id=100/) is used.
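
To make the expected layout concrete, here is a minimal sketch of the path check the test is effectively doing. Both helpers (`partitioned_data_path` and `is_under_partition`) are hypothetical names I made up for illustration, not part of the iceberg-rust API, and the sketch assumes a single identity-partition field laid out Hive-style as `<field>=<value>/` under the table's `data/` directory.

```rust
/// Hypothetical helper (not an iceberg-rust API): builds the object path the
/// test expects for one identity-partition field, assuming a Hive-style
/// `<field>=<value>/` directory under `<table_location>/data/`.
fn partitioned_data_path(table_location: &str, field: &str, value: &str, file_name: &str) -> String {
    format!(
        "{}/data/{}={}/{}",
        table_location.trim_end_matches('/'),
        field,
        value,
        file_name
    )
}

/// Hypothetical helper: returns true when `path` sits under the
/// `<field>=<value>/` directory of the table's data folder.
fn is_under_partition(path: &str, table_location: &str, field: &str, value: &str) -> bool {
    let prefix = format!(
        "{}/data/{}={}/",
        table_location.trim_end_matches('/'),
        field,
        value
    );
    path.starts_with(&prefix)
}

fn main() {
    let table = "s3://icebergdata/demo/iceberg/rust/t1";

    // Path the test expects the data file to have.
    let expected = partitioned_data_path(table, "id", "100", "test-00000.parquet");
    println!("expected: {expected}");

    // Path reported by the table metadata in this issue.
    let observed = "s3://icebergdata/demo/iceberg/rust/t1/data/test-00000.parquet";
    println!(
        "observed is under id=100/: {}",
        is_under_partition(observed, table, "id", "100")
    );
}
```

With the path from the table metadata above, the check returns false, which matches the "No objects found under prefix" failure.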


Steps to Reproduce

  • Pull the PR with the integration test.
  • Run the integration test: RUST_BACKTRACE=1 cargo test --test append_partition_data_file_test -- --nocapture

Once the issue is confirmed, I'll work on a quick fix.
