Generation of Snapshot Summaries #724

@Fokko

Description

With each snapshot comes a summary map, which is optional in V1 and required in V2 and later.

The summary contains information such as which kinds of files the snapshot touches (data or delete files) and how the row counts and byte sizes changed. The best way to replicate this metrics collection is to follow the Java implementation: https://github.com/apache/iceberg/blob/main/core/src/main/java/org/apache/iceberg/SnapshotSummary.java

This class works closely with the SnapshotProducer and tracks the files that are added and removed while the snapshot is being produced.
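To make the idea concrete, here is a rough sketch of what such a collector could look like. It is not the Java implementation and not an existing API: `FileContent`, `FileInfo`, and `SummaryCollector` are hypothetical names made up for illustration, and only a small subset of the metrics is covered. The property keys are meant to mirror the constants in the Java `SnapshotSummary` class as I understand them, so please verify them against the Java source and the spec.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Dict


class FileContent(Enum):
    DATA = "data"
    DELETES = "deletes"


@dataclass(frozen=True)
class FileInfo:
    """Hypothetical stand-in for the file metadata found in a manifest entry."""

    content: FileContent
    record_count: int
    file_size_in_bytes: int


class SummaryCollector:
    """Accumulates counters while a snapshot is produced (illustrative only)."""

    def __init__(self) -> None:
        self._counters: Dict[str, int] = {}

    def _bump(self, key: str, amount: int) -> None:
        self._counters[key] = self._counters.get(key, 0) + amount

    def add_file(self, f: FileInfo) -> None:
        if f.content is FileContent.DATA:
            self._bump("added-data-files", 1)
            self._bump("added-records", f.record_count)
        else:
            # Simplification: per-type delete counters are left out here.
            self._bump("added-delete-files", 1)
        self._bump("added-files-size", f.file_size_in_bytes)

    def remove_file(self, f: FileInfo) -> None:
        if f.content is FileContent.DATA:
            self._bump("deleted-data-files", 1)
            self._bump("deleted-records", f.record_count)
        else:
            self._bump("removed-delete-files", 1)
        self._bump("removed-files-size", f.file_size_in_bytes)

    def build(self, operation: str) -> Dict[str, str]:
        # The summary is a string-to-string map, so counters are stringified.
        summary = {"operation": operation}
        for key, value in sorted(self._counters.items()):
            if value > 0:
                summary[key] = str(value)
        return summary
```

A snapshot producer would call `add_file` and `remove_file` for each file it touches and then call `build("append")` (or another operation name) when writing the snapshot. Running totals such as the total record count would additionally need the previous snapshot's summary, which this sketch leaves out.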
