Skip to content

Extend partition statistics support to unpartitioned tables #14375

@ebyhr

Description

@ebyhr

Proposed Change

Currently, the partition statistics file is only supported for partitioned tables.

Preconditions.checkArgument(Partitioning.isPartitioned(table), "Table must be partitioned");

While this design makes sense since it’s conceptually a “partition” statistics file, the information it contains is also valuable for unpartitioned tables.

For example, the Trino Iceberg connector currently needs to read manifest files (TableStatisticsReader) to build internal table statistics. If Iceberg provided a statistics file for unpartitioned tables as well, Trino (and other query engines) could leverage it directly, improving planning performance by avoiding expensive manifest reads.

“Extending partition statistics support” is just one possible approach. We are open to other options as well.

Proposal document

No response

Specifications

  • Table
  • View
  • REST
  • Puffin
  • Encryption
  • Other

Metadata

Metadata

Assignees

No one assigned

    Labels

    proposalIceberg Improvement Proposal (spec/major changes/etc)

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions