fix(iceberg): normalize float/double stats before encoding by parisni · Pull Request #798 · apache/incubator-xtable

parisni · 2026-02-03T10:59:56Z

Important Read

GitHub issue: TBD

What is the purpose of the pull request

This pull request normalizes float and double stats before encoding Iceberg column bounds.

Brief change log

Normalize min/max stat values for FLOAT/DOUBLE before Conversions.toByteBuffer.
Added helper to coerce numeric stats to the expected primitive type.

Verify this pull request

This pull request is a trivial rework / code cleanup without any test coverage.

vinishjail97

Can we add a test for this? I want to understand the bug.

… table

parisni · 2026-02-05T14:42:35Z

Hi @vinishjail97 added a test. Also generalized the fix to any type evolution.
I got the problem on hudi tables that had type evolution such float->double or even string->int

problem is the old parquet footer have statistics in a format that differ on the current iceberg table. The idea is to best effort coerce them.

the-other-tim-brown · 2026-02-05T16:15:09Z

@parisni if the issue is on the Hudi side then the proper fix is to move this to the Hudi side. Otherwise every target needs to understand the output types from Hudi. The stats are meant to match the schema type according to the docs for ranges: https://github.com/apache/incubator-xtable/blob/main/xtable-api/src/main/java/org/apache/xtable/model/stat/Range.java#L40

parisni · 2026-02-05T20:50:27Z

if the issue is on the Hudi side

Not sure about that. Xtable get the hudi stats from the parquet files, not from a hudi api. So my understanding is that it's xtable responsibility to coerce stats coming from the parquet footer in case type evolution did happen.

…

On February 5, 2026 4:51:17 PM UTC, Tim Brown ***@***.***> wrote: the-other-tim-brown left a comment (apache/incubator-xtable#798) @parisni if the issue is on the Hudi side then the proper fix is to move this to the Hudi side. Otherwise every target needs to understand the output types from Hudi. The stats are meant to match the schema type according to the docs for ranges: https://github.com/apache/incubator-xtable/blob/main/xtable-api/src/main/java/org/apache/xtable/model/stat/Range.java#L40 -- Reply to this email directly or view it on GitHub: #798 (comment) You are receiving this because you were mentioned. Message ID: ***@***.***>

the-other-tim-brown · 2026-02-07T01:53:56Z

if the issue is on the Hudi side
Not sure about that. Xtable get the hudi stats from the parquet files, not from a hudi api. So my understanding is that it's xtable responsibility to coerce stats coming from the parquet footer in case type evolution did happen.

Yes, this is what I am saying. The HudiConversionSource needs to comply with the XTable Spec. The target is assuming the source will produce the range data according to the spec.

fix(iceberg): normalize float/double stats before encoding

0b29e5e

vinishjail97 reviewed Feb 5, 2026

View reviewed changes

parisni added 2 commits February 5, 2026 15:29

fix(iceberg): coerce stats when schema evolution did happen on source…

26c1c1d

… table

test case, type evolution iceberg

e27d8a0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(iceberg): normalize float/double stats before encoding#798

fix(iceberg): normalize float/double stats before encoding#798
parisni wants to merge 3 commits intoapache:mainfrom
leboncoin:pr-fix-iceberg-stats

parisni commented Feb 3, 2026

Uh oh!

vinishjail97 left a comment

Uh oh!

parisni commented Feb 5, 2026

Uh oh!

the-other-tim-brown commented Feb 5, 2026

Uh oh!

parisni commented Feb 5, 2026 via email

Uh oh!

the-other-tim-brown commented Feb 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Conversation

parisni commented Feb 3, 2026

Important Read

What is the purpose of the pull request

Brief change log

Verify this pull request

Uh oh!

vinishjail97 left a comment

Choose a reason for hiding this comment

Uh oh!

parisni commented Feb 5, 2026

Uh oh!

the-other-tim-brown commented Feb 5, 2026

Uh oh!

parisni commented Feb 5, 2026 via email

Uh oh!

the-other-tim-brown commented Feb 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments