fix: Handle negative block counts in Avro map/header parsing by pomo-mondreganto · Pull Request #6 · DataEngineeringLabs/avro-schema

pomo-mondreganto · 2025-12-17T11:45:12Z

Per the Avro specification, when reading maps (and arrays), a negative block count indicates that the absolute value should be used as the count, and a byte size follows for fast skipping.

Previously, the code cast the signed zigzag value directly to usize, causing a negative value like -8 to become 18,446,744,073,709,551,608 on 64-bit systems, triggering a hash table capacity overflow panic.

Also skips parsing of 'default' field values since the current implementation incorrectly expects them to be Schema types rather than actual default values.

Fixes reading of Apache Iceberg manifest files which use this encoding.

Per the Avro specification, when reading maps (and arrays), a negative block count indicates that the absolute value should be used as the count, and a byte size follows for fast skipping. Previously, the code cast the signed zigzag value directly to usize, causing a negative value like -8 to become 18,446,744,073,709,551,608 on 64-bit systems, triggering a hash table capacity overflow panic. Also skips parsing of 'default' field values since the current implementation incorrectly expects them to be Schema types rather than actual default values. Fixes reading of Apache Iceberg manifest files which use this encoding.

pomo-mondreganto mentioned this pull request Dec 17, 2025

Usage of avro-schema for avro format pola-rs/polars#25800

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: Handle negative block counts in Avro map/header parsing#6

fix: Handle negative block counts in Avro map/header parsing#6
pomo-mondreganto wants to merge 1 commit intoDataEngineeringLabs:mainfrom
pomo-mondreganto:pomo/bug/map-read-overflow

pomo-mondreganto commented Dec 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

pomo-mondreganto commented Dec 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant