feat: reduce temp copy when decompressing lz4 frame by zhaohaidao · Pull Request #9308 · apache/arrow-rs

zhaohaidao · 2026-01-30T08:55:02Z

Which issue does this PR close?

Closes #9307 (feat: reduce temp copy when decompressing lz4 frame).

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

alamb

Thanks for this contribution @zhaohaidao

Did you run any benchmarks to measure the performance improvement of this PR?

If you need new benchmarks, it would be nice to add them in a separate PR so that we can use our automated testing infrastructure to compare the results.

This strongly suggests the streaming FrameDecoder path (state machine + Read trait + buffer resize/zero-init) adds significant overhead beyond core LZ4 block decompression.

If the issue is optimizing the lz4 frame decoding, that might be better done in the lz4 layer 🤔 Have you considered contributing this change upstream to the lz4_flex crate?
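To make the buffer-management part of this discussion concrete, here is a minimal, std-only sketch contrasting a streaming read into a growing buffer with a read into a buffer reserved up front from a known length. It is not the code from this PR and does not use lz4_flex: `std::io::Cursor` stands in for a decoder that implements `Read`, and the function names `read_streaming` and `read_with_known_len` are hypothetical, chosen for illustration only.

```rust
use std::io::{self, Cursor, Read};

/// Streaming-style read: the output length is not known up front, so the
/// vector starts empty and is grown by `read_to_end` as needed.
fn read_streaming<R: Read>(mut reader: R) -> io::Result<Vec<u8>> {
    let mut out = Vec::new();
    reader.read_to_end(&mut out)?;
    Ok(out)
}

/// Size-aware read: the caller already knows the expected output length,
/// so the capacity is reserved once before any bytes are read.
fn read_with_known_len<R: Read>(reader: R, len: usize) -> io::Result<Vec<u8>> {
    let mut out = Vec::with_capacity(len);
    // `take` caps the read at `len` bytes; a shorter result is an error.
    let n = reader.take(len as u64).read_to_end(&mut out)?;
    if n != len {
        return Err(io::Error::new(io::ErrorKind::UnexpectedEof, "short read"));
    }
    Ok(out)
}

fn main() -> io::Result<()> {
    // Assumption: a Cursor over plain bytes stands in for a real decoder.
    let payload: Vec<u8> = (0..=255u8).cycle().take(64 * 1024).collect();
    let a = read_streaming(Cursor::new(payload.clone()))?;
    let b = read_with_known_len(Cursor::new(payload.clone()), payload.len())?;
    assert_eq!(a, b);
    println!("both paths returned {} bytes", b.len());
    Ok(())
}
```

Whether the difference between the two paths is measurable in the Arrow IPC reader is exactly what the requested benchmarks would need to show.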

alamb · 2026-01-30T15:37:25Z

arrow-ipc/Cargo.toml

 [features]
 default = []
 lz4 = ["lz4_flex"]
+lz4_direct = ["lz4", "twox-hash"]


Why is this feature flagged? Is there any reason a user would NOT want this feature?

If we are going to add a new feature, it also needs to be documented.
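For readers unfamiliar with how a Cargo feature such as `lz4_direct` gates code, the following is a minimal sketch of compile-time path selection. The helper `lz4_path` and the strings it returns are hypothetical and only illustrate the mechanism; the PR's actual gated code is not shown in this conversation.

```rust
/// Hypothetical helper: reports which decompression path was compiled in.
/// Built with `--features lz4_direct`, this variant is selected.
#[cfg(feature = "lz4_direct")]
fn lz4_path() -> &'static str {
    "direct"
}

/// Fallback variant, compiled when the `lz4_direct` feature is disabled.
#[cfg(not(feature = "lz4_direct"))]
fn lz4_path() -> &'static str {
    "frame_decoder"
}

fn main() {
    println!("lz4 path: {}", lz4_path());
}
```

Because only one variant is compiled into a given build, each feature combination has to be built and tested separately, which is one cost of gating behavior behind a flag.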

alamb · 2026-02-02T16:41:51Z

Marking as draft as I think this PR is no longer waiting on feedback and I am trying to make it easier to find PRs in need of review. Please mark it as ready for review when it is ready for another look.

zhaohaidao · 2026-02-02T16:50:14Z

> Marking as draft as I think this PR is no longer waiting on feedback and I am trying to make it easier to find PRs in need of review. Please mark it as ready for review when it is ready for another look.

Thanks for marking it. I'm currently adding benchmarks based on the comments. The hotspot I'm encountering right now might be related to buffer/IO management in the arrow IPC path, so I'm still inclined to optimize arrow-rs for now. However, I will follow your suggestion and add benchmarks.

feat: reduce temp copy when decompressing lz4 frame

dd31d63

github-actions bot added the arrow label (Changes to the arrow crate) on Jan 30, 2026

alamb reviewed Jan 30, 2026

alamb marked this pull request as draft February 2, 2026 16:41

zhaohaidao wants to merge 1 commit into apache:main from zhaohaidao:feat/lz4-reduce-copy
