Reuse per-thread query buffers in IVFRaBitQFastScan compute_LUT #5123

Open
lyang24 wants to merge 1 commit into facebookresearch:main from lyang24:opt/rabitq-reuse-compute-lut-thread-buffers

lyang24 (Contributor) commented on Apr 19, 2026

This PR removes per-(query, probe) vector allocations from the float LUT path without changing the distance math or the newer mainline interfaces.

The rebased branch now matches the current main implementation. Instead of carrying the old residual-buffer overload forward, compute_LUT reuses per-thread rotated_q and centroid_buf scratch space, while the newer compute_residual_LUT and compute_LUT_uint8 paths from main stay unchanged.
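
For readers unfamiliar with the pattern, here is a minimal standalone sketch of per-thread scratch reuse. It is not the code in this PR: `LutScratch`, `thread_scratch`, and `toy_compute_lut` are hypothetical names, and the arithmetic is a placeholder rather than the RaBitQ LUT math. It only illustrates how two buffers named after rotated_q and centroid_buf can be sized once per thread and then reused across (query, probe) pairs.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical scratch space. The field names mirror the PR description,
// but this struct is an assumption, not a FAISS type.
struct LutScratch {
    std::vector<float> rotated_q;
    std::vector<float> centroid_buf;

    // Grow only when needed; repeated calls with the same d do not reallocate.
    void ensure(std::size_t d) {
        if (rotated_q.size() < d) rotated_q.resize(d);
        if (centroid_buf.size() < d) centroid_buf.resize(d);
    }
};

// One scratch object per thread, constructed on first use by that thread.
inline LutScratch& thread_scratch() {
    thread_local LutScratch scratch;
    return scratch;
}

// Placeholder for the per-(query, probe) work. Reversing the query stands in
// for a rotation and the subtraction stands in for the real LUT computation;
// both are illustrative only.
inline void toy_compute_lut(
        const float* query,
        const float* centroid,
        std::size_t d,
        LutScratch& scratch,
        float* lut_out) {
    scratch.ensure(d);
    for (std::size_t i = 0; i < d; ++i) {
        scratch.rotated_q[i] = query[d - 1 - i];
    }
    for (std::size_t i = 0; i < d; ++i) {
        scratch.centroid_buf[i] = centroid[i];
    }
    for (std::size_t i = 0; i < d; ++i) {
        lut_out[i] = scratch.rotated_q[i] - scratch.centroid_buf[i];
    }
}
```

A caller inside a parallel loop would fetch `thread_scratch()` once per thread and pass it to every call, so the inner loop over probes performs no heap allocation once the buffers have reached size d.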
@meta-cla meta-cla Bot added the CLA Signed label Apr 19, 2026