batch rays together, decimate allocations and speedup ~3x by fjbarter · Pull Request #4 · fjbarter/PRICK.jl

fjbarter · 2026-03-19T01:03:18Z

rather large effort to batch rays together for passing to ImplicitBVH. overload LVT traversal algorithm to only trace 'active' rays, i.e. that have not been terminated (hit a sink, bbox, max bounces, max length etc)

this aims to effectively eliminate allocations in the ray tracing loop, as traversal caches are now being adequately utilised for ray tracing, and direction + position matrices do not need to be created per traverse_rays call. simply mutate the RayBatchBuffer

strong scaling is decent but not amazing: 1000 rays -> 11.3 s on 1 thread, 3.6 s on 4 threads for a ~3.1x speedup

batch rays together, decimate allocations and speedup ~3x

6af0643

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

batch rays together, decimate allocations and speedup ~3x#4

batch rays together, decimate allocations and speedup ~3x#4
fjbarter wants to merge 1 commit intomainfrom
batch_tracing

fjbarter commented Mar 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

fjbarter commented Mar 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant