
This is a project intended to benchmark the performance impact of using the DX12 implementation of hardware predication to cull primitives per draw call rather than excluding them from the command lists.
The full report on this analysis can be accessed here.
The findings of these benchmarks show large performance overheads when used for a large amount of rather small draw calls, however promising results when applied to larger instanced draw calls, which may have use cases in LOD-esque application areas.

