🔍 Enhance iterative theorem proving with DSPy by comparing full oracle vs. clipped hints using a mock Lean verifier in this streamlined setup.
experiment evaluation program-synthesis dataset rl lean clipping variance-reduction ppo tool-use policy-improvement offline-rl dspy leandojo
-
Updated
May 5, 2026 - Python