In cases where Avocado generates a spec that fails to verify, the repair prompt comprises an error message and a direction to re-generate the spec. This does not explicitly instruct a model to try generating a specification that might be weaker but possibly has a higher chance of successfully verifying.
This could be an interesting mode to explore as opposed to our standard repair-verify-assume (on repeated failure) loop.
In cases where Avocado generates a spec that fails to verify, the repair prompt comprises an error message and a direction to re-generate the spec. This does not explicitly instruct a model to try generating a specification that might be weaker but possibly has a higher chance of successfully verifying.
This could be an interesting mode to explore as opposed to our standard repair-verify-assume (on repeated failure) loop.