fix(grpo): fix list/list division TypeError in partial parse reward calculation by isaacbmiller · Pull Request #13 · cmpnd-ai/dspy

isaacbmiller · 2026-03-15T22:53:53Z

When computing the format reward for partially-parsed outputs during GRPO
bootstrapping, the code divides two lists (present / expected) instead of
their lengths. This raises TypeError: unsupported operand type(s) for /: 'list' and 'list'.

The intent is to compute the fraction of expected output fields that were
successfully parsed, so len(present) / len(expected) is the correct
expression.
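A minimal sketch of the failure and the fix, for illustration only. The function name `partial_format_reward`, the sample field names, and the guard for an empty `expected` list are assumptions made for this example and are not taken from the patched code:

```python
def partial_format_reward(present: list[str], expected: list[str]) -> float:
    """Return the fraction of expected output fields that were parsed.

    Hypothetical helper; the real function name and call site may differ.
    """
    if not expected:
        # Assumption: treat "nothing expected" as zero reward rather than
        # raising ZeroDivisionError.
        return 0.0
    return len(present) / len(expected)


# Hypothetical field names for a partially parsed output.
present = ["answer"]
expected = ["reasoning", "answer"]

# The buggy expression divides the lists themselves and raises TypeError.
try:
    present / expected
except TypeError as exc:
    error_message = str(exc)

# The corrected expression divides the lengths.
reward = partial_format_reward(present, expected)
print(error_message)
print(reward)
```

With one of two expected fields parsed, the corrected expression gives a reward of 0.5.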


fix(grpo): fix list/list division TypeError in partial parse reward c…

f394f0b

isaacbmiller wants to merge 1 commit into main from fix/bootstrap-trace-list-division
