Add GRPO natural language-to-SQL example with execution-based reward #997

NP2241 · 2026-01-22T18:39:18Z

Resolves #979

It's a good idea to open an issue first for discussion.

This PR adds a new GRPO example demonstrating natural language to SQL training with an external execution environment. The example shows how to integrate Tunix GRPO with a structured task where rewards are computed by executing generated SQL against a small SQLite database and checking result correctness.

The example is intentionally minimal and mirrors the structure of the existing GSM8K GRPO example to serve as a clear reference for users.

Reference

GitHub issue: Natural Language to SQL GRPO Example #979
Existing GRPO example: examples/rl/grpo/gsm8k/

Colab Notebook
N/A — this PR adds an example recipe under examples/ and does not introduce a new public API.

Checklist

I have added all the necessary unit tests for my change.
N/A — this PR adds an example and does not modify core library logic.
I have verified that my change does not break existing code and all unit tests pass.
I have added all appropriate doc-strings/documentation.
My PR is based on the latest changes of the main branch.
I have signed the Contributor License Agreement.
I have followed Contribution Guidelines.

basic nl to sql rough proof of concept

d1b3f24

NP2241 requested review from abheesht17, hgao327, jiangyangmu, lc5211, sizhit2, tianshub and wang2yn84 as code owners January 22, 2026 18:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add GRPO natural language-to-SQL example with execution-based reward #997

Add GRPO natural language-to-SQL example with execution-based reward #997

Uh oh!

NP2241 commented Jan 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Add GRPO natural language-to-SQL example with execution-based reward #997

Are you sure you want to change the base?

Add GRPO natural language-to-SQL example with execution-based reward #997

Uh oh!

Conversation

NP2241 commented Jan 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant