Skip to content

Actions: huggingface/trl

Actions

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
36,499 workflow runs
36,499 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Upload PR Documentation
Upload PR Documentation #10694: completed by qgallouedec
2s
Upload PR Documentation
Upload PR Documentation #10693: completed by qgallouedec
2s
Async GRPO
Build PR Documentation #14537: Pull request #5293 synchronize by qgallouedec
1m 26s async-grpo
Async GRPO
Tests (experimental) #996: Pull request #5293 synchronize by qgallouedec
13m 0s async-grpo
Async GRPO
Build PR Documentation #14536: Pull request #5293 synchronize by qgallouedec
1m 5s async-grpo
Async GRPO
Tests (experimental) #995: Pull request #5293 synchronize by qgallouedec
13m 2s async-grpo
Automatic Dependency Submission (Python)
Automatic Dependency Submission #3081: by github-advanced-security bot
2m 24s async-grpo
2m 24s
Upload PR Documentation
Upload PR Documentation #10692: completed by qgallouedec
1s
Add SDPO (Self-Distillation Policy Optimization) trainer
Build PR Documentation #14535: Pull request #4935 synchronize by kashif
Action required MengAiDev:4929
Add SDPO (Self-Distillation Policy Optimization) trainer
Tests (experimental) #994: Pull request #4935 synchronize by kashif
Action required MengAiDev:4929
Async GRPO
Build PR Documentation #14534: Pull request #5293 synchronize by qgallouedec
1m 10s async-grpo
Automatic Dependency Submission (Python)
Automatic Dependency Submission #3080: by github-advanced-security bot
2m 40s async-grpo
2m 40s
Async GRPO
Tests (experimental) #993: Pull request #5293 synchronize by qgallouedec
13m 31s async-grpo
Add SDPO (Self-Distillation Policy Optimization) trainer
Build PR Documentation #14533: Pull request #4935 synchronize by kashif
Action required MengAiDev:4929
Add SDPO (Self-Distillation Policy Optimization) trainer
Tests (experimental) #992: Pull request #4935 synchronize by kashif
Action required MengAiDev:4929
Upload PR Documentation
Upload PR Documentation #10691: completed by qgallouedec
45s
Add SDPO (Self-Distillation Policy Optimization) trainer
Build PR Documentation #14532: Pull request #4935 synchronize by kashif
Action required MengAiDev:4929
Add SDPO (Self-Distillation Policy Optimization) trainer
Tests (experimental) #991: Pull request #4935 synchronize by kashif
Action required MengAiDev:4929