feat: add support for skill management in Codex CLI generator #394

Closed
omkargaikwad23 wants to merge 5 commits into GoogleCloudPlatform:main from omkargaikwad23:codex-skills

omkargaikwad23 (Contributor) commented May 14, 2026

Description

This PR adds native support for declarative skill management in the CodexCliGenerator, bringing Codex integration to parity with Claude Code within EvalBench.

Prior to this PR, EvalBench had no mechanism to provision custom Codex Skills automatically. With this change, externally hosted or local skills can be installed during setup, before an evaluation run starts.

Key Changes

  • Skill Installation Support: Extended the model configuration setup phase so that skills can be installed directly, using action: link for local paths and action: install_from_repo for GitHub URLs.
  • Claude Code Parity: The CodexCliGenerator is now hooked into the skill extraction pipeline in agentevaluator.py, keeping the pattern consistent across agent generators. Also added a top-level skills_dir fallback, matching the Claude Code configuration model.
  • Skill Extraction: Implemented extraction methods that identify activated skills directly from the trajectory and detect skill-internal scripts invoked through shell tools.
  • Example Configurations: Added sample datasets and model configs illustrating the new skill capabilities.
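
To make the two setup actions and the skills_dir fallback concrete, here is a minimal sketch of how such a setup phase could dispatch on them. It is not the code from this PR. Only the values action: link, action: install_from_repo, and the top-level skills_dir key come from the description above. The function names (install_skill, provision_skills), the setup/skills nesting, the path and url keys, and the choice to symlink or shallow-clone into a single skills root are all assumptions made for illustration.

```python
import subprocess
from pathlib import Path


def install_skill(entry: dict, skills_root: Path) -> Path:
    """Provision one skill entry under skills_root and return its location.

    Hypothetical helper: the entry keys "path" and "url" are assumed names.
    """
    action = entry.get("action")
    if action == "link":
        # Local skill: expose it in the skills root through a symlink.
        source = Path(entry["path"]).resolve()
        target = skills_root / source.name
        if not (target.is_symlink() or target.exists()):
            target.symlink_to(source, target_is_directory=True)
        return target
    if action == "install_from_repo":
        # Remote skill: shallow-clone the repository into the skills root.
        url = entry["url"]
        name = url.rstrip("/").rsplit("/", 1)[-1].removesuffix(".git")
        target = skills_root / name
        if not target.exists():
            subprocess.run(
                ["git", "clone", "--depth", "1", url, str(target)],
                check=True,
            )
        return target
    raise ValueError(f"unsupported skill action: {action!r}")


def provision_skills(model_config: dict, skills_root: Path) -> list[Path]:
    """Install every configured skill, falling back to top-level skills_dir."""
    skills_root.mkdir(parents=True, exist_ok=True)
    entries = list(model_config.get("setup", {}).get("skills", []))
    if not entries and model_config.get("skills_dir"):
        # Assumed fallback behavior: link each subdirectory of skills_dir.
        base = Path(model_config["skills_dir"])
        entries = [
            {"action": "link", "path": str(p)}
            for p in sorted(base.iterdir())
            if p.is_dir()
        ]
    return [install_skill(entry, skills_root) for entry in entries]
```

Under these assumptions, a call such as provision_skills({"setup": {"skills": [{"action": "link", "path": "./skills/my-skill"}]}}, Path("./agent-skills")) would link one local skill, and passing {"skills_dir": "./skills"} instead would link every subdirectory of that folder. Both paths here are placeholders.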

omkargaikwad23 marked this pull request as draft May 14, 2026 06:49
omkargaikwad23 marked this pull request as ready for review May 14, 2026 08:41
omkargaikwad23 marked this pull request as draft May 14, 2026 08:46
omkargaikwad23 changed the title from "feat: add support for skill management in Codex CLI generator and int…" to "feat: add support for skill management in Codex CLI generator" May 14, 2026
2 participants