feat: route self-evolution through Hermes Codex OAuth by stephenschoettler · Pull Request #92 · NousResearch/hermes-agent-self-evolution

stephenschoettler · 2026-05-27T07:06:51Z

Summary

route self-evolution DSPy model creation through a Hermes Codex OAuth adapter for openai-codex/* model strings
move the nightly controller/policy to the Babbage/default ownership path and Codex-backed models
cap Codex GEPA cron budgets with explicit max_full_evals so policy iterations maps to a bounded run instead of GEPA's large hosted-model auto preset
reject byte-identical baseline/evolved artifacts as no-change even when judge scores improve, so no-op candidates are not reviewable/deployable

Coordination with existing work

I found overlapping open work before opening this PR:

feat: add CodexLM support via dspy-local for zero-cost evolution #8 adds Codex support through dspy-local / CodexLM and ~/.codex/auth.json.
Add ChatGPT OAuth model backend and fix evolution runtime regressions #22 adds a separate chatgpt/* OAuth backend.
Fix DSPy 3.2+ API compat (max_full_evals + metric kwargs) and 2 evolve_skill validator bugs #73/Fix dspy.GEPA(max_steps=...) crash on DSPy 3.2+ #91 address DSPy GEPA budget/API compatibility.
Phase 1 SkillModule architecture prevents GEPA from mutating actual skill content + Nous API integration patches #38 documents the no-op skill mutation problem.

This PR is narrower/different in two ways: it uses Hermes Agent's existing openai-codex OAuth credential/runtime path directly, and it hardens the scheduled cron governance path so a successful score cannot mask a byte-identical no-op artifact.

Validation

python -m pytest tests/core/test_dspy_lm_codex.py tests/test_nightly_evolve_cron.py tests/skills/test_evolve_skill_budget.py -q
python -m pytest -q -> 165 passed
/usr/bin/python /home/w0lf/.hermes/scripts/nightly-self-evolution.py --profile babbage --skill goal-planning --dry-run
Codex smoke call through make_dspy_lm('openai-codex/gpt-5.4-mini') returned OK
Controlled forced run after budget cap completed in ~81s and produced an artifact
Follow-up controlled forced run after no-op gate reported status: no-change, gate: no-material-diff, review: rejected, applied: no

Safety

auto_apply remains false
no generated skill candidate was applied
direct pushes to main were avoided; this is on a named branch

- All dspy.LM() calls: num_retries=8, timeout=120 - LiteLLM backoff env vars: INITIAL_RETRY_DELAY=5, MAX_RETRY_DELAY=60 - Switch nightly from anthropic/sonnet to openai/gpt-4.1 (no rate limit conflicts) - Robust JSON parsing in dataset_builder (handles trailing commas, unescaped newlines) - Tested: gpt-4.1 optimizer yielded +10.6% improvement, o3 optimizer yielded 0%

stephenschoettler added 10 commits April 3, 2026 10:40

switch nightly to ChatGPT OAuth (gpt-5.2) — no API key needed

3a71216

feat: add profile-aware nightly self-evolution controller

92b359e

chore: switch self-evolution model to gpt-5.4-mini

702a3cc

chore: move self-evolution ownership to babbage

285e7b0

feat: harden skill evolution candidate gates

b1e2f27

feat: route skill evolution through Codex OAuth

42dff5c

fix: cap Codex GEPA cron budget

3152de8

fix: reject no-op evolution candidates

0a26b6c

chore: drop generated evolution artifacts from PR

b3c1093

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: route self-evolution through Hermes Codex OAuth#92

feat: route self-evolution through Hermes Codex OAuth#92
stephenschoettler wants to merge 10 commits into
NousResearch:mainfrom
stephenschoettler:feat/codex-oauth-self-evolution

stephenschoettler commented May 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

stephenschoettler commented May 27, 2026

Summary

Coordination with existing work

Validation

Safety

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant