Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
… no wf executed true
…, fall back to 1.0 if provider rejects it
…ead of system Python
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
… from escaping local optimum
…) claim-list cache
… set: crossover rate to 0.4 + hard cap at 0.7
…uant floor
When the model creator only self-hosts at int4/fp4 (e.g. moonshotai for kimi-k2.7-code), the previous floor dropped their endpoint and routed to a community fp8 reseller. For benchmark reproducibility the authoritative first-party endpoint is preferred over a third-party requantization; the JSON-validity + determinism probe still gates them on merit.
Also widen the runtime quantizations filter to admit any quant the precheck approved, so OpenRouter doesn't silently drop the pinned first-party endpoint.
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
…gimosa_v2 g Please enter a commit message to explain why this merge is necessary,
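The quant-floor commit message above describes two rules: a first-party endpoint is admitted even below the quantization floor as long as it passes the probe, and the runtime quantization filter is widened to include whatever the precheck approved. A minimal sketch of that logic might look like the following. All names here (`Endpoint`, `QUANT_RANK`, `precheck`, `runtime_quantizations`) and the precision ranking are hypothetical illustrations, not the code from this PR.

```python
from dataclasses import dataclass

# Assumed precision ranking for illustration only; higher means less aggressive quantization.
QUANT_RANK = {"int4": 0, "fp4": 0, "fp8": 1, "bf16": 2, "fp16": 2}


@dataclass(frozen=True)
class Endpoint:
    provider: str        # hypothetical provider slug
    quantization: str    # e.g. "int4", "fp8"
    first_party: bool    # True if hosted by the model creator
    passed_probe: bool   # result of the JSON-validity + determinism probe


def precheck(endpoints, floor="fp8"):
    """Return approved endpoints, first-party ones first.

    Every endpoint must pass the probe. Third-party endpoints must also
    meet the quantization floor; first-party endpoints are exempt from it.
    """
    floor_rank = QUANT_RANK[floor]
    approved = []
    for ep in endpoints:
        if not ep.passed_probe:
            continue  # the probe gates everyone on merit
        meets_floor = QUANT_RANK.get(ep.quantization, -1) >= floor_rank
        if meets_floor or ep.first_party:
            approved.append(ep)
    # sorted() is stable, so the original order is kept within each group.
    return sorted(approved, key=lambda ep: not ep.first_party)


def runtime_quantizations(approved, floor="fp8"):
    """Build the runtime filter: everything at or above the floor,
    plus any quantization the precheck already approved."""
    floor_rank = QUANT_RANK[floor]
    allowed = {q for q, rank in QUANT_RANK.items() if rank >= floor_rank}
    allowed |= {ep.quantization for ep in approved}
    return sorted(allowed)


candidates = [
    Endpoint("community-reseller", "fp8", first_party=False, passed_probe=True),
    Endpoint("moonshotai", "int4", first_party=True, passed_probe=True),
    Endpoint("other-reseller", "int4", first_party=False, passed_probe=True),
]
approved = precheck(candidates)
allowed_quants = runtime_quantizations(approved)
```

In this sketch the first-party int4 endpoint is ranked ahead of the fp8 reseller, the third-party int4 endpoint is still rejected by the floor, and `int4` ends up in the runtime filter so the pinned endpoint is not filtered out downstream.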
Summary
This PR merges the mimosa_v2 development branch into dev, bringing ~85 commits of verifier hardening, quality-diversity evolution improvements, workflow runner isolation, and CLI robustness fixes.
What's Changed
Verifier & Grounding
Evolution & QD
Workflow Runner & Orchestrator
max_tokens capped at 16384 to prevent token-limit crashes
CLI & UX
evaluation_cli, memory_chat_cli, and onboard_cli
Agent & Misc
additional_authorized_imports updated
Breaking Changes
None expected — all changes are additive or internal refactoring.
Testing
Contribution checklist
Please confirm the following before requesting review:
CONTRIBUTING.md, docs/licensing-notes.md, and the repository license information (LICENSE, NOTICE) for this repository (Apache License 2.0).
docs/cla-process.md and understand the CLA workflow for contributions.