Skip to content

Refactor behavior contract evaluation by replacing NonDeterministicMa…#1

Merged
bireshpatel merged 8 commits intomainfrom
feat/framework-hardening
Apr 20, 2026
Merged

Refactor behavior contract evaluation by replacing NonDeterministicMa…#1
bireshpatel merged 8 commits intomainfrom
feat/framework-hardening

Conversation

@bireshpatel
Copy link
Copy Markdown
Owner

…tcher with HeuristicContractMatcher. Updated README and related files to reflect changes in contract evaluation logic, emphasizing heuristic checks for fields, keywords, and forbidden patterns. Adjusted tests to utilize the new matcher and clarified documentation on contract definitions and scoring mechanisms.

Biresh Patel added 8 commits April 19, 2026 11:36
…tcher with HeuristicContractMatcher. Updated README and related files to reflect changes in contract evaluation logic, emphasizing heuristic checks for fields, keywords, and forbidden patterns. Adjusted tests to utilize the new matcher and clarified documentation on contract definitions and scoring mechanisms.
…nt and tool registry. Added `index.ts` for public exports, updated `.env.example` for cloud model key instructions, and refined `package.json` for clarity. Removed deprecated `agent/types.ts` and reorganized files into `examples/agent/` for better separation of concerns. Updated CI workflow to support Ollama cloud models and added comprehensive README sections for improved usability.
…of local models and optional API keys. Adjusted agent configuration to streamline API key handling for Ollama and improved documentation on model requirements. Refined test environment setup to reflect changes in model selection and configuration.
…inputs, accommodating variations in key naming. Implement error handling in the tool registry to catch execution errors and return meaningful messages. This improves robustness and user feedback for file operations.
…lama. Improved documentation on using repository Variables and Secrets for model and provider settings, and adjusted default values for better usability in CI environments.
…tandardize usage of `llama3.2:3b`. Adjusted default values in CI workflow and README for improved clarity on local and cloud model configurations, ensuring better guidance for users on API key requirements.
…s to 120s, reflecting the need for longer execution times with local LLMs. Enhance README documentation to clarify timeout adjustments and CI workflow instructions for manual runs, including branch selection options.
…tween local and CI environments for behavioral tests, extending the timeout to 300s in CI. Revise README to reflect these changes and provide clearer guidance on timeout configurations and their implications for LLM performance in various environments.
@bireshpatel bireshpatel merged commit 70b9ea7 into main Apr 20, 2026
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant