-
Notifications
You must be signed in to change notification settings - Fork 0
feat: add agent evaluation CLI #9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
14 commits
Select commit
Hold shift + click to select a range
6ef7d0c
docs: update AGENTS.md to specify schemas.ts for pipeline classes
valuecodes 9b503be
feat: add agent evaluation CLI with reporting and assertion framework
valuecodes 01afdc9
test: add unit tests for parseArgs function with Zod validation
valuecodes 4e16806
feat: add file assertion types and enhance evaluation framework
valuecodes 7a1e10d
docs: update README with new suite field notes and file assertion types
valuecodes d3d07b0
feat: add delete file tool and related assertions to evaluation frame…
valuecodes a66279f
refactor: update report generation to log display paths instead of fu…
valuecodes ae95f75
feat: add deleteFile tool and update documentation for agent evals
valuecodes 19e0283
chore: remove outdated CHECKLIST.md file
valuecodes c20e80c
docs: add comment to clarify purpose of sanitizeArgs function
valuecodes c9a6fd9
feat: add path traversal checks to file assertions and utilities
valuecodes 880f072
refactor: update description and test case for empty path handling
valuecodes 8659764
feat: add lodash-es for deep equality checks in assertions
valuecodes 384b9e2
refactor: reorganize schemas into separate types directory
valuecodes File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The documentation states schemas should be in
types/schemas.ts, but this PR and other CLIs (like etf-backtest) place schemas directly asschemas.tsin the CLI directory. The codebase is inconsistent - some CLIs usetypes/subdirectories (name-explorer, scrape-publications) while others don't. Either update the documentation to reflect actual practice or establish a consistent pattern across all CLIs.