Skip to content
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
68 commits
Select commit Hold shift + click to select a range
c97f386
Add TrajFM migration plan (failure_mode)
nianjunz Mar 31, 2026
67679e0
Add failure_mode tests fixtures and README
nianjunz Mar 31, 2026
3f9e4c5
Add trajectory failure-mode analysis module
nianjunz Mar 31, 2026
8594108
Mark Phase 2 LLM refactor as complete
nianjunz Mar 31, 2026
e6a527e
Update tests README for Phase 2
nianjunz Mar 31, 2026
43540d8
Add unit tests for failure_mode module
nianjunz Mar 31, 2026
8605bf7
Add failure-mode generator, utils, and pipeline
nianjunz Mar 31, 2026
21d2525
Add example failure mode analysis CLI script
nianjunz Apr 1, 2026
87a9675
Export failure_mode pipeline and generator
nianjunz Apr 1, 2026
6b18d7b
Update test_utils.py
nianjunz Apr 1, 2026
a0d162a
Set default out_dir to package path
nianjunz Apr 1, 2026
87465c3
Use uv runner and update script defaults
nianjunz Apr 1, 2026
ebee678
Move failure_mode into core; add reducer and CLI
nianjunz Apr 1, 2026
0894a20
Support clustering & re-export core functions
nianjunz Apr 1, 2026
8511b37
Create format_handlers.py
nianjunz Apr 1, 2026
ff421e6
Update LLM model/provider messages
nianjunz Apr 1, 2026
9d38dfc
Add timeout wrapper for LLM calls
nianjunz Apr 1, 2026
9c0629b
Add LiteLLM model connection and bulk tests
nianjunz Apr 1, 2026
cedcdc4
Add timeout wrapper and LLM timeout tests
nianjunz Apr 1, 2026
12fecf4
Add trajectory import verification script
nianjunz Apr 1, 2026
4bb7ebd
Switch default LLM to AWS Claude Sonnet
nianjunz Apr 1, 2026
d8bbcab
Create litellm_models_report.md
nianjunz Apr 1, 2026
fcc8fd4
Update .gitignore
nianjunz Apr 1, 2026
6dd3daf
Remove LiteLLM models test report
nianjunz Apr 1, 2026
da46c85
Fetch LiteLLM models dynamically from proxy
nianjunz Apr 1, 2026
b7fa448
Update analyze_trajectories.py
nianjunz Apr 1, 2026
bb68b3c
Remove failure_mode docs and test scripts
nianjunz Apr 14, 2026
7cabeb2
Add trajectory-analysis optional deps
nianjunz Apr 14, 2026
94a52f0
Rename format handlers and add handler metadata
nianjunz Apr 14, 2026
136f8cc
Add verbose logging option to analysis script
nianjunz Apr 14, 2026
21f4787
Add logging, error handling, and rename outputs
nianjunz Apr 14, 2026
229a9f0
Use AWS Claude Sonnet as default LLM
nianjunz Apr 14, 2026
7e2972f
Add logging and handler info in utils
nianjunz Apr 14, 2026
7dfc5fe
Add diagnostics scripts for LLM and trajectory tests
nianjunz Apr 14, 2026
b145a5f
Delete MIGRATION_PLAN.md
nianjunz Apr 14, 2026
f85ff2f
Update test fixtures and LLM backend refs
nianjunz Apr 14, 2026
4b560c0
Update uv.lock
nianjunz Apr 14, 2026
30efe8e
Remove additional_fm_clustered.csv
nianjunz Apr 14, 2026
4308ced
Rewrite failure_mode README with CLI & structure
nianjunz Apr 14, 2026
5ef5c0f
Add --cluster-only mode and pipeline refactor
nianjunz Apr 14, 2026
d92aaa9
Update generator.py
nianjunz Apr 14, 2026
d88ebc5
Add run combining & optional clustering to pipeline
nianjunz Apr 14, 2026
a5e0284
Return handler name from get_llm_answer_from_json
nianjunz Apr 14, 2026
c195043
Change output dir & fix additional_fm.csv typo
nianjunz Apr 14, 2026
e59350d
Document CSV column schema for failure modes
nianjunz Apr 14, 2026
99305a9
Remove obsolete trajectory_analysis ignores
nianjunz Apr 14, 2026
7117c31
Improve logging: add file handlers & run_id
nianjunz Apr 15, 2026
b15d40e
Add optional run_id to failure pipeline
nianjunz Apr 15, 2026
e68d73c
Include trajectory_path in failure mode outputs
nianjunz Apr 15, 2026
c243a23
Update failure_mode README: logging & clustering
nianjunz Apr 15, 2026
226a92c
Update tests for API changes and mocking
nianjunz Apr 15, 2026
da6ada2
Use llm package and write outputs to results
nianjunz Apr 15, 2026
68dbf61
Update diagnostics scripts imports and CLI usage
nianjunz Apr 15, 2026
fb09ef4
Add system architecture and expand diagnostics
nianjunz Apr 15, 2026
885e804
Remove the signature of Bob
nianjunz Apr 15, 2026
d0d1d45
Update reducer.py
nianjunz Apr 15, 2026
0592948
docs: improve failure_mode README (formats, paths)
nianjunz Apr 15, 2026
4463906
Remove 'Made with Bob' footer from tests
nianjunz Apr 15, 2026
2196521
Clarify CLI usage in analyze_trajectories.py
nianjunz Apr 15, 2026
620a941
Document three usage modes and update examples
nianjunz Apr 15, 2026
fea1b8b
Sample Trajectories for analysis
nianjunz Apr 15, 2026
0e6846d
Rename verify_trajectory_import to verify_trajectory_format
nianjunz Apr 15, 2026
d107f44
Add working-directory hint to script usage
nianjunz Apr 15, 2026
dbae514
Add cd /path/to/AssetOpsBench to examples
nianjunz Apr 15, 2026
1b3116d
Update README test and diagnostic commands
nianjunz Apr 15, 2026
ff817f3
Clarify diagnostic script usage in README
nianjunz Apr 15, 2026
76b34b5
Use verify_trajectory_format in examples
nianjunz Apr 15, 2026
8929ce0
Use package-style imports in tests
nianjunz Apr 16, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 2 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -201,3 +201,5 @@ mcp/couchdb/sample_data/bulk_docs.json
.env
mcp/servers/tsfm/artifacts/tsfm_models/
src/tmp/


5 changes: 5 additions & 0 deletions pyproject.toml
Original file line number Diff line number Diff line change
Expand Up @@ -50,6 +50,11 @@ tsfm = [
"torch>=2.0",
"transformers>=4.40",
]
# Optional deps for trajectory analysis clustering (Phase 3)
trajectory-analysis = [
"sentence-transformers>=2.0",
"scikit-learn>=1.0",
]

[tool.uv]
package = true
Expand Down
11 changes: 11 additions & 0 deletions src/trajectory_analysis/__init__.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,11 @@
"""Trajectory analysis module for AssetOpsBench.

This module provides tools for analyzing agent trajectories and identifying
failure modes in multi-agent systems.
"""

from . import failure_mode

__all__ = ["failure_mode"]

# Made with Bob
Loading