agentdojo

Here are 4 public repositories matching this topic...

immu4989 / dspy-security-bench

Measure how DSPy prompt optimization affects the prompt-injection robustness of agentic LLM programs, using AgentDojo's attack suite.

python robustness dspy prompt-injection llm-security llm-evaluation prompt-optimization agentic-ai agent-benchmark agentdojo

Updated Jun 16, 2026
Python

guangxiangdebizi / tool-output-spoofing-lab

Benchmarking schema-valid false tool observations and defense baselines for tool-using LLM agents.

benchmark mcp ai-safety tool-use prompt-injection llm-agents agent-security rag-security agentdojo toolsandbox tool-output-spoofing

Updated Jun 10, 2026
Python

LQ458 / daily-admin-agent-security-eval

AgentDojo suite for daily-admin agent security evaluation with simulated dynamic tool workflows.

evaluation glm ai-safety prompt-injection llm-security siliconflow agent-security agentdojo

Updated Jun 15, 2026
Python

Chunduri-Aditya / agent-shield

Personal research project (solo, unaffiliated). Inspect AI evaluation framework for LLM agent security: attack success rate (ASR), benign utility, and Transparency Rate across prompt injection, tool poisoning, and psychological attacks.

mcp red-teaming prompt-injection llm-evaluation llm-agents agent-security inspect-ai agentdojo transparency-rate

Updated Jun 21, 2026
Python
