adversarial-testing

Here are 102 public repositories matching this topic...

0xSanei / darwinia

The Self-Evolving Agent Ecosystem — Trading agents that evolve through Darwinian selection and adversarial self-play

bitcoin trading genetic-algorithm quantitative-finance autonomous-agents backtesting ai-agents multi-agent-system evolutionary-computing streamlit adversarial-testing openclaw darwinian-evolution

Updated Apr 13, 2026
Python

humanbound / humanbound

Star

Open-source adversarial testing engine, SDK, and CLI for AI agents. Runs locally or against the Humanbound Platform.

Updated Jul 15, 2026
Python

IBM / ares

Star

AI Robustness Evaluation System

security ai owasp owasp-top-10 red-teaming blue-teaming agentic-ai automated-red-teaming adversarial-testing

Updated Jul 16, 2026
Python

sherifkozman / the-red-council

Star

LLM Adversarial Security Arena — Jailbreak → Detect → Defend → Verify

security gemini red-team llm langchain adversarial-testing

Updated May 9, 2026
Python

Open-source framework for building and testing LLM-powered applications: IRIS (single-agent orchestration), AETHER (declarative multi-agent systems), and AEGIS (adversarial security testing). Developed at MSU Denver's Community-Centered Computing (C3) Lab.

python open-source benchmarking research ai nsf multi-agent-systems security-testing red-teaming rag llm langchain langgraph agentic-ai adversarial-testing msu-denver c3-lab

Updated Jul 16, 2026
Python

howardpen9 / grok-mcp

Star

MCP server that wraps the xAI Grok CLI. Lets Claude Code, Cursor, Cline, and any MCP host use Grok as a peer code reviewer, adversary, and second-opinion consultant.

typescript mcp code-review cursor grok cline peer-review xai ai-tools ai-agent llm-tools agent-tools model-context-protocol mcp-server second-opinion claude-code adversarial-testing

Updated Jul 3, 2026
TypeScript

ProofAgent-ai / proofagent-harness

Star

Open-source test harness for AI agents. Stress-test production agents with adversarial multi-turn scenarios in CI

python mcp pytest agents ai-safety ai-agents red-teaming rag llmops prompt-injection genai evals llm-evaluation hallucination-detection llm-testing agent-evaluation ai-red-teaming agent-testing adversarial-testing

Updated Jul 13, 2026
Python

audn-ai / skills

Star

Red-team your AI agents from any coding IDE. Adversarial security testing skills for Claude Code, Cursor, Codex, and 40+ agents.

skills jailbreak red-team ai-security prompt-injection llm-security claude-code adversarial-testing agent-skill

Updated Apr 13, 2026

zakky8 / llm-jailbreak-taxonomy

Star

Mechanism-grounded taxonomy of 40 LLM jailbreak patterns across 10 categories. 8,000-trial bootstrap evaluation for the June 2026 frontier (Claude Opus 4-8, GPT-5.5, Gemini 3.5, DeepSeek V4). Every citation direct-WebFetch verified; refuted claims documented.

taxonomy jailbreak alignment ai-safety security-testing responsible-disclosure jailbreak-detection adversarial-attacks red-teaming ai-security model-robustness adversarial-ml prompt-injection red-teaming-tools llm-security llm-evaluation llm-jailbreaks ai-red-teaming adversarial-testing

Updated Jun 2, 2026
Jupyter Notebook

jhlee0409 / elenchus-mcp

Sponsor

Star

Elenchus MCP Server - Adversarial verification system for code review

nodejs typescript ai mcp static-analysis code-review claude code-verification llm anthropic model-context-protocol mcp-server adversarial-testing

Updated Jan 29, 2026
TypeScript

Zandereins / hydra

Star

Multi-perspective code review council for Claude Code. 3 advisors by default, 10 agents in deep mode (Opus + Codex). Evidence chains, adversarial self-test, dual-path verdict. Based on Karpathy's LLM Council.

security-audit multi-agent code-review opus codex cross-model architecture-review prompt-engineering ai-code-review claude-code adversarial-testing claude-skill claude-code-skill llm-council evidence-chains dual-path-verdict

Updated Jul 3, 2026
Python

alejandrosaenz117 / bonfires-marketplace

Star

A marketplace of Claude Code plugins for adversarial security and architectural code review.

security architecture code-review threat-modeling security-review claude-code adversarial-testing plugin-marketplace

Updated Mar 30, 2026

bassrehab / red-queen

Star

Evolutionary adversarial testing framework for AI safety using quality-diversity search to discover interpretable, transferable vulnerabilities across LLMs. (ICLR 2026)

evolutionary-algorithms model-evaluation ai-safety ai-agents quality-diversity jailbreak-detection red-teaming foundation-models llm llm-security adversarial-testing iclr2026

Updated Mar 16, 2026
Rust

CodedRichy / food-chain-ideation

Star

Claude Code skill that stress-tests startup ideas with adversarial AI agents — 68 animals, elimination rounds, blind scoring. Your idea either survives or you get 3 pivots

ideation ai-agents claude product-strategy prompt-engineering claude-code adversarial-testing claude-skills

Updated Jun 10, 2026
HTML

vibheksoni / jailbench

Star

Benchmark LLM jailbreak resilience across providers with standardized tests, adversarial mode, rich analytics, and a clean Web UI.

Updated Aug 12, 2025
Python

stchakwdev / Gaslight_EVAL

Star

AI safety evaluation framework testing LLM epistemic robustness under adversarial self-history manipulation

python ai-safety openrouter llm-evaluation adversarial-testing alignment-research epistemic-robustness

Updated Dec 18, 2025
Python

dr-gareth-roberts / context-engineering

Star

Context engineering toolkit for LLMs — pack, cache, debug, red-team, and orchestrate context windows. Council of Experts, adversarial testing, immune system, context compiler, drift detection, multi-agent entanglement. TypeScript + Python.