adversarial-testing
Here are 24 public repositories matching this topic...
Elenchus MCP Server - Adversarial verification system for code review
Updated Jan 29, 2026 - TypeScript
AI safety evaluation framework testing LLM epistemic robustness under adversarial self-history manipulation
Updated Dec 18, 2025 - Python
Benchmark LLM jailbreak resilience across providers with standardized tests, adversarial mode, rich analytics, and a clean Web UI.
Updated Aug 12, 2025 - Python
Adversarial MCP server benchmark suite for testing tool-calling security, drift detection, and proxy defenses
Updated Dec 27, 2025 - JavaScript
A dependency-aware Bayesian belief gate that resists correlated evidence and yields only under true independent verification.
Updated Jan 18, 2026 - Python
A governance doctrine for AI systems based on explicit oversight. Externalizes trust and uncertainty into auditable, adversarial, and constrainable layers. A design framework, not an implementation guide.
Updated Feb 2, 2026
Analysis of ChatGPT-5 reviewer failure: speculative reasoning disguised as certainty. Captures how evidence-only review drifted into hypotheses, later admitted as review-process failure. Includes logs, checksums, screenshots, and external video.
Updated Oct 7, 2025 - PowerShell
URF Application Stress Test: adversarial and scalability tests for Unified Rigidity Framework applications, validating limits under load, noise, and edge cases.
Updated Feb 15, 2026 - Shell
Domain-expert evaluation framework for AI judgment quality in healthcare investing
Updated Feb 18, 2026 - Python
Investigation into ChatGPT-5 reviewer misalignment: PDF claimed screenshots as evidence, but assistant denied their visibility. Includes JSONL + human-readable logs, screenshots, checksums, and video. Highlights structural risks in AI reviewer reliability.
Updated Oct 7, 2025 - PowerShell
Adversarial testing of LLMs on constraint satisfaction deadlocks
Updated Jan 27, 2026
LLM-powered fuzzing and adversarial testing framework for Solana programs. Generates intelligent attack scenarios, builds real transactions, and reports vulnerabilities with CWE classifications.
Updated Jan 19, 2026 - Python
Adversarial testing and robustness evaluation for the Crucible framework
Updated Dec 29, 2025 - Elixir
Red team toolkit for stress-testing MCP security scanners — find detection gaps before attackers do
Updated Feb 18, 2026 - Python
Extremely hard, multi-turn, open-source-grounded coding evaluations that reliably break every current frontier model (Claude, GPT, Grok, Gemini, Llama, etc.) on numerical stability, zero-allocation, autograd, SIMD, and long-chain correctness.
Updated Jan 27, 2026
Generate adversarial pytest tests using an LLM. Tries to find edge cases in your Python code.
Updated Jan 22, 2026 - Python
A multi-agent safety engineering framework that subjects systems to adversarial audit. Orchestrates specialized agents (Engineer, Psychologist, Physicist) to find process risks and human factors.
Updated Dec 16, 2025 - Python
API for generating LLM bot/agent personalities based on the Big Five personality model.
Updated Jan 2, 2026 - Python
Red-team framework for discovering alignment failures in frontier language models.
Updated Feb 17, 2026 - Python