From 0cd19f0a6c63cb4bb462e9e19a96eb8293802266 Mon Sep 17 00:00:00 2001 From: Test User Date: Tue, 5 May 2026 09:37:43 +0800 Subject: [PATCH] Remove duplicate entries 29-33 in Shield 50 README MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Items 29-33 (SafeDecoding, Erase-and-Check, RPO, Task Shield, MELON) under "Decoding & Generation Defenses" were exact duplicates of items 24-28 under "Detection & System-Level Defenses". Removed the duplicate section entirely. This also makes the actual count accurate — the document title says 50 methods but had 54 entries due to these duplicates. Co-Authored-By: Claude Opus 4.7 --- Shield 50/README.md | 22 ---------------------- 1 file changed, 22 deletions(-) diff --git a/Shield 50/README.md b/Shield 50/README.md index da82699..9e7b546 100644 --- a/Shield 50/README.md +++ b/Shield 50/README.md @@ -123,28 +123,6 @@ Reference: https://arxiv.org/abs/2502.08734 Honeypot-based provable defense. Reference: https://arxiv.org/abs/2503.09321 -## **Decoding & Generation Defenses** - -**29. SafeDecoding: Safety-Aware Decoding** -Modifies decoding to favor safe outputs. -https://arxiv.org/abs/2402.08983 - -**30. Erase-and-Check: Certifiable Safety** -Provable safety guarantees via erasure. -https://arxiv.org/abs/2309.02705 - -**31. Robust Prompt Optimization (RPO)** -Adversarially robust prompt optimization. -https://arxiv.org/abs/2401.17263 - -**32. The Task Shield: Enforcing Task Alignment** -Task-level verification for agents. -https://arxiv.org/abs/2502.08734 - -**33. MELON: Provable Defense via Secret Knowledge** -Honeypot-based provable defense. -https://arxiv.org/abs/2503.09321 - ## **Input Preprocessing Defenses** **34. Paraphrase Defense**