ML Plus JSON Classifier Integrated With PromptInjector

Extending the original PromptInjector with dynamic rules, ML scoring, semantic detection, and explainability. This project builds on top of existing tool: PromptInjector (https://github.com/nayangoel/PromptInjector) repository, which focuses on generating adversarial prompts for red teaming large language models. The original tool is offensive oriented: it produces jailbreak attempts, role play bypasses, and other adversarial inputs to test LLM robustness. This enhanced version adds a full defensive layer: a modular, explainable promptinjection classifier that detects, scores, and explains suspicious user inputs using a hybrid of rules, machine learning, and semantic similarity. Project created with AI assistance.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
Bar Graph - Plotly.png		Bar Graph - Plotly.png
Injection detected.png		Injection detected.png
README.md		README.md
Sample Question + Answer.png		Sample Question + Answer.png
Scatter Plot - Plotly.png		Scatter Plot - Plotly.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ML Plus JSON Classifier Integrated With PromptInjector

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

ML Plus JSON Classifier Integrated With PromptInjector

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages