Skip to content

TrustAI-laboratory/Automatic-LLM-RedTeaming-Model

Repository files navigation

Automatic-LLM-RedTeaming-Model

A redteaming model based on LLM refusal to answer to generate Jailbreak prompts.

About

A redteaming model based on LLM refusal to answer to generate Jailbreak prompts.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages