This repository is a condensed version of the codebase used for Endless Jailbreaks with Bijection Learning: Attack Vectors for Language Models Emerge at Scale. We provide scripts for running the ...
This repository is a condensed version of the codebase used for Endless Jailbreaks with Bijection Learning: Attack Vectors for Language Models Emerge at Scale. We provide scripts for running the ...