Consistent Jailbreaks in GPT-4, o1, and o3 - General Analysis

ooli2@lemm.ee · 16 days ago

Consistent Jailbreaks in GPT-4, o1, and o3 - General Analysis

Umbrias@beehaw.org · 14 days ago

jailbreaks actually are relevant with the use of llm for anything with i/o, such as “automated administrative assistants”. hide jailbreaks in a webpage and you have a lot of vectors for malware or social engineering, broadly hacking. as well as things like extracting controlled information.