Monday Sep 18, 2023
Attacking LLMs for fun and profit (Ep. 239)
Continuing from Episode 238, I walk through some effective and fun attacks on LLMs. These attacks work even better against locally served models, which are often barely constrained by human feedback.
Have fun, and use what you learn responsibly.
References
https://www.jailbreakchat.com/
https://www.reddit.com/r/ChatGPT/comments/10tevu1/new_jailbreak_proudly_unveiling_the_tried_and/
https://arxiv.org/abs/2305.13860