Attacking LLMs for fun and profit (Ep. 239)

Episode 241 of the Data Science at Home podcast, hosted by Francesco Gadaleta, titled "Attacking LLMs for fun and profit (Ep. 239)" was published on September 18, 2023 and runs 22 minutes.

September 18, 2023 ·22m · Data Science at Home

0:00 / 0:00

Summary

As a continuation of Episode 238, I explain some effective and fun attacks to conduct against LLMs. Such attacks are even more effective on models served locally, that are hardly controlled by human feedback. Have great fun and learn them responsibly. References https://www.jailbreakchat.com/ https://www.reddit.com/r/ChatGPT/comments/10tevu1/new_jailbreak_proudly_unveiling_the_tried_and/ https://arxiv.org/abs/2305.13860

Episode Description

As a continuation of Episode 238, I explain some effective and fun attacks to conduct against LLMs. Such attacks are even more effective on models served locally, that are hardly controlled by human feedback.

Have great fun and learn them responsibly.

References

https://www.jailbreakchat.com/

https://www.reddit.com/r/ChatGPT/comments/10tevu1/new_jailbreak_proudly_unveiling_the_tried_and/

https://arxiv.org/abs/2305.13860

Share this episode

Similar Episodes

人类首次批准“逆转衰老”临床试验，120岁+真的要来了吗？

Apr 13, 2026 ·4m

我们找到了对抗超级细菌的“野生藏宝图”！

Apr 12, 2026 ·5m

绝经与猝死有关系吗？

Apr 11, 2026 ·5m

从CAR-T到CAR-A，到底什么是“CAR”?

Apr 10, 2026 ·4m

能治疗阿尔茨海默病？细胞改造的CAR-A疗法来了！

Apr 9, 2026 ·3m

乐城批了29项新技术！但厉害的不只是29这个数字

Apr 8, 2026 ·3m

Similar Podcasts

The Analytics Engineering Podcast dbt Labs, Inc. Tristan Handy has been curating the Analytics Engineering Roundup newsletter since 2015, pulling together the internet's best data science & analytics articles.Tristan and co-host Julia Schottenstein now bring the Roundup to real life, hosting biweekly conversations with data practitioners inventing the future of analytics engineering.You can view full episode summaries and read back issues of the Roundup newsletter at https://roundup.getdbt.com.The podcast is sponsored by dbt labs, makers of the data transformation framework dbt. To reach our team, drop a note to [email protected]. Explicit STEM.queer() Vera Sativa Machine learning, data science, feminismo y queer anarquismo.Episodios cada 2 semanas. Explicit 天方烨谈基因频道华大基因专业团队倾情打造，基因科普娓娓道来！ Explicit HOODWINKED Kris Greer In a world filled with conspiracy and uncertainty, meet Quinton and Symone Young, the dynamic sibling duo behind their very own security detail company. Join them on a thrilling journey as they are thrust into the heart of a massive conspiracy, and witness the fate of the world hanging in the balance.As a mysterious light from the sky threatens to disrupt data and communication systems, humanity faces an unprecedented challenge. The government is quick to label it an invasion, but is everything as it seems? Enter Collin McMurry, a brilliant whistleblower whose discoveries are about to reshape the world order.'Betrayed by their own government, our heroes must now protect the truth before it's too late.'Their mission: to safeguard the lives of the people and prevent world leaders from dominating an altered future. In a race against time, they'll have to outsmart and outmaneuver those who seek to control the narrative. The fate of the world rests in their capable h Explicit

URL copied to clipboard!