EPISODE · Jan 12, 2024 · 36 MIN
Is open-source AI safe? (with SafeLlama founder, Enoch Kan)
from Thinking Machines: AI & Philosophy · host Daniel Reid Cahn
Founder of the SafeLlama community, Enoch Kan joins us today, to talk about safety in open source and medical AI. Enoch previously worked in AI for radiology, focused on mammography at Kheiron Medical. Enoch is an open source contributor, and his substack is called Cross Validated.Key topics they discuss include:New jailbreaks for LLMs appear every day. Does it matter?How do internet firewalls compare to AI “firewalls”?Why do human radiologists still exist? Would it be safe to replace them all today?Does safety matter more or less as models become more accurate?If regulation is too intense, could we end up with illegal consumer LLMs? For example, could we stop the masses from using an illegal AI doctor that you can access from your phone?Share your thoughts with us at [email protected] or tweet us @slingshot_ai.
What this episode covers
Founder of the SafeLlama community, Enoch Kan joins us today, to talk about safety in open source and medical AI. Enoch previously worked in AI for radiology, focused on mammography at Kheiron Medical. Enoch is an open source contributor, and his substack is called Cross Validated.Key topics they discuss include:New jailbreaks for LLMs appear every day. Does it matter?How do internet firewalls compare to AI “firewalls”?Why do human radiologists still exist? Would it be safe to replace them all today?Does safety matter more or less as models become more accurate?If regulation is too intense, could we end up with illegal consumer LLMs? For example, could we stop the masses from using an illegal AI doctor that you can access from your phone?Share your thoughts with us at [email protected] or tweet us @slingshot_ai.
NOW PLAYING
Is open-source AI safe? (with SafeLlama founder, Enoch Kan)
No transcript for this episode yet
Similar Episodes
Mar 31, 2026 ·54m
Mar 27, 2026 ·14m
Mar 24, 2026 ·42m
Mar 20, 2026 ·42m
Mar 17, 2026 ·41m
Mar 13, 2026 ·44m