EPISODE · May 29, 2024
213 – Are Transformer Models Aligned By Default?
from The Bayesian Conspiracy · host BayesianAdmin
Our species has begun to scrute the inscrutable shoggoth! With Matt Freeman 🙂 LINKS Anthropic’s latest AI Safety research paper, on interpretability Anthropic is hiring Episode 93 of The Mind Killer Talkin’ Fallout VibeCamp 0:00:17 – A Layman’s AI Refresher 0:21:06 – Aligned By Default 0:50:56 – Highlights from Anthropic’s Latest Interpretability Paper 1:26:47 – […]
NOW PLAYING
213 – Are Transformer Models Aligned By Default?
No transcript for this episode yet
Similar Episodes
Dec 5, 2025 ·50m
Oct 9, 2025 ·33m
Oct 3, 2025 ·40m
Sep 11, 2025 ·31m
Aug 27, 2025 ·39m
Aug 18, 2025 ·54m