PodParley

213 – Are Transformer Models Aligned By Default?

An episode of The Bayesian Conspiracy podcast, hosted by BayesianAdmin, titled "213 – Are Transformer Models Aligned By Default?", was published on May 29, 2024.

May 29, 2024 · The Bayesian Conspiracy

Our species has begun to scrute the inscrutable shoggoth! With Matt Freeman 🙂

LINKS
Anthropic’s latest AI Safety research paper, on interpretability
Anthropic is hiring
Episode 93 of The Mind Killer
Talkin’ Fallout
VibeCamp

0:00:17 – A Layman’s AI Refresher
0:21:06 – Aligned By Default
0:50:56 – Highlights from Anthropic’s Latest Interpretability Paper
1:26:47 – […]