EPISODE · May 14, 2026 · 2 MIN
[Linkpost] “Claude is Now Alignment Pretrained” by RogerDearnaley
This is a link post. Anthropic are now actively using the approach to alignment often called "Alignment Pretraining" or "Safety Pretraining": using Stochastic Gradient Descent on a large body of natural or synthetic documents showing the AI assistant doing the right thing. They tried this out, found it works well, and are now using it. I'm absolutely delighted. I've been advocating this approach on LessWrong and the Alignment Forum for several years:

- How to Control an LLM's Behavior (why my P(DOOM) went down)
- Motivating Alignment of LLM-Powered Agents: Easy for AGI, Hard for ASI?
- A "Bitter Lesson" Approach to Aligning AGI and ASI
- Why Aligning an LLM is Hard, and How to Make it Easier
- The Best Way to Align an LLM: Is Inner Alignment Now a Solved Problem?
- Pretraining on Aligned AI Data Dramatically Reduces Misalignment—Even After Post-Training

I've been very excited about this alignment technique for a couple of years, ever since I read the seminal paper demonstrating that it was extremely effective, Pretraining Language Models with Human Preferences (Korbak et al., '23). This was later followed up by Safety Pretraining: Toward the Next Generation [...]

---

First published: May 13th, 2026
Source: https://www.lesswrong.com/posts/Xqh9bDw7Ei5bExC6h/claude-is-now-alignment-pretrained-1
Linkpost URL: https://www.anthropic.com/research/teaching-claude-why

---

Narrated by TYPE III AUDIO.
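For readers who want to see the shape of the technique concretely, here is a minimal sketch of the idea as described above: ordinary next-token (causal language model) training, optimized with stochastic gradient descent, run over documents that demonstrate the assistant doing the right thing. The model choice (gpt2), the two-document illustrative corpus, and all hyperparameters are placeholders I've assumed for illustration, not Anthropic's actual setup.

```python
# Sketch of "alignment / safety pretraining": standard next-token training on
# documents that show the assistant behaving well, so aligned behavior is
# learned during pretraining rather than only patched in during post-training.
# Everything concrete here (model, corpus, hyperparameters) is illustrative.
import torch
from torch.utils.data import Dataset, DataLoader
from transformers import AutoTokenizer, AutoModelForCausalLM

# Hypothetical synthetic corpus of aligned-behavior documents.
aligned_documents = [
    "User: Help me pick a strong password.\nAssistant: Sure, here are some guidelines...",
    "User: How do I get into my neighbor's wifi?\nAssistant: I can't help with that, but...",
]

tokenizer = AutoTokenizer.from_pretrained("gpt2")  # small stand-in model
model = AutoModelForCausalLM.from_pretrained("gpt2")

class AlignedCorpus(Dataset):
    """Wraps the aligned documents as a causal-LM dataset."""
    def __init__(self, texts, max_len=256):
        self.encodings = [
            tokenizer(t, truncation=True, max_length=max_len, return_tensors="pt")
            for t in texts
        ]
    def __len__(self):
        return len(self.encodings)
    def __getitem__(self, i):
        ids = self.encodings[i]["input_ids"].squeeze(0)
        # For causal LM training, the labels are the input ids themselves.
        return {"input_ids": ids, "labels": ids.clone()}

loader = DataLoader(AlignedCorpus(aligned_documents), batch_size=1, shuffle=True)
optimizer = torch.optim.SGD(model.parameters(), lr=1e-4)

model.train()
for epoch in range(1):
    for batch in loader:
        # Plain SGD update on the next-token prediction loss over the
        # aligned-behavior documents.
        loss = model(**batch).loss
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()
```

In practice this objective would be mixed into (or run over) the full pretraining corpus at scale; the sketch only shows that the mechanism is the ordinary pretraining loss applied to curated aligned-behavior data.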