1

“Predicting Rare LLM Failures with 30× Fewer Rollouts” by Santiago Aranguri, Francisco Pernice

May 14, 2026

12:06

2

[Linkpost] “Claude is Now Alignment Pretrained” by RogerDearnaley

May 14, 2026

2:59

3

“The primary sources of near-term cybersecurity risk” by lc

May 14, 2026

4:02

4

“Most “inner work” looks like entertainment.” by Chris Lakin

May 14, 2026

4:58

5

[Linkpost] “Apollo Update May 2026” by Marius Hobbhahn

May 13, 2026

1:36

6

“Voters are surprisingly open to talking about AI risk” by less_raichu

May 13, 2026

8:33

7

“Childhood and Education #18: Do The Math” by Zvi

May 12, 2026

25:19

8

“The Owned Ones” by Eliezer Yudkowsky

May 12, 2026

9:28

9

“Optimisation: Selective versus Predictive” by Raymond Douglas

May 12, 2026

6:25

10

“AI companies are already profitable (in the way that matters)” by Yair Halberstadt

May 11, 2026

3:24

11

“The Iliad Intensive Course Materials” by Leon Lang, David Udell, Alexander Gietelink Oldenziel

May 11, 2026

29:35

12

“Empowerment, corrigibility, etc. are simple abstractions (of a messed-up ontology)” by Steven Byrnes

May 11, 2026

31:05

13

“How useful is the information you get from working inside an AI company?” by Buck, Anders Cairns Woodruff

May 11, 2026

13:22

14

“Who Got Breasts First and How We Got Them” by rba

May 11, 2026

21:13

15

“Anthropic’s strange fixation on “hyperstition”” by Simon Lermen

May 11, 2026

12:03

16

“How the AI Labs Make Profit (Maybe, Eventually)” by mabramov

May 11, 2026

6:18

17

“Sawtooth Problems” by Alexander Slugworth

May 10, 2026

43:01

18

“The Darwinian Honeymoon - Why I am not as impressed by human progress as I used to be” by Elias Schmied

May 10, 2026

7:13

19

“International Law Cannot Prevent Extinction Either” by Sausage Vector Machine

May 10, 2026

9:38

20

“Neural Networks learn Bloom Filters” by Alex Gibson

May 10, 2026

20:24

21

“If digital computers are conscious, they are conscious at the hardware level” by cube_flipper

May 9, 2026

36:14

22

“Why You Can’t Use Your Right to Try” by Stephen Martin

May 9, 2026

9:23

23

“A benchmark is a sensor” by Håvard Tveit Ihle, mabynke

May 9, 2026

5:33

24

“Bad Problems Don’t Stop Being Bad Because Somebody’s Wrong About Fault Analysis” by Linch

May 9, 2026

5:36

25

“Write Cause You Have Something to Say” by Logan Riggs

May 8, 2026

3:43

26

“AI is Breaking Two Vulnerability Cultures” by jefftk

May 8, 2026

3:51

27

“Is ProgramBench Impossible?” by frmsaul

May 8, 2026

5:06

28

“Bringing More Expertise to Bear on Alignment” by Edmund Lau, Geoffrey Irving, Cameron Holmes, David Africa

May 8, 2026

16:34

29

[Linkpost] “How to prevent AI’s 2008 moment (We’re hiring)” by felixgaston

May 8, 2026

4:49

30

“AI #167: The Prior Restraint Era Begins” by Zvi

May 8, 2026

86:45

31

“Mechanistic estimation for wide random MLPs” by Jacob_Hilton

May 7, 2026

9:14

32

“Natural Language Autoencoders Produce Unsupervised Explanations of LLM Activations” by Subhash Kantamneni, kitft, Euan Ong, Sam Marks

May 7, 2026

18:13

33

“Try, even if they have you cold” by WalterL

May 7, 2026

3:31

34

“A review of “Investigating the consequences of accidentally grading CoT during RL”” by Buck

May 7, 2026

15:27

35

“There is no evidence you should reapply sunscreen every 2 hours.” by Hide

May 7, 2026

16:07

36

“Many individual CEVs are probably quite bad” by Viliam

May 6, 2026

5:23

37

“x-risk-themed” by kave

May 6, 2026

6:10

38

“What is Anthropic?” by Zvi

May 6, 2026

17:30

39

“What if LLMs are mostly crystallized intelligence?” by deep

May 6, 2026

18:39

40

“Your rights when flying to Europe” by Yair Halberstadt

May 6, 2026

9:17

41

“Model Spec Midtraining: Improving How Alignment Training Generalizes” by Chloe Li, saraprice, Sam Marks, Jonathan Kutasov

May 6, 2026

13:47

42

“The AI Ad-Hoc Prior Restraint Era Begins” by Zvi

May 5, 2026

18:34

43

“Motivated reasoning, confirmation bias, and AI risk theory” by Seth Herd

May 5, 2026

78:41

44

“Are you looking up?” by Craig Green

May 5, 2026

15:57

45

[Linkpost] “Interpreting Language Model Parameters” by Lucius Bushnaq, Dan Braun, Oliver Clive-Griffin, Bart Bussmann, Nathan Hu, mivanitskiy, Linda Linsefors, Lee Sharkey

May 5, 2026

4:38

46

“Housing Roundup #15: The War Against Renters” by Zvi

May 5, 2026

25:50

47

“It’s nice of you to worry about me, but I really do have a life” by Viliam

May 4, 2026

6:59

48

“Irretrievability; or, Murphy’s Curse of Oneshotness upon ASI” by Eliezer Yudkowsky

May 4, 2026

37:38

49

“AI Industrial Takeoff — Part 1: Maximum growth rates with current technology” by djbinder

May 4, 2026

64:52

50

“Taking woo seriously but not literally” by Kaj_Sotala

May 4, 2026

38:43

51

“Dairy cows make their misery expensive (but their calves can’t)” by Elizabeth

May 3, 2026

12:56

52

“Measuring the ability of Opus 4.5 to fool narrow classifiers” by Fabien Roger, John Hughes

May 3, 2026

15:27

53

“A new rationalist self-improvement book: the 12 Levers” by spencerg

May 3, 2026

11:19

54

“OpenAI’s red line for AI self-improvement is fundamentally flawed” by Charbel-Raphaël

May 3, 2026

6:14

55

“You Are Not Immune To Mode Collapse” by J Bostock

May 2, 2026

8:25

56

“Primary Care Physicians are Incompetent. We Need More of Them.” by Hide

May 2, 2026

18:06

57

“How Go Players Disempower Themselves to AI” by Ashe Vazquez Nuñez

May 2, 2026

15:18

58

“How much should the ideal person cry wolf?” by KatjaGrace

May 1, 2026

3:01

59

“Conditional misalignment: Mitigations can hide EM behind contextual cues” by Jan Dubiński, Owain_Evans

May 1, 2026

24:13

60

“Risk from fitness-seeking AIs: mechanisms and mitigations” by Alex Mallen

May 1, 2026

63:19

61

“Sanity-checking “Incompressible Knowledge Probes”” by Sturb, LawrenceC

May 1, 2026

32:07

62

“AI unemployment and AI extinction are often the same” by KatjaGrace

May 1, 2026

3:32

63

“AI risk was not invented by AI CEOs to hype their companies” by KatjaGrace

May 1, 2026

6:49

64

“Cyborg evals” by Eye You, frmsaul

Apr 30, 2026

10:29

65

“To what extent is Qwen3-32B predicting its persona?” by Arjun Khandelwal, ryan_greenblatt, Alex Mallen

Apr 30, 2026

54:29

66

“Research Sabotage in ML Codebases” by egan

Apr 30, 2026

13:38

67

“Maybe I was too harsh on deep learning theory (three days ago)” by LawrenceC

Apr 30, 2026

5:01

68

“Notes on Transformer Consciousness” by slavachalnev

Apr 30, 2026

3:38

69

“On today’s panel with Bernie Sanders” by David Scott Krueger

Apr 30, 2026

4:11

70

“No Strong Orthogonality From Selection Pressure” by lumpenspace

Apr 30, 2026

20:00

71

“Learning zero, and what SLT gets wrong about it” by Dmitry Vaintrob

Apr 30, 2026

22:53

72

“The Most Important Charts In The World” by Zvi

Apr 30, 2026

10:27

73

“LLM Style Slop is Absolutely Everywhere” by silentbob

Apr 29, 2026

22:12

74

“Goblin Mode, 24 Hours Later” by Dylan Bowman

Apr 29, 2026

7:47

75

“Let Kids Keep More Productivity Gains” by jefftk

Apr 29, 2026

2:40

76

“llm assistant personas seem increasingly incoherent (some subjective observations)” by nostalgebraist

Apr 29, 2026

15:47

77

“Not a Paper: “Frontier Lab CEOs are Capable of In-Context Scheming”” by LawrenceC

Apr 29, 2026

14:51

78

“The Problem in the “Nerd Sniping” xkcd Comic” by peralice

Apr 29, 2026

19:15

79

“Recursive forecasting: Eliciting long-term forecasts from myopic fitness-seekers” by Jozdien, Alex Mallen

Apr 28, 2026

17:59

80

“Contra Binder on far-UVC and filtration” by jefftk

Apr 28, 2026

6:57

81

“Takes from two months as an aspiring LLM naturalist” by AnnaSalamon

Apr 28, 2026

15:59

82

“Forecasting is Not Overrated and It’s Probably Funded Appropriately” by Ben S.

Apr 28, 2026

9:17

83

“On the political feasibility of stopping AI” by David Scott Krueger

Apr 28, 2026

4:21

84

“Sleeper Agent Backdoor Results Are Messy” by Sebastian Prasanna, Alek Westover, Dylan Xu, Vivek Hebbar, Julian Stastny

Apr 28, 2026

18:21

85

“LessWrong Shows You Social Signals Before the Comment” by TurnTrout

Apr 28, 2026

8:31

86

“Fail safe(r) at alignment by channeling reward-hacking into a “spillway” motivation” by Anders Cairns Woodruff, Alex Mallen

Apr 27, 2026

31:30

87

“Curious cases of financial engineering in biotech” by Abhishaike Mahajan

Apr 27, 2026

44:16

88

“Update on the Alex Bores campaign” by Eric Neyman

Apr 27, 2026

6:44

89

“AI companies should publish security assessments” by ryan_greenblatt

Apr 27, 2026

5:42

90

“In defense of parents” by Yair Halberstadt

Apr 27, 2026

9:10

91

“The other paper that killed deep learning theory” by LawrenceC

Apr 27, 2026

11:29

92

“What holds AI safety together? Co-authorship networks from 200 papers” by Anna Thieser

Apr 27, 2026

5:33

93

““Bad faith” means intentionally misrepresenting your beliefs” by TFD

Apr 27, 2026

11:20

94

“Retrospective on my unsupervised elicitation challenge” by DanielFilan

Apr 27, 2026

12:36

95

“Control protocols don’t always need to know which models are scheming” by Fabien Roger

Apr 26, 2026

10:35

96

“Anthropic spent too much don’t-be-annoying capital on Mythos” by draganover

Apr 26, 2026

10:02

97

“The paper that killed deep learning theory” by LawrenceC

Apr 26, 2026

11:25

98

“Forecasting is Way Overrated, and We Should Stop Funding It” by mabramov

Apr 25, 2026

8:43

99

““Thinkhaven”” by Raemon

Apr 25, 2026

13:58

100

“Is the Cat Out of the Bag?: Who knows how to make AGI?” by Oliver Sourbut

Apr 25, 2026

9:39

101

“Against the “Permanent” Underclass” by Marcus Plutowski

Apr 25, 2026

31:06

102

“Quick Paper Review: “There Will Be a Scientific Theory of Deep Learning”” by LawrenceC

Apr 25, 2026

10:55

103

“Protecting Cognitive Integrity: Our internal AI use policy (V1)” by Tom DAVID

Apr 24, 2026

9:19

104

“Methodology for inferring propensities of LLMs” by Olli Järviniemi

Apr 24, 2026

10:27

105

“vLLM-Lens: Fast Interpretability Tooling That Scales to Trillion-Parameter Models” by Alan Cooney, Sid Black

Apr 24, 2026

9:58

106

“What Happens When a Model Thinks It Is AGI?” by josh :), David Africa

Apr 24, 2026

12:37

107

“Should We Train Against (CoT) Monitors?” by RohanS

Apr 23, 2026

62:55

108

“If Everyone Reads It, Nobody Dies - Course Launch” by Luc Brinkman, Chris-Lons

Apr 23, 2026

5:17

109

“Does your AI perform badly because you — you, specifically — are a bad person” by Natalie Cargill

Apr 23, 2026

14:19

110

“A “Lay” Introduction to “On the Complexity of Neural Computation in Superposition”” by LawrenceC

Apr 23, 2026

8:35

111

“An Angry Review of Greg Egan’s “Didicosm”” by LawrenceC

Apr 23, 2026

9:15

112

“Evil is bad, actually (Vassar and Olivia Schaefer)” by plex

Apr 23, 2026

15:33

113

“Your Supplies Probably Won’t Be Stolen in a Disaster” by jefftk

Apr 23, 2026

3:54

114

“Community misconduct disputes are not about facts” by mingyuan

Apr 23, 2026

3:25

115

“Why no new notations since 1960?” by Carl Feynman

Apr 23, 2026

1:48

116

“Narrow Secret Loyalty Dodges Black-Box Audits” by Alfie Lamerton, Fabien Roger

Apr 22, 2026

27:37

117

“10 posts I don’t have time to write” by habryka

Apr 22, 2026

9:11

118

“A taxonomy of barriers to trading with early misaligned AIs” by Alexa Pan

Apr 22, 2026

96:08

119

“$50 million a year for a 10% chance to ban ASI” by Andrea_Miotti, Alex Amadori, Gabriel Alfour

Apr 21, 2026

40:12

120

“Automated Deanonymization is Here” by jefftk

Apr 21, 2026

3:48

121

“Evil is bad, actually (Vassar and Olivia Schaefer callout post)” by plex

Apr 21, 2026

15:55

122

“10 non-boring ways I’ve used AI in the last month” by habryka

Apr 21, 2026

13:37

123

“Introducing LinuxArena” by Tyler Tracy, Ram Potham, Nick Kuhn, Myles H

Apr 21, 2026

9:30

124

“The “Budgeting” Skill Has The Most Betweenness Centrality (Probably)” by JenniferRM

Apr 20, 2026

19:47

125

“Finetuning Borges” by Linch

Apr 20, 2026

6:44

126

“9 kinds of hard-to-verify tasks” by Cleo Nardo

Apr 20, 2026

6:05

127

“How do LLMs generalize when we do training that is intuitively compatible with two off-distribution behaviors?” by dx26, Alek Westover, Vivek Hebbar, Sebastian Prasanna, Buck, Julian Stastny

Apr 20, 2026

37:38

128

“Automating philosophy if Timothy Williamson is correct” by Cleo Nardo

Apr 20, 2026

4:52

129

“CLR’s Safe Pareto Improvements Research Agenda” by Anthony DiGiovanni

Apr 20, 2026

46:20

130

“LLMs are about to disrupt algorithmic media feeds” by lsusr

Apr 20, 2026

3:59

131

“Resources for starting and growing an AI safety org” by Bryce Robertson, Søren Elverlin, Melissa Samworth, jakkdl

Apr 20, 2026

2:15

132

“Quality Matters Most When Stakes are Highest” by LawrenceC

Apr 20, 2026

5:44

133

“Feel like a room has bad vibes? The lighting is probably too “spiky” or too blue” by habryka

Apr 20, 2026

6:45

134

“I did a jhana meditation retreat (in 2024) with Jhourney and it was okay.” by Jules

Apr 20, 2026

14:16

135

“R1 CoT illegibility revisited” by nostalgebraist

Apr 19, 2026

11:45

136

“Reevaluating AGI Ruin in 2026” by lc

Apr 19, 2026

49:52

137

“If It’s Worth Arguing, It’s Worth Arguing With Whiteboards” by Drake Morrison

Apr 19, 2026

4:04

138

“There are only four skills: design, technical, management and physical” by habryka

Apr 19, 2026

10:07

139

“Having OCD is like living in North Korea (Here’s how I escaped)” by Declan Molony

Apr 18, 2026

59:00

140

“Claude knows who you are” by Smaug123

Apr 18, 2026

4:22

141

“Vladimir Putin’s CEV is probably pretty good” by habryka

Apr 18, 2026

8:58

142

“Post-mortem’ing my earliest ML research paper, 7 years later” by LawrenceC

Apr 18, 2026

12:33

143

“If You’ve Never Bought a Tool You Didn’t Need, You’re Not Buying Enough Tools” by Drake Morrison

Apr 18, 2026

4:39

144

“3” by AnnaJo

Apr 18, 2026

8:55

145

“Consent-Based RL: Letting Models Endorse Their Own Training Updates” by Logan Riggs

Apr 17, 2026

6:05

146

“Prompted CoT Early Exit Undermines the Monitoring Benefits of CoT Uncontrollability” by Elle Najt, Asa Cooper Stickland, Xander Davies

Apr 17, 2026

46:18

147

“Let goodness conquer all that it can defend” by habryka

Apr 17, 2026

11:12

148

“Specialization is a Driver of Natural Ontology” by johnswentworth

Apr 17, 2026

5:15

149

[Linkpost] “You can only build safe ASI if ASI is globally banned” by Connor Leahy

Apr 17, 2026

3:27

150

“Beware of Well-Written Posts” by alseph

Apr 17, 2026

7:19

151

“You Aren’t in Charge of the Overton Window; Politics Is Not Interior Design” by Davidmanheim

Apr 16, 2026

22:06

152

“Carpathia Day” by Drake Morrison

Apr 16, 2026

3:53

153

“Do not conquer what you cannot defend” by habryka

Apr 16, 2026

10:30

154

“What is the Iliad Intensive?” by Leon Lang, Alexander Gietelink Oldenziel, David Udell

Apr 16, 2026

4:55

155

“The Mirror Test Is Complicated” by J Bostock

Apr 15, 2026

8:15

156

“Contra Leicht on AI Pauses” by David Scott Krueger (formerly: capybaralet)

Apr 15, 2026

10:09

157

“Nectome: All That I Know” by Raelifin

Apr 15, 2026

80:03

158

“Effective Altruism, Seen From Slytherin” by Xylix

Apr 15, 2026

7:48

159

“Majority Report” by peralice

Apr 15, 2026

19:25

160

“Current AIs seem pretty misaligned to me” by ryan_greenblatt

Apr 15, 2026

65:04

161

“Contra Byrnes on UV & Cancer” by HedonicEscalator

Apr 15, 2026

13:20

162

“Everyone Has a Plan Until They Get Social Pressure To the Face” by Czynski

Apr 15, 2026

10:48

163

“Mechanisms of Introspective Awareness” by Uzay Macar

Apr 14, 2026

32:49

164

“Load-Bearing Sincerity: On the Motive Reinforcement Thesis” by Fiora Starlight

Apr 14, 2026

30:39

165

“Diary of a “Doomer”: 12+ years arguing about AI risk (part 1)” by David Scott Krueger (formerly: capybaralet)

Apr 14, 2026

18:29

166

“A Retrospective of Richard Ngo’s 2022 List of Conceptual Alignment Projects” by LawrenceC

Apr 14, 2026

17:53

167

“From personas to intentions: towards a science of motivations for AI models” by David Africa, Jacob Pfau

Apr 14, 2026

14:54

168

“The Shapley Share of Responsibility?” by Raemon

Apr 14, 2026

6:07

169

“Who Killed Common Law?” by Benquo

Apr 14, 2026

8:05

170

“Anthropic repeatedly accidentally trained against the CoT, demonstrating inadequate processes” by Alex Mallen, ryan_greenblatt

Apr 14, 2026

11:27

171

“Meaningful Questions Have Return Types” by Drake Morrison

Apr 14, 2026

5:24

172

“Only Law Can Prevent Extinction” by Eliezer Yudkowsky

Apr 13, 2026

38:31

173

“AI Safety’s Biggest Talent Gap Isn’t Researchers. It’s Generalists.” by Topaz, agucova, Alexandra Bates, Parv Mahajan

Apr 13, 2026

13:51

174

“Tomas Bjartur: The Last Prodigy” by Linch

Apr 13, 2026

17:12

175

“Annoyingly Principled People, and what befalls them” by Raemon

Apr 13, 2026

7:48

176

“TAPs or it didn’t happen” by Raemon

Apr 13, 2026

7:20

177

“Returns to intelligence” by RobertM

Apr 13, 2026

4:08

178

“Daycare illnesses” by Nina Panickssery

Apr 13, 2026

10:10

179

“The policy surrounding Mythos marks an irreversible power shift” by sil

Apr 13, 2026

3:40

180

“Talk English, Think Something Else” by J Bostock

Apr 13, 2026

4:59

181

“Sparse Autoencoders for Single-Cell Models” by Ihor Kendiukhov

Apr 13, 2026

3:55

182

“Eggs, rooms, puzzles, and talking about AI” by KatjaGrace

Apr 13, 2026

7:49

183

“Morale” by J Bostock

Apr 12, 2026

4:43

184

“Your Mom is a Chimera” by michaelwaves

Apr 12, 2026

5:39

185

“The Blast Radius Principle” by Martin Sustrik

Apr 12, 2026

18:09

186

“How to make good tea” by RobertM

Apr 12, 2026

0:00

187

“Catching illicit distributed training operations during an AI pause” by Robi Rahman

Apr 12, 2026

7:18

188

[Linkpost] “Scott Alexander gentrified my meetup” by dominicq

Apr 11, 2026

2:36

189

“Pausing AI Is the Best Answer to Post-Alignment Problems” by MichaelDickens

Apr 11, 2026

5:21

190

“Some thoughts on Nectome’s risk and resilience” by Aurelia

Apr 11, 2026

21:41

191

“Chocolate Sloths, Tinder, and Moral Backstops” by J Bostock

Apr 11, 2026

7:22

192

“Dario probably doesn’t believe in superintelligence” by RobertM

Apr 11, 2026

12:33

193

“The Unintelligibility is Ours: Notes on Chain-of-Thought” by 1a3orn

Apr 11, 2026

12:43

194

“If Mythos actually made Anthropic employees 4x more productive, I would radically shorten my timelines” by ryan_greenblatt

Apr 11, 2026

13:05

195

“Why Control Creates Conflict, and When to Open Instead” by plex

Apr 10, 2026

5:30

196

“Reproducing steering against evaluation awareness in a large open-weight model” by Thomas Read, Bronson Schoen, Joseph Bloom

Apr 10, 2026

35:17

197

“Have we already lost? Part 2: Reasons for Doom” by LawrenceC

Apr 10, 2026

6:31

198

“Model organisms researchers should check whether high LRs defeat their model organisms” by dx26, Sebastian Prasanna, Alek Westover, Vivek Hebbar, Julian Stastny

Apr 10, 2026

12:45

199

“Anthropic did not publish a “risk discussion” of Mythos when required by their RSP” by RobertM

Apr 10, 2026

7:12

200

“Some takes on UV & cancer” by Steven Byrnes

Apr 10, 2026

12:09

201

“Help me launch Obsolete: a book aimed at building a new movement for AI reform” by garrison

Apr 9, 2026

12:08

202

“Slightly-Super Persuasion Will Do” by Tomás B.

Apr 9, 2026

7:19

203

“Have we already lost? Part 1: The Plan in 2024” by LawrenceC

Apr 9, 2026

5:30

204

“Do not be surprised if LessWrong gets hacked” by RobertM

Apr 9, 2026

7:37

205

“One Week in the Rat Farm” by Philip Harker

Apr 9, 2026

14:22

206

“101 Humans of New York on the Risks of AI” by Corm

Apr 9, 2026

12:25

207

“Baking tips” by RobertM

Apr 8, 2026

7:00

208

“An easy coordination problem?” by KatjaGrace

Apr 8, 2026

1:57

209

“Excerpts and Notes on Mythos Model Card” by williawa

Apr 8, 2026

28:33

210

“The effects of caffeine consumption do not decay with a ~5 hour half-life” by kman

Apr 8, 2026

10:24

211

“You don’t know what you are made of till you’ve been stalked across three countries” by Shoshannah Tekofsky

Apr 8, 2026

11:51

212

“Why is Flesh So Weak?” by J Bostock

Apr 8, 2026

5:19

213

“The hard part isn’t noticing when papers are bad, it’s deciding what to do afterwards” by LawrenceC

Apr 8, 2026

3:56

214

“We can prevent progress! Conceptual clarity, and inspiration from the FDA” by KatjaGrace

Apr 8, 2026

5:12

215

“AI as a Trojan horse race” by KatjaGrace

Apr 8, 2026

3:14

216

“My unsupervised elicitation challenge” by DanielFilan

Apr 8, 2026

4:40

217

“Role-playing vs Self-modelling” by Jan_Kulveit

Apr 8, 2026

6:57

218

“Elementary Condensation” by Jan

Apr 8, 2026

19:55

219

“Hedging and Survival-Weighted Planning” by Vaniver

Apr 8, 2026

5:05

220

“Opus’s Schelling Steganography Has Amplifiable Secrecy Against Weaker Eavesdroppers” by Elle Najt

Apr 8, 2026

95:55

221

“An Alignment Journal: Features and policies” by JessRiedel, Dan MacKinlay, Luca, Daniel Murfet, david reinstein

Apr 8, 2026

49:48

222

“Fantasy ideology” by Ninety-Three

Apr 7, 2026

18:56

223

[Linkpost] “Questions raised about OpenAI leaders’ trustworthiness by the New Yorker” by Remmelt

Apr 7, 2026

2:43

224

“Claude Mythos System Card Preview” by anaguma

Apr 7, 2026

6:09

225

“My picture of the present in AI” by ryan_greenblatt

Apr 7, 2026

21:05

226

[Linkpost] “[Paper] Stringological sequence prediction I” by Vanessa Kosoy

Apr 7, 2026

4:16

227

“We’re actually running out of benchmarks to upper bound AI capabilities” by LawrenceC

Apr 7, 2026

8:01

228

“Don’t write for LLMs, just record everything” by RobertM

Apr 7, 2026

10:22

229

“Contra Nina Panickssery on advice for children” by Sean Herrington

Apr 7, 2026

6:08

230

“By Strong Default, ASI Will End Liberal Democracy” by MichaelDickens

Apr 7, 2026

6:10

231

“AIs can now often do massive easy-to-verify SWE tasks and I’ve updated towards shorter timelines” by ryan_greenblatt

Apr 6, 2026

29:31

232

“Paper close reading: “Why Language Models Hallucinate”” by LawrenceC

Apr 6, 2026

17:33

233

“Ten different ways of thinking about Gradual Disempowerment” by David Scott Krueger (formerly: capybaralet)

Apr 5, 2026

10:11

234

“11 pieces of advice for children” by Nina Panickssery

Apr 5, 2026

5:55

235

“Steering Might Stop Working Soon” by J Bostock

Apr 5, 2026

7:31

236

“Am I the baddie?” by Ustice

Apr 5, 2026

4:41

237

“Academic Proof-of-Work in the Age of LLMs” by LawrenceC

Apr 5, 2026

4:46

238

“Positive sum does not mean “win-win”” by loops

Apr 5, 2026

4:27

239

“Considerations for growing the pie” by Zach Stein-Perlman

Apr 5, 2026

5:21

240

““Following the incentives”” by David Scott Krueger (formerly: capybaralet)

Apr 4, 2026

5:03

241

“Chicken-Free Egg Whites” by jefftk

Apr 4, 2026

2:31

242

“dark ilan” by ozymandias

Apr 4, 2026

19:35

243

“Mean field sequence: an introduction” by Dmitry Vaintrob, Lauren Greenspan

Apr 4, 2026

23:36

244

“Democracy Dies With The Rifleman” by Vaniver

Apr 4, 2026

4:41

245

“The bar is lower than you think” by XelaP

Apr 4, 2026

4:16

246

“Did Anyone Predict the Industrial Revolution?” by Lost Futures

Apr 4, 2026

9:14

247

“Why do I believe preserving structure is enough?” by Aurelia

Apr 4, 2026

11:32

248

“There should be $100M grants to automate AI safety” by Marius Hobbhahn

Apr 3, 2026

16:37

249

“Sadly, The Whispering Earring” by Dentosal

Apr 3, 2026

4:26

250

“Common research advice #2: say precisely what you want to say” by LawrenceC

Apr 3, 2026

5:43

251

“2026: The year of throwing my agency at my health (now with added cyborgism)” by Ruby

Apr 3, 2026

7:03

252

[Linkpost] “Q1 2026 Timelines Update” by Daniel Kokotajlo, elifland, bhalstead

Apr 3, 2026

6:09

253

“How social ideas get corrupt” by Kaj_Sotala

Apr 2, 2026

13:33

254

“The Indestructible Future” by WillPetillo

Apr 2, 2026

8:02

255

“My most common advice for junior researchers” by LawrenceC

Apr 2, 2026

5:16

256

“The Practical Guide to Superbabies” by GeneSmith

Apr 2, 2026

58:09

257

“The Corner-Stone” by Benquo

Apr 2, 2026

32:05

258

“Systematically dismantle the AI compute supply chain.” by David Scott Krueger (formerly: capybaralet)

Apr 2, 2026

8:16

259

“The quest for general intelligence is hitting a wall” by Sean Herrington

Apr 2, 2026

3:40

260

“Intelligence Dissolves Privacy” by Vaniver

Apr 2, 2026

10:45

261

“Anthropic’s Pause is the Most Expensive Alarm in Corporate History” by Ruby

Apr 2, 2026

25:06

262

“I’m Suing Anthropic for Unauthorized Use of My Personality” by Linch

Apr 2, 2026

17:58

263

“Orders of magnitude: use semitones, not decibels” by Oliver Sourbut

Apr 2, 2026

9:27

264

“Dying with Whimsy” by NickyP

Apr 2, 2026

6:12

265

“AI for AI for Epistemics” by owencb, Lukas Finnveden

Apr 1, 2026

17:50

266

“Announcing Doublehaven with Reflections on Humour” by J Bostock

Apr 1, 2026

9:22

267

“Save the Sun Shrimp!” by Jack

Apr 1, 2026

7:52

268

“LIMBO: Who We Are, What We Do, and an Exciting High-Impact Funding Opportunity” by faul_sname

Apr 1, 2026

23:04

269

“Chat, is this sus?” by Tyler Tracy

Apr 1, 2026

3:07

270

““You Have Not Been a Good User” (LessWrong’s second album)” by habryka

Apr 1, 2026

1:50

271

“Lesswrong Liberated” by Ronny Fernandez

Apr 1, 2026

3:21

272

“The Claude Code Source Leak” by Error

Apr 1, 2026

2:18

273

“Experiments With Opus 4.6’s Fiction” by Tomás B.

Mar 31, 2026

23:46

274

“Product Alignment is not Superintelligence Alignment (and we need the latter to survive)” by plex

Mar 31, 2026

4:21

275

“Co-Found Lens Academy With Me. (We have early users and funding)” by Luc Brinkman

Mar 31, 2026

11:25

276

“Slack in Cells, Slack in Brains” by Mateusz Bagiński

Mar 31, 2026

10:55

277

“I am definitely missing the pre-AI writing era” by N. Cailie

Mar 31, 2026

4:06

278

“The state of AI safety in four fake graphs” by Boaz Barak

Mar 30, 2026

4:29

279

“AI should be a good citizen, not just a good assistant” by Tom Davidson, wdmacaskill

Mar 30, 2026

33:45

280

“(Some) Natural Emergent Misalignment from Reward Hacking in Non-Production RL” by 7vik, Sid Black, Joseph Bloom

Mar 30, 2026

46:07

281

[Linkpost] “Parkinson’s Law of Worry” by Jakub Halmeš

Mar 29, 2026

2:54

282

“Folie à Machine: LLMs and Epistemic Capture” by DaystarEld

Mar 29, 2026

36:43

283

“Stop asking “how good is this” to decide between donation opportunities I recommend” by Zach Stein-Perlman

Mar 29, 2026

2:50

284

“Nick Bostrom: How big is the cosmic endowment?” by Zach Stein-Perlman

Mar 28, 2026

5:48

285

“Don’t Overdose Locally Beneficial Changes” by Mateusz Bagiński

Mar 28, 2026

6:35

286

“Stanley Milgram wasn’t pessimistic enough about human nature?” by David Gross

Mar 28, 2026

5:42

287

[Linkpost] “What if superintelligence is just weak?” by Simon Lermen

Mar 28, 2026

4:55

288

“Pray for Casanova” by Tomás B.

Mar 28, 2026

9:57

289

“ControlAI 2025 Impact Report” by Andrea_Miotti, Alex Amadori

Mar 27, 2026

8:35

290

“AI’s capability improvements haven’t come from it getting less affordable” by Anders Woodruff

Mar 27, 2026

17:21

291

“Scaffolded Reproducers, Scaffolded Agents” by Mateusz Bagiński

Mar 27, 2026

5:57

292

“My hobby: running deranged surveys” by leogao

Mar 27, 2026

16:56

293

“The Terrarium” by Caleb Biddulph

Mar 26, 2026

51:05

294

“Sen. Sanders (I-VT) and Rep. Ocasio-Cortez (D-NY) propose AI Data Center Moratorium Act” by Matrice Jacobine

Mar 26, 2026

1:25

295

“Test your best methods on our hard CoT interp tasks” by daria, Riya Tyagi, Josh Engels, Neel Nanda

Mar 26, 2026

42:02

296

““What Exactly Would An International AI Treaty Say?” Is a Bad Objection” by Davidmanheim

Mar 26, 2026

11:43
