EPISODE · May 29, 2026 · 46 MIN
“Claude Opus 4.8: The System Card” by Zvi
Only six weeks after Opus 4.7, we have Opus 4.8. For everyone, that means another incremental upgrade to Claude. It is once again smarter, and can do tasks for longer, and comes with a number of hot new features. For me, that also means reading another 244 page system card. It was only April 20 when I did a full review of the Opus 4.7 system card, plus an additional post focusing on related issues of model welfare. These updates are incremental and coming more rapidly, and this still is below the capability level of Claude Mythos, so the focus will be on the delta. What is different about Opus 4.8 versus what we already know about Opus 4.7 and Mythos? It turns out there's still a lot to talk about. Image created as self-portrait for this post by Claude Opus 4.8 Table of Contents Here We Go Again: Executive Summary. Introduction (1). RSP Evaluations (2). Move That Goalpost. The Failures Are News. Alignment Risk Slowly Rises. New Risk Pathways Just Dropped. Cyber (3). Harmful Requests (4.1). We Need To Talk (4.2 [...] ---Outline:(01:16) Here We Go Again: Executive Summary(02:33) Introduction (1)(02:42) RSP Evaluations (2)(03:47) Move That Goalpost(05:41) The Failures Are News(07:33) Alignment Risk Slowly Rises(09:00) New Risk Pathways Just Dropped(11:26) Cyber (3)(12:22) Harmful Requests (4.1)(14:23) We Need To Talk (4.2 and 4.3)(17:36) Overcoming Bias (4.4)(19:33) Agentic Safety (5)(21:40) Prompt Injection (5.2)(25:18) Alignment (6)(26:33) Looking For Problems(27:55) Who Watches The Training (6.2.2)(32:07) Automated Behavioral Audit(32:47) The Model Is Smarter Than The Eval (6.2.3.2)(34:39) You Should See The Other Guy(36:30) UK AISI Testing (6.2.4)(36:50) In Vendbench (6.2.5)(39:27) Honesty (6.3.3 to 6.3.6)(41:35) Chain of Thought (CoT) Monitorability (6.5)(44:09) What's In The Box? (6.6)(45:57) That's All For Now --- First published: May 29th, 2026 Source: https://www.lesswrong.com/posts/Gx6cJ6cG9JfeSNcLB/claude-opus-4-8-the-system-card --- Narrated by TYPE III AUDIO. ---Images from the article:Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.
NOW PLAYING
“Claude Opus 4.8: The System Card” by Zvi
No transcript for this episode yet
Similar Episodes
Dec 20, 2021 ·0m