All Episodes
AI Safety Newsletter — 78 episodes
AISN #72: New Research on AI Wellbeing
AISN #71: Cyberattacks & Datacenter Moratorium Bill
AISN #70: AI Layoffs and Automated Warfare
AISN #69: Department of War, Anthropic, and National Security
AISN #68: Moltbook Exposes Risky AI Behavior
AISN #67: Trump’s preemption order, H200s go to China, and new frontier AI from OpenAI and DeepSeek
AISN #66: AISN #66: Evaluating Frontier Models, New Gemini and Claude, Preemption is Back
AISN #65: Measuring Automation and Superintelligence Moratorium Letter
AISN #63: New AGI Definition and Senate Bill Would Establish Liability for AI Harms
AISN #63: California’s SB-53 Passes the Legislature
AISN #62: Big Tech Launches $100 Million pro-AI Super PAC
AISN #61: OpenAI Releases GPT-5
AISN #60: The AI Action Plan
AISN #59: EU Publishes General-Purpose AI Code of Practice
AISN #58: Senate Removes State AI Regulation Moratorium
AISN #57: The RAISE Act
AISN #56: Google Releases Veo 3
AISN #55: Trump Administration Rescinds AI Diffusion Rule, Allows Chip Sales to Gulf States
AISN #54: OpenAI Updates Restructure Plan
AISN #53: An Open Letter Attempts to Block OpenAI Restructuring
AISN #52: An Expert Virology Benchmark
AISN #51: AI Frontiers
AISN #50: AI Action Plan Responses
AISN #49: AI Action Plan Responses
AISN
Superintelligence Strategy: Expert Version
Superintelligence Strategy: Standard Version
AISN #48: Utility Engineering and EnigmaEval
AISN #47: Reasoning Models
AISN #46: The Transition
AISN #45: Center for AI Safety 2024 Year in Review
AISN #44: The Trump Circle on AI Safety
AISN #43: White House Issues First National Security Memo on AI
AISN #42: Newsom Vetoes SB 1047
AISN #41: The Next Generation of Compute Scale
AISN #40: California AI Legislation
AISN #39: Implications of a Trump Administration for AI Policy
AISN #38: Supreme Court Decision Could Limit Federal Ability to Regulate AI
AISN #37: US Launches Antitrust Investigations
AISN #36: Voluntary Commitments are Insufficient
AISN #35: Lobbying on AI Regulation
AISN #34: New Military AI Systems
AISN #33: Reassessing AI and Biorisk
AISN #32: Measuring and Reducing Hazardous Knowledge in LLMs
AISN #31: A New AI Policy Bill in California
AISN #30: Investments in Compute and Military AI
AISN #29: Progress on the EU AI Act
The Landscape of US AI Legislation
AISN #28: Center for AI Safety 2023 Year in Review
AISN #27: Defensive Accelerationism
AISN #26: National Institutions for AI Safety
AISN #25: White House Executive Order on AI, UK AI Safety Summit, and Progress on Voluntary Evaluations of AI Risks.
AISN #24: Kissinger Urges US-China Cooperation on AI, China’s New AI Law, US Export Controls, International Institutions, and Open Source AI.
AISN #23: New OpenAI Models, News from Anthropic, and Representation Engineering.
AISN #21: Google DeepMind’s GPT-4 Competitor, Military Investments in Autonomous Drones, The UK AI Safety Summit, and Case Studies in AI Policy.
AISN #20: LLM Proliferation, AI Deception, and Continuing Drivers of AI Capabilities.
[Paper] “An Overview of Catastrophic AI Risks” by Dan Hendrycks, Mantas Mazeika and Thomas Woodside
[Paper] “X-Risk Analysis for AI Research” by Dan Hendrycks and Mantas Mazeika
[Paper] “Unsolved Problems in ML Safety” by Dan Hendrycks, Nicholas Carlini, John Schulman and Jacob Steinhardt
AISN #19: US-China Competition on AI Chips, Measuring Language Agent Developments, Economic Analysis of Language Model Propaganda, and White House AI Cyber Challenge.
AISN #18: Challenges of Reinforcement Learning from Human Feedback, Microsoft’s Security Breach, and Conceptual Research on AI Safety.
AISN #17: Automatically Circumventing LLM Guardrails, the Frontier Model Forum, and Senate Hearing on AI Oversight.
AISN #16: White House Secures Voluntary Commitments from Leading AI Labs, and Lessons from Oppenheimer .
AISN #15: China and the US take action to regulate AI, results from a tournament forecasting AI risk, updates on xAI’s plan, and Meta releases its open-source and commercially available Llama 2.
AISN #14: OpenAI’s ‘Superalignment’ team, Musk’s xAI launches, and developments in military AI use .
AISN #13: An interdisciplinary perspective on AI proxy failures, new competitors to ChatGPT, and prompting language models to misbehave.
AISN #12: Policy Proposals from NTIA’s Request for Comment, and Reconsidering Instrumental Convergence.
AISN #11: An Overview of Catastrophic AI Risks.
AISN #10: How AI could enable bioterrorism, and policymakers continue to focus on AI .
AISN #9: Statement on Extinction Risks, Competitive Pressures, and When Will AI Reach Human-Level? .
AISN #8: Why AI could go rogue, how to screen for AI risks, and grants for research on democratic governance of AI.
AISN #7: Disinformation, recommendations for AI labs, and Senate hearings on AI.
AISN #6: Examples of AI safety progress, Yoshua Bengio proposes a ban on AI agents, and lessons from nuclear arms control .
AISN #5: Geoffrey Hinton speaks out on AI risk, the White House meets with AI labs, and Trojan attacks on language models.
AISN #4: AI and cybersecurity, persuasive AIs, weaponization, and Hinton talks AI risks.
AISN #3: AI policy proposals and a new challenger approaches.
AISN #2: ChaosGPT and the rise of language model agents, evolutionary pressures and AI, AI safety in the media.
AISN #1: Public opinion on AI, plugging ChatGPT into the internet, and the economic impacts of language models..