AI Safety Newsletter cover art

All Episodes

AI Safety Newsletter — 78 episodes

#
Title
1

AISN #72: New Research on AI Wellbeing

2

AISN #71: Cyberattacks & Datacenter Moratorium Bill

3

AISN #70: AI Layoffs and Automated Warfare

4

AISN #69: Department of War, Anthropic, and National Security

5

AISN #68: Moltbook Exposes Risky AI Behavior

6

AISN #67: Trump’s preemption order, H200s go to China, and new frontier AI from OpenAI and DeepSeek

7

AISN #66: AISN #66: Evaluating Frontier Models, New Gemini and Claude, Preemption is Back

8

AISN #65: Measuring Automation and Superintelligence Moratorium Letter

9

AISN #63: New AGI Definition and Senate Bill Would Establish Liability for AI Harms

10

AISN #63: California’s SB-53 Passes the Legislature

11

AISN #62: Big Tech Launches $100 Million pro-AI Super PAC

12

AISN #61: OpenAI Releases GPT-5

13

AISN #60: The AI Action Plan

14

AISN #59: EU Publishes General-Purpose AI Code of Practice

15

AISN #58: Senate Removes State AI Regulation Moratorium

16

AISN #57: The RAISE Act

17

AISN #56: Google Releases Veo 3

18

AISN #55: Trump Administration Rescinds AI Diffusion Rule, Allows Chip Sales to Gulf States

19

AISN #54: OpenAI Updates Restructure Plan

20

AISN #53: An Open Letter Attempts to Block OpenAI Restructuring

21

AISN #52: An Expert Virology Benchmark

22

AISN #51: AI Frontiers

23

AISN #50: AI Action Plan Responses

24

AISN #49: AI Action Plan Responses

25

AISN

26

Superintelligence Strategy: Expert Version

27

Superintelligence Strategy: Standard Version

28

AISN #48: Utility Engineering and EnigmaEval

29

AISN #47: Reasoning Models

30

AISN #46: The Transition

31

AISN #45: Center for AI Safety 2024 Year in Review

32

AISN #44: The Trump Circle on AI Safety

33

AISN #43: White House Issues First National Security Memo on AI

34

AISN #42: Newsom Vetoes SB 1047

35

AISN #41: The Next Generation of Compute Scale

36

AISN #40: California AI Legislation

37

AISN #39: Implications of a Trump Administration for AI Policy

38

AISN #38: Supreme Court Decision Could Limit Federal Ability to Regulate AI

39

AISN #37: US Launches Antitrust Investigations

40

AISN #36: Voluntary Commitments are Insufficient

41

AISN #35: Lobbying on AI Regulation

42

AISN #34: New Military AI Systems

43

AISN #33: Reassessing AI and Biorisk

44

AISN #32: Measuring and Reducing Hazardous Knowledge in LLMs

45

AISN #31: A New AI Policy Bill in California

46

AISN #30: Investments in Compute and Military AI

47

AISN #29: Progress on the EU AI Act

48

The Landscape of US AI Legislation

49

AISN #28: Center for AI Safety 2023 Year in Review

50

AISN #27: Defensive Accelerationism

51

AISN #26: National Institutions for AI Safety

52

AISN #25: White House Executive Order on AI, UK AI Safety Summit, and Progress on Voluntary Evaluations of AI Risks.

53

AISN #24: Kissinger Urges US-China Cooperation on AI, China’s New AI Law, US Export Controls, International Institutions, and Open Source AI.

54

AISN #23: New OpenAI Models, News from Anthropic, and Representation Engineering.

55

AISN #21: Google DeepMind’s GPT-4 Competitor, Military Investments in Autonomous Drones, The UK AI Safety Summit, and Case Studies in AI Policy.

56

AISN #20: LLM Proliferation, AI Deception, and Continuing Drivers of AI Capabilities.

57

[Paper] “An Overview of Catastrophic AI Risks” by Dan Hendrycks, Mantas Mazeika and Thomas Woodside

58

[Paper] “X-Risk Analysis for AI Research” by Dan Hendrycks and Mantas Mazeika

59

[Paper] “Unsolved Problems in ML Safety” by Dan Hendrycks, Nicholas Carlini, John Schulman and Jacob Steinhardt

60

AISN #19: US-China Competition on AI Chips, Measuring Language Agent Developments, Economic Analysis of Language Model Propaganda, and White House AI Cyber Challenge.

61

AISN #18: Challenges of Reinforcement Learning from Human Feedback, Microsoft’s Security Breach, and Conceptual Research on AI Safety.

62

AISN #17: Automatically Circumventing LLM Guardrails, the Frontier Model Forum, and Senate Hearing on AI Oversight.

63

AISN #16: White House Secures Voluntary Commitments from Leading AI Labs, and Lessons from Oppenheimer .

64

AISN #15: China and the US take action to regulate AI, results from a tournament forecasting AI risk, updates on xAI’s plan, and Meta releases its open-source and commercially available Llama 2.

65

AISN #14: OpenAI’s ‘Superalignment’ team, Musk’s xAI launches, and developments in military AI use .

66

AISN #13: An interdisciplinary perspective on AI proxy failures, new competitors to ChatGPT, and prompting language models to misbehave.

67

AISN #12: Policy Proposals from NTIA’s Request for Comment, and Reconsidering Instrumental Convergence.

68

AISN #11: An Overview of Catastrophic AI Risks.

69

AISN #10: How AI could enable bioterrorism, and policymakers continue to focus on AI .

70

AISN #9: Statement on Extinction Risks, Competitive Pressures, and When Will AI Reach Human-Level? .

71

AISN #8: Why AI could go rogue, how to screen for AI risks, and grants for research on democratic governance of AI.

72

AISN #7: Disinformation, recommendations for AI labs, and Senate hearings on AI.

73

AISN #6: Examples of AI safety progress, Yoshua Bengio proposes a ban on AI agents, and lessons from nuclear arms control .

74

AISN #5: Geoffrey Hinton speaks out on AI risk, the White House meets with AI labs, and Trojan attacks on language models.

75

AISN #4: AI and cybersecurity, persuasive AIs, weaponization, and Hinton talks AI risks.

76

AISN #3: AI policy proposals and a new challenger approaches.

77

AISN #2: ChaosGPT and the rise of language model agents, evolutionary pressures and AI, AI safety in the media.

78

AISN #1: Public opinion on AI, plugging ChatGPT into the internet, and the economic impacts of language models..