EPISODE · May 13, 2026 · 1 MIN
[Linkpost] “Apollo Update May 2026” by Marius Hobbhahn
This is a link post. We now have an SF office. We're hiring for all technical roles in SF and London!The Scheming Research team focuses on two efforts We're focusing on figuring out the science of scheming. In particular, Will future models have misaligned preferences by default?Will training against misaligned preferences fail?improve our evaluations for scheming and loss of control for our evaluation campaigns with frontier AI labsWe're building out a monitoring team and coding agent monitoring product Research: We've published a scalable monitoring agenda and intend to publish a lot of research on how to build more accurate and reliable monitorsProduct: Watcher provides real-time monitors and other guardrails for coding agents and allows users to keep track of what all of their agents are doing. Our AI governance efforts will focus on the governance of automated AI R&D and recursively improving AI and the associated Loss of Control risks. Details: https://www.apolloresearch.ai/blog/apollo-update-may-2026/ --- First published: May 13th, 2026 Source: https://www.lesswrong.com/posts/4acQRDNyPs7tD8EED/apollo-update-may-2026 Linkpost URL:https://www.apolloresearch.ai/blog/apollo-update-may-2026/ --- Narrated by TYPE III AUDIO.
NOW PLAYING
[Linkpost] “Apollo Update May 2026” by Marius Hobbhahn
No transcript for this episode yet
Similar Episodes
Dec 20, 2021 ·0m