PodParley PodParley
Sobering Up on AI Progress w/ Dr. Sean McGregor

EPISODE · Dec 29, 2025 · 1H 13M

Sobering Up on AI Progress w/ Dr. Sean McGregor

from Into AI Safety · host Jacob Haimes

Sean McGregor and I discuss about why evaluating AI systems has become so difficult; we cover everything from the breakdown of benchmarking, how incentives shape safety work, and what approaches like BenchRisk (his recent paper at NeurIPS) and AI auditing aim to fix as systems move into the real world. We also talk about his history and journey in AI safety, including his PhD on ML for public policy, how he started the AI Incident Database, and what he's working on now: AVERI, a non-profit for frontier model auditing.Chapters(00:00) - Intro (02:36) - What's broken about benchmarking (03:41) - Sean’s wild PhD (14:28) - The phantom internship (19:25) - Sean's journey (22:25) - Market-vs-regulatory modes and AIID (32:13) - Drunk on AI progress (38:34) - BenchRisk (43:20) - Moral hazards and Master Hand (50:34) - Liability, Section 230, and open source (59:20) - AVERI (01:11:30) - Closing thoughts & outro LinksSean McGregor's websiteAVERI websiteBenchRiskBenchRisk websiteNeurIPS paper - Risk Management for Mitigating Benchmark Failure Modes: BenchRiskNeurIPS paper - AI and the Everything in the Whole Wide World BenchmarkAIIDAI Incident Database websiteIAAI paper - Preventing Repeated Real World AI Failures by Cataloging Incidents: The AI Incident DatabasePreprint - Lessons for Editors of AI Incidents from the AI Incident DatabaseAIAAIC website (another incident tracker)Hot AI SummerCACM article - A Few Useful Things to Know About Machine LearningCACM article - How the AI Boom Went BustUndergraduate Thesis - Analyzing the Prospect of an Approaching AI WinterTech Genies article - AI History: The First Summer and Winter of AICACM article - There Was No ‘First AI Winter’Measuring GeneralizationNeural Computation article - The Lack of A Priori Distinctions Between Learning AlgorithmsICLR paper - Understanding deep learning requires rethinking generalizationICML paper - Model-agnostic Measure of Generalization DifficultyRadiology Artificial Intelligence article - Generalizability of Machine Learning Models: Quantitative Evaluation of Three Methodological PitfallsPreprint - Quantifying Generalization Complexity for Large Language ModelsInsurers Exclude AIFinancial Times article - Insurers retreat from AI cover as risk of multibillion-dollar claims mountTom's Hardware article - Major insurers move to avoid liability for AI lawsuits as multi-billion dollar risks emerge — Recent public incidents have lead to costly repercussionsInsurance Newsnet article - Insurers Scale Back AI Coverage Amid Fears of Billion-Dollar ClaimsInsurance Business article - Insurance’s gen AI reckoning has comeSection 230Section 230 overviewLegal sidebar - Section 230 Immunity and Generative Artificial IntelligenceBad Internet Bills websiteTechDirt article - Section 230 Faces Repeal. Support The Coverage That’s Been Getting It Right All Along.Privacy Guides video - Dissecting Bad Internet Bills with Taylor Lorenz: KOSA, SCREEN Act, Section 230Journal of Technology in Behavioral Health article - Social Media and Mental Health: Benefits, Risks, and Opportunities for Research and PracticeTime article - Lawmakers Unveil New Bills to Curb Big Tech’s Power and ProfitHouse Hearing transcript - Legislative Solutions to Protect Children and Teens OnlineRelevant Kairos.fm EpisodesInto AI Safety episode - Growing BlueDot's Impact w/ Li-Lian AngmuckrAIkers episode - NeurIPS 2024 Wrapped 🌯Other LinksEncyclopedia of Life websiteIBM Watson AI XPRIZE websiteML Commons websiteWikipedia article

NOW PLAYING

Sobering Up on AI Progress w/ Dr. Sean McGregor

0:00 1:13:41

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

AI – IC之音竹科廣播 FM97.5 IC之音竹科廣播 全球華人的心靈故鄉 Copy That Converts - Entrepreneurs, Copywriting, Launch, Email Marketing, Conversion Megan Wisdom | Copywriter, Email Metrics Mentor, Marketing Strategist Are you a female entrepreneur with an online business who’s struggling to grow and nurture your audience? Do you feel like you’re not making enough sales, despite your best efforts? Do you feel confused by all the marketing jargon and just wish you had a bossy business big sister to shoot it to you straight?Hey, friend. I know you didn’t get into business to get bogged down by writing, but let’s face it, the internet is still powered by WORDS. The good news? You can harness the power of those words to connect with your ideal clients and make more sales through the magic of copywriting.In each episode, we’ll dive deep into the world of copywriting and marketing, sharing insights and strategies that will help you craft compelling messages that resonate with your audience. From understanding your ideal customer to mastering the art of storytelling, we’ll cover it all.I’m Megan Wisdom, a firstborn, Enneagram 5 copywriter who loves to help other female entrepreneurs reach their business fuzz – Swamp Jacuzzi Biggie Boutte An intoxicating wild mind trip through the past, present, and future realms of rock n roll. A euphoric cocktail of spiritual awakening through fuzz and focal points. A new dawn taking the past into the future and the future towards comforts unknown. A yesterday's tomorrow. That time is now. So free your soul and expand your mind. The key to the gates is through this sonic elixir. Administer the medicine, fasten your seatbelts and hold on tight. We have a long journey ahead. But if you want to rock it, you know it's in the pocket. You need Electrophonic Tonic. It could save your soul. Ya dig? The Inner Circle UBS advisor podcasts Step into The Inner Circle, a dynamic and engaging podcast hosted by The Radius Group of UBS. Leveraging their extensive wealth management experience and a diverse network of industry experts, The Inner Circle explores the latest trends while sharing timeless wealth management techniques. Don’t miss out – elevate your financial knowledge by joining The Inner Circle today!
URL copied to clipboard!