Agent Mode AI

PODCAST · technology

Agent Mode AI

The audio companion to agentmodeai.com. Two analysts pick one claim from the Holding-up ledger per episode, walk the evidence, and give the current verdict: Holding, Partial, or Not holding. For CIOs, IT directors, and senior implementers. 15-20 min, every Sunday.

  1. 13

    Why IT operations is the highest-exposure agentic-AI workforce population

    Episode 13 of Agent Mode AI. Abby and Avery walk AM-012, the claim that the enterprise IT operations workforce is structurally the highest-exposure population to autonomous-action AI. The task surface that defines the family — incident triage, configuration management, ticket processing, routine diagnostics, scripted remediation — maps onto the agent-class capability boundary more directly than any other large enterprise job-family. The US Bureau of Labor Statistics Occupational Outlook Handbook projects Computer and Information Technology occupations to grow faster than average through 2033 with substantial role-mix shifts inside the family. The World Economic Forum Future of Jobs Report 2025 reads the same job-family bimodally: routine-task sub-population in the displacement cohort, AI-adjacent sub-population in the creation cohort, 2030 horizon. Anthropic chief executive officer Dario Amodei's twenty-eighth of May 2025 Axios interview projected that AI could eliminate half of entry-level white-collar jobs over the next one to five years. The procurement-deck distinction is between the agent-orchestration posture (team scales toward managing fleets of agents) and the agent-replacement posture (team contracts through churn). Sources cited: - US Bureau of Labor Statistics Occupational Outlook Handbook, Computer and Information Technology occupations, 2023-2033 cycle - World Economic Forum Future of Jobs Report 2025 - Dario Amodei interview with Axios, 28 May 2025 - McKinsey "Seizing the agentic AI advantage" workforce findings - Atlanta Federal Reserve Workforce Currents data on AI-skill wage premium Claims tracked: - AM-012 — IT operations as highest-exposure workforce population — agentmodeai.com/holding/?claim=AM-012 - AM-006 — Atlanta Fed wage-premium and BCG frontline access gap — agentmodeai.com/holding/?claim=AM-006 - AM-010 — Chief information officer playbook five operational characteristics — agentmodeai.com/holding/?claim=AM-010 - AM-011 — Change-management variable in deployment success — agentmodeai.com/holding/?claim=AM-011 Newsletter and the full Holding-up ledger: agentmodeai.com

  2. 12

    What the Anthropic Claude for Chrome disclosure tells procurement

    Episode 12 of Agent Mode AI. Abby and Avery walk AM-009, the claim that Anthropic's Claude for Chrome launch is a procurement-decision data point about the maturity of the browser-resident agentic AI class rather than about Anthropic specifically. The published security disclosure on the launch reports a twenty-three point six percent prompt-injection success rate pre-mitigation, eleven point two percent post-mitigation, and zero percent on URL-injection variants after subsequent patches, against a defined attack corpus. The procurement-relevant signal is the published-disclosure posture itself, which places Anthropic in Cohort A under the AM-007 vendor-response-split framework. Brave Software's adjacent research on Comet confirms the prompt-injection class is structural to browser-resident agents rather than Anthropic-specific. The episode concludes with five questions a chief information officer and chief information security officer can require answered in writing before authorising browser-agent pilots. Sources cited: - Anthropic Claude for Chrome announcement, 26 August 2025 - Anthropic published security disclosure on Claude for Chrome - Brave Software research on Comet prompt injection - Simon Willison agentic-browser-security commentary, 25 August 2025 - Zenity Labs AgentFlayer research, Black Hat USA 2025 - EchoLeak CVE-2025-32711, disclosed August 2025 Claims tracked: - AM-009 — Claude for Chrome procurement-grade disclosure pattern — agentmodeai.com/holding/?claim=AM-009 - AM-007 — AgentFlayer cross-agent prompt-injection class vendor-response split — agentmodeai.com/holding/?claim=AM-007 - AM-146 — Three accuracy-disclosure questions for procurement — agentmodeai.com/holding/?claim=AM-146 Newsletter and the full Holding-up ledger: agentmodeai.com

  3. 11

    The seven AI vendor exit clauses that decide whether you can leave

    Episode 11 of Agent Mode AI. Abby and Avery walk AM-145, the claim that AI vendor switching in 2026 is bound primarily by contract terms — exit clauses, data-portability obligations, model-deprecation rights — not by technical migration cost. Three forces drive the 2026 procurement story: vendor consolidation (ServiceNow completing the Moveworks acquisition in December 2025, Automation Anywhere closing the Aisera acquisition on the eleventh of November 2025), model-deprecation cadence becoming a recurring contract event, and the first wave of multi-year enterprise agentic AI contracts approaching renewal. Seven clause families repeatedly create the lock-in most enterprises only discover at year two: data-portability scope, model-deprecation rights, sub-processor expansion, output-IP ambiguity, pricing-tier rebalancing, agent-uptime SLA definition gaps, and audit-evidence retention obligations. Article 16 of the EU AI Act applies to deployers from the second of August 2026 with a six-month log retention floor. Sources cited: - ServiceNow announcement on completion of Moveworks acquisition - Automation Anywhere announcement on completion of Aisera acquisition - OpenAI deprecation page - Anthropic model lifecycle policy - Google Vertex AI model versioning page - Microsoft Azure OpenAI model retirement policy - Microsoft Customer Copyright Commitment - Bloomberg report on Klarna, 8 May 2025 - EU AI Act Articles 12 and 16 Claims tracked: - AM-145 — Seven AI vendor exit clauses — agentmodeai.com/holding/?claim=AM-145 - AM-027 — Vendor contract gotchas — agentmodeai.com/holding/?claim=AM-027 - AM-046 — EU AI Act Article 12 audit-evidence template — agentmodeai.com/holding/?claim=AM-046 - RES-005 — AI MSA Red-Team Checklist Newsletter and the full Holding-up ledger: agentmodeai.com

  4. 10

    What vendor "successful pilot" references do not tell procurement

    Episode 10 of Agent Mode AI. Abby and Avery walk AM-140, the claim that vendor "successful pilot" references transfer to scaled production at roughly the McKinsey twenty-three percent rate, and that the gap is operational rather than capability-driven. The McKinsey State of AI 2025 survey, published November 2025 with sample size one thousand four hundred ninety-one, is the anchor data point. The Klarna seven-hundred-agent reversal reported by Bloomberg on the eighth of May 2025, the Salesforce Agentforce two-hundred-customer reality through Q1 2026, and the GitHub Copilot token-counting bug acknowledged in April 2026 are the documented walk-backs that bound what reference language can credibly imply. CRMArena-Pro thirty-five percent multi-step reliability and the EchoLeak CVE cross-agent class are the structural failure-mode evidence. Six pre-pilot questions for the procurement committee close the gap. Sources cited: - McKinsey State of AI 2025, published November 2025, n=1,491 - Bloomberg report on Klarna, 8 May 2025 - The Information report on Salesforce Agentforce, April 2025 - GitHub Copilot changelog, 18 April 2026 - CRMArena-Pro paper, Salesforce AI Research, August 2025 - Carnegie Mellon TheAgentCompany academic benchmark - EchoLeak CVE-2025-32711, disclosed August 2025 Claims tracked: - AM-140 — Vendor pilot reference to procuring-enterprise scaled production transfer rate — agentmodeai.com/holding/?claim=AM-140 - AM-030 — McKinsey 23% from IT-leader perspective — agentmodeai.com/holding/?claim=AM-030 - AM-128 — MIT 95% pilot-failure claim — agentmodeai.com/holding/?claim=AM-128 Newsletter and the full Holding-up ledger: agentmodeai.com

  5. 9

    The three questions every CIO should ask about a vendor accuracy claim

    Episode 9 of Agent Mode AI. Abby and Avery walk AM-146, the claim that vendor "ready-to-run" positioning without named task, named baseline, and named methodology is procurement-deck noise rather than procurement evidence. The procurement-grade reference shapes in 2026 are the academic-benchmark layer (CRMArena-Pro 35% multi-step reliability, CMU TheAgentCompany 30-35% reproduction range, WebArena ~36% browser-agent ceiling, SWE-bench Verified for code generation) and the Anthropic Claude for Chrome disclosure pattern (23.6% pre-mitigation, 11.2% post, 0% on URL-injection variants after patches). A third class — the named-customer audited deployment, with McKinsey Lilli, JPMorgan, BT Now Assist, and UK Government Digital Service as the canonical references — sits alongside. Sources cited: - CRMArena-Pro paper, Salesforce AI Research, August 2025 - Carnegie Mellon TheAgentCompany academic benchmark - WebArena academic benchmark - SWE-bench Verified - Anthropic published security disclosure on Claude for Chrome, 26 August 2025 - McKinsey internal Lilli platform deployment data - JPMorgan Chase 2023 AI value disclosure - BT Now Assist deployment, Hena Jalil - UK Government Digital Service Q4 2024 Claims tracked: - AM-146 — Three accuracy-disclosure questions for procurement — agentmodeai.com/holding/?claim=AM-146 - AM-009 — Claude for Chrome procurement-grade disclosure pattern — agentmodeai.com/holding/?claim=AM-009 - AM-140 — Procurement-committee pre-pilot questions — agentmodeai.com/holding/?claim=AM-140 Newsletter and the full Holding-up ledger: agentmodeai.com

  6. 8

    Which LLM provider actually stays up?

    Episode 8 of Agent Mode AI. Abby and Avery walk AM-136, the foundation- model uptime track-record claim. Twenty-four-month operational record across Anthropic, OpenAI, Google, AWS Bedrock, and Azure OpenAI. Why the status pages do not tell the procurement team what they need to know. The SLA-credit gap that almost every enterprise customer discovers only after their first incident. Three multi-provider routing patterns. Three contract language additions every 2026 enterprise renewal MSA should require. Sources cited: - status.anthropic.com (Anthropic incident reports) - status.openai.com (OpenAI incident reports) - status.cloud.google.com (Google Cloud / Gemini API) - health.aws.amazon.com (AWS Service Health Dashboard) - status.azure.com (Azure / Azure OpenAI) - LiteLLM, OpenRouter, Portkey project documentation Claims tracked: - AM-136 — Foundation-model uptime SLA track record — agentmodeai.com/holding/?claim=AM-136 Related claims: - AM-026 — The 60-question agentic AI RFP - AM-138 — Vendor MSA renewal post-EU-AI-Act-enforcement - AM-123 — Agent observability stack Newsletter and the full Holding-up ledger: agentmodeai.com

  7. 7

    Why 88% of agentic AI deployments fail

    Episode 7 of Agent Mode AI. Abby and Avery walk the four datasets that document the bimodal ROI distribution in enterprise agentic AI: AM-029 on the Stanford Digital Economy Lab 12/88, AM-132 on the restored bimodal framing, AM-128 on the MIT NANDA GenAI Divide 95% finding, and AM-053 on the McKinsey State of AI 2025 17% EBIT-attribution. Then the GAUGE framework — six dimensions that distinguish the 12% high-performing cohort from the 88% struggling body. Governance, audit substrate, use-case maturity, guardrails, evidence baseline, exit posture. Sources cited: - Stanford Digital Economy Lab Enterprise AI Playbook 2026 (Pereira, Graylin, Brynjolfsson) - McKinsey State of AI 2025 (n=1,993, November 2025) - MIT NANDA State of AI in Business 2025 (Project NANDA / MIT Connection Science, August 2025) - Gartner Q1 2026 Infrastructure & Operations Survey - Fortune coverage of MIT NANDA findings, August 2025 Claims tracked: - AM-029 — Why 88% of agentic AI deployments fail — agentmodeai.com/holding/?claim=AM-029 - AM-132 — The bimodal ROI distribution in enterprise agentic AI — agentmodeai.com/holding/?claim=AM-132 - AM-128 — The MIT 95% GenAI-pilot-failure claim — agentmodeai.com/holding/?claim=AM-128 - AM-053 — The McKinsey 17% EBIT claim — agentmodeai.com/holding/?claim=AM-053 Newsletter and the full Holding-up ledger: agentmodeai.com

  8. 6

    What changes for enterprise AI on 2 August 2026

    Episode 6 of Agent Mode AI. Abby and Avery walk the four claims that interlock around the 2 August 2026 EU AI Act deployer-obligations enforcement window. AM-127 is the falsifiable prediction (three of four operational claims will downgrade by 1 October 2026). AM-135 is the Article 50 transparency UX with chatbot disclosure, generative AI watermarking, biometric notification, and deepfake disclosure. AM-138 is the post-enforcement MSA red-team — three new clause families covering Article 11 technical-file pass-through, Article 16 post-market monitoring support, and Article 26 deployer-documentation supply. AM-046 is the fourteen-field Article 12 audit substrate that the other three obligations feed into. Sources cited: - EU AI Act consolidated text (Regulation 2024/1689) - EU AI Office implementing guidance, 2026 - C2PA Content Credentials specification - Google SynthID documentation - Adobe Content Authenticity Initiative - European Data Protection Board opinion on AI models, 2024 - National supervisory authority enforcement decisions, Q1-Q2 2026 Claims tracked: - AM-127 — Deadline-anchored prediction on operational vs governance claims — agentmodeai.com/holding/?claim=AM-127 - AM-135 — EU AI Act Article 50 transparency disclosure UX — agentmodeai.com/holding/?claim=AM-135 - AM-138 — Vendor MSA renewal post-EU-AI-Act-enforcement — agentmodeai.com/holding/?claim=AM-138 - AM-046 — EU AI Act Article 12 audit-evidence template — agentmodeai.com/holding/?claim=AM-046 Newsletter and the full Holding-up ledger: agentmodeai.com

  9. 5

    What the money is doing

    Episode 5 of Agent Mode AI. Abby and Avery walk through AM-118, AM-119, and the vendor-economics section of AM-121 — three institutional capital cohorts reading the same AI risk surface from three different angles, behaving in three different directions, in 2026. Sources cited: - NBIM (Norges Bank Investment Management) governance documentation, 2015-2026 - CalPERS investment policy publications - Lloyd's of London Futureset systemic-risk programme - Munich Re Cyber Insurance Risk Report 2024-2026 - Swiss Re Institute sigma research 2025 - ServiceNow Q1 FY26 10-Q (SEC EDGAR filing) - Datadog Q3 FY25 10-Q (AI-native cohort disclosure) Claims tracked: - AM-118 — The AI policy void at major pension funds — agentmodeai.com/holding/?claim=AM-118 - AM-119 — Reinsurance market AI tail risk pricing — agentmodeai.com/holding/?claim=AM-119 - AM-121 — AI in IT operations reality check — agentmodeai.com/holding/?claim=AM-121 Newsletter and the full Holding-up ledger: agentmodeai.com

  10. 4

    What AI doesn't know about your business

    Episode 4 of Agent Mode AI. The operators edition. Abby and Avery walk through OPS-037, OPS-039, and OPS-041 — three places where SMB AI tools generate visible work that misses the regulatory or algorithmic layer the tool was unaware of. Sources cited: - EU OSS scheme + reverse-charge VAT rules (Belastingdienst, HMRC, BZSt) - Burgerlijk Wetboek Boek 3 Article 89, Netherlands (notarial form) - Bürgerliches Gesetzbuch §311b, Federal Republic of Germany - GmbHG §15, Germany (share-transfer notarial form) - KNB (Koninklijke Notariële Beroepsorganisatie) 2024-2026 guidance - BNotK (Bundesnotarkammer) 2024-2026 guidance - Google Search Central blog, March 2024 spam policy update - LinkedIn Engineering blog, feed-distribution criteria 2025-2026 - Etsy seller handbook, listing-policy enforcement updates 2024-2026 Claims tracked: - OPS-037 — AI-drafted invoices and the EU VAT audit failure mode — agentmodeai.com/holding/?claim=OPS-037 - OPS-039 — AI-drafted contracts and the notary requirement in EU — agentmodeai.com/holding/?claim=OPS-039 - OPS-041 — Platform algorithm penalties on AI-generated content — agentmodeai.com/holding/?claim=OPS-041 Newsletter and the full Holding-up ledger: agentmodeai.com

  11. 3

    Whose consent do you need to deploy AI?

    Episode 3 of Agent Mode AI. Abby and Avery walk through AM-120 and OPS-038: the same continental European labour-relations surface that governs AI deployment, observed at the multinational scale and at the four-employee SMB scale. Sources cited: - Betriebsverfassungsgesetz (BetrVG) §87, Federal Republic of Germany - Wet op de ondernemingsraden (WOR) Article 27, Netherlands - Code du travail Comité Social et Économique provisions, France - FNV, DGB, IG Metall, CFDT public statements 2024-2026 - Dutch CAO database (NL Ministry of Social Affairs) - German BMAS Tarifvertrag list (federally declared) - Avv-declaration and Allgemeinverbindlich-declaration provisions Claims tracked: - AM-120 — Works councils and EU AI agent deployment — agentmodeai.com/holding/?claim=AM-120 - OPS-038 — CAO/Tarifvertrag AI-VA trap at SMB scale — agentmodeai.com/holding/?claim=OPS-038 Newsletter and the full Holding-up ledger: agentmodeai.com

  12. 2

    What you are actually buying when you buy AI

    Episode 2 of Agent Mode AI. Abby and Avery walk through AM-117 and AM-121: the AI Bill of Materials as the new EU AI Act Article 11 procurement artefact, and the auditability-versus-lock-in axis that defines platform-incumbent AI procurement in 2026. Sources cited: - EU AI Act Article 11, Regulation (EU) 2024/1689 - CycloneDX ML-BOM specification (OWASP Foundation) - SPDX 3.0 AI extensions (Linux Foundation) - ServiceNow Q1 FY26 10-Q (SEC EDGAR filing) - Nenshad Bardoliwalla, ServiceNow Group VP for AI Products, on the record - Charles Betz, Forrester principal analyst, ITSM/AIOps - IDC ITSM market share data, Q1 2026 Claims tracked: - AM-117 — AI Bill of Materials supply-chain disclosure — agentmodeai.com/holding/?claim=AM-117 - AM-121 — AI in IT operations reality check — agentmodeai.com/holding/?claim=AM-121 Newsletter and the full Holding-up ledger: agentmodeai.com

  13. 1

    The CIO's new liability

    Episode 1 of Agent Mode AI. Abby and Matt walk through AM-116 and AM-119: directors' and officers' insurance under the Caremark line, and the reinsurance market repricing cyber treaties for 2026. Sources cited: - In re Caremark International Inc. Derivative Litigation, 698 A.2d 959 (Del. Ch. 1996) - Marchand v. Barnhill, 212 A.3d 805 (Del. 2019) - Marsh quarterly D&O market reports, 2025-2026 - Aon commercial insurance updates, 2026 - Willis Towers Watson D&O commentary, 2026 - Lloyd's of London Futureset systemic-risk programme - Munich Re Cyber Insurance Risk Report, 2024-2026 - Swiss Re Institute sigma research, 2025 - EU AI Act Article 9, Regulation (EU) 2024/1689 Claims tracked: - AM-116 — D&O insurance and the AI-supervision claim — agentmodeai.com/holding/?claim=AM-116 - AM-119 — Reinsurance market AI tail risk pricing — agentmodeai.com/holding/?claim=AM-119 Newsletter and the full Holding-up ledger: agentmodeai.com

Type above to search every episode's transcript for a word or phrase. Matches are scoped to this podcast.

Searching…

No matches for "" in this podcast's transcripts.

Showing of matches

No topics indexed yet for this podcast.

Loading reviews...

ABOUT THIS SHOW

The audio companion to agentmodeai.com. Two analysts pick one claim from the Holding-up ledger per episode, walk the evidence, and give the current verdict: Holding, Partial, or Not holding. For CIOs, IT directors, and senior implementers. 15-20 min, every Sunday.

HOSTED BY

Agent Mode AI

Produced by Peter Walda

CATEGORIES

URL copied to clipboard!