DatAInnovators & Builders Podcast - All Episodes

12

How a healthcare startup migrated from Azure to GCP and kept production running

Jo Hjersman has spent over five years at a healthcare XR startup wearing every hat on the data stack, from research data scientist to Head of Data. In this episode, he walks Saket through how he built and scaled a behavioral analytics platform for mental health care, navigated a full cloud migration from Azure to GCP with critical systems kept online throughout, and keeps the whole system running efficiently on a startup budget.The conversation gets into the real mechanics: how intent-driven data design changes what you can extract from immersive environments, how to handle schema evolution without breaking downstream consumers, and why the data scientist and ML engineer roles are converging toward an ownership model rather than a specialization model.Topics discussed:Designing intent-driven data capture in immersive healthcare environmentsBuilding a bronze/silver/gold architecture from scratch at a startupManaging schema evolution across multi-device data sourcesPlanning and executing a full Azure-to-GCP cloud migration with minimal downtimeHandling HIPAA-adjacent compliance through UUID abstraction at the schema levelWhy data scientists and ML engineers are converging toward a full-stack ownership modelWhat bootcamps miss about real-world data engineering at scaleEvaluating infrastructure trade-offs from both a technical and business cost perspective

May 5, 2026

32m

11

The language mistake data leaders make when presenting to executives

Christine Pierce spent over 20 years at Nielsen building and running the data systems behind a $600 billion advertising ecosystem. She now consults on media measurement, AI deployment, and data monetization. Her take on enterprise AI cuts through the noise: most deployments fail not because of the technology, but because of change management and misaligned incentives.Christine outlines how the shift from panels and surveys to transactional and big data fundamentally changed the measurement challenge, why deduplication across devices remains unsolved, and how organizations can spot the real barriers to AI adoption before they invest.Topics Discussed:Shift from survey-based to transactional data in media measurementDeduplication challenges across devices and platformsIndependent vs. platform-owned measurement in programmatic advertisingWhy enterprise AI deployments fail: change management over technologyIndividual productivity vs. enterprise-level AI transformationUsing synthetic data and agentic AI for brand researchFramework for advising companies on data monetizationCommunicating data ROI in business outcomes, not technical outputsAgentic models as the next evolution of programmatic advertising

Apr 21, 2026

50m

10

Context Is the Differentiator, Not the Model

Most organizations have thousands of dashboards and still can't get a simple answer from their data. Francois Lopitaux, SVP of Product Management at ThoughtSpot, argues that the problem was never the data, it was a fundamental misunderstanding of who analytics tools were actually built for. In this episode, Saket and Francois trace the full arc from dashboard factories to agentic BI, and why the shift from self-service analytics to proactive insight delivery is finally within reach.Francois walks through how ThoughtSpot's semantic layer approach, built years before LLMs arrived, is now the foundation for its agentic product Spotter. Rather than using text-to-SQL and accepting hallucination risk, ThoughtSpot translates natural language into search tokens first and then generates deterministic SQL, preserving consistency and giving business users a way to verify every answer. The conversation goes deep on context engineering, how to enrich a semantic model with business rules and memory, and why the LLM is only as good as the context layer surrounding it.Topics discussed:Why dashboards failed business users from the startThoughtSpot's semantic layer and search token approachHow agentic BI differs from conversational analyticsWhy text-to-SQL introduces trust problems at scaleCombining structured, enterprise, and unstructured data sourcesMCP integration for real-time data and automated actionsContext engineering as the new governance layerAutomating semantic model enrichment with AIThe evolution from reactive dashboards to proactive agentsHow data leaders need to rethink their role in an agentic world

Apr 7, 2026

53m

9

15 AI Agents and Nothing to Show for It

What happens when a company runs 15 AI agents across its processes but still cannot measure their impact on the top or bottom line? According to Yorck F. Einhaus, former Global CDO at Liberty Mutual and CDO at Farmers Insurance, that is not an AI problem. It is a data problem, and it is the most common reason enterprise AI programs fail to scale.Yorck shares how he led Farmers Insurance through a migration from on-prem to Snowflake on AWS, using that transition as a forcing function to settle long-standing disputes between actuaries, underwriters, and product teams over how the same data should be defined. He also unpacks his framework for decision intelligence, which he applies at every level of an organization: determine only what information you truly need to make a decision, and treat everything else as noise.Topics discussed:• Why insurance is inherently a data business• Building physical and technological innovation lab environments• Using VR to scale claims adjuster training at Farmers• Aligning AI strategy directly to business strategy outcomes• Data governance as the primary barrier to AI at scale• Migrating to Snowflake to enforce data quality upstream• AI and multimodal data in claims, including AI-generated fraud detection• Shifting from backward-looking claims history to predictive catastrophe modeling• Why lateral career moves accelerate long-term advancement• The evolving CDO role in an AI-first enterprise

Mar 24, 2026

45m

8

The 'no more individual contributors' framework: Managing a team of 3 with AI

Most companies think turning on ChatGPT Enterprise and running a few lunch-and-learns counts as AI transformation. Michael Domanic, VP and Head of Generative AI Business Strategy at UserTesting and OpenAI's System Builder of the Year, has spent two years proving otherwise inside an 800-person org.His starting point is a reframe that changes how the whole program runs: there are no more individual contributors. Everyone is managing a team of three, an assistant and a thought partner with PhD-level expertise available all day, and the real leadership skill now is knowing how to direct that team toward what actually moves the business. From there, Michael gets specific on how UserTesting built the enablement infrastructure to make that mental model stick across every function, how they calculate ROI when half the value is genuinely hard to quantify, and why the companies waiting to see how AI plays out are making the mistake they'll most regret in five years.Topics discussed:Reframing every employee as a manager of a three-person AI teamAnchoring transformation to three business levers to avoid chasing infinite use casesUsing custom GPT hackathons to surface bottom-up adoption across all functionsRunning a 2-dozen-person cross-functional ambassadors program as an internal force multiplierQuarterly top-10 implementation reviews as a before-and-after ROI measurement frameworkWhy functional leadership, not function type, determines adoption speedShifting from model selection to purpose-built tooling as the real enterprise differentiatorWhy transformation requires dedicated leadership and can't be a distributed side projectHonest framing on job displacement: what the data actually supports vs. what is speculation

Mar 10, 2026

49m

7

95% prompt cache hit rate: how LLM cost reduction actually works in production

Most agents fail in production not because the model is bad, but because they forget everything and can't access the right data at the right time. Rowan Trollope, CEO of Redis, has built his entire product strategy around solving exactly those two problems, and in this episode he gets specific about how.From architecting a semantic layer that sits between your enterprise data and the agent's context window, to building memory systems that handle conflicting user preferences and temporal grounding, Rowan lays out the infrastructure decisions that actually determine whether agents make it out of POC. He also shares a clear-eyed take on the AI bubble, why he'd put money on infrastructure over apps right now, and what the dot-com crash taught him that still holds.Topics discussed:Why pointing an MCP server directly at backend databases breaks agent reliabilityRedis Context Engine: CDC pipeline plus pydantic object models as a semantic layer for agentsLanCache: prompt-layer caching hitting 95% cache hit rates and 70% LLM cost reductions in productionAgent Memory Server: using an LLM to extract, vectorize, and resolve conflicting user preferences from raw transcriptsContextual grounding: converting relative time and location references into absolutes before storing memoriesWhy the agentic infrastructure stack has not yet solidified and what that means for enterprise adoption timelines"Provability" as the framework for predicting which job functions agents will automate nextThe shift from specialist roles to "product builders" and what that means for how software teams are structuredHow Redis became the number one data store for agent workloads by market share, and why agents self-select for simpler APIsWhy infrastructure is the safer bet in an AI bubble, drawing on lessons from the dot-com crash

Feb 24, 2026

1h 02m

6

How swarm intelligence solves routing problems in 20 seconds without training data

Fred Gertz completed his PhD in electrical engineering under the inventor of the modern magnetic hard drive, then left academic research to solve a problem that's stumped manufacturers for decades: how to optimize complex operations when you have almost no data. At Collide Technologies, he's applying swarm intelligence to tackle NP-hard scheduling and routing problems that LLMs fail at spectacularly.His approach comes from an unexpected place. While most AI startups chase massive datasets and GPU clusters, Fred turned to ant colonies. These insects solve complex logistics problems without central coordination, training data, or computing power. Their collective behavior cracks the same mathematical challenges that paralyze manufacturing floors: which routes minimize delivery time, how to assign hundreds of workers to shifting tasks, what machine parameters balance throughput against reliability.The methodology borrows from operations research and Taguchi's philosophy, which Fred positions against Six Sigma's dominance. Where Six Sigma optimizes for low variation, Taguchi argued customers deserve the best possible product every single time. That shift in thinking leads to different math: instead of reducing standard deviations, you map how every process parameter mathematically connects to business outcomes like profit or quality. The problem? Operations research textbooks are dense enough to intimidate PhD holders. Collide's swarm algorithms make those techniques accessible to companies running on spreadsheets.Topics discussed:Ant colony optimization combining search functions and route optimization to solve scheduling problems in 20 to 30 secondsOperations research and Taguchi methods versus Six Sigma's statistical process control approach for manufacturing optimizationDelivering ROI with spreadsheet data instead of requiring IoT sensors and six month data collection projectsIQ OQ PQ validation frameworks from pharmaceutical robotics applied to AI model deployment in regulated industriesWhy NP complete problems are better AI targets than tasks humans already perform wellAgent coordination across 500 enterprise agents as swarm intelligence's next application beyond LLM reasoning modelsGenerating structured outputs from API calls without training data or few shot examplesRate limiting and context window management for stateful applications like production planning toolsManufacturing data environments spanning paper maintenance logs to live vibration sensors in the same facilityEvaluating AI without numeric metrics when outputs are text based recommendations rather than classifications

Feb 10, 2026

40m

5

The delegation test for AI: If an intern can't succeed with your context, neither will your model

Marcel Santilli built GrowthX to $13 million ARR without hiring an AE until three months ago. His approach: 170 paid workshops at $500+ each validated exactly what the market needed before writing a line of platform code. The methodology behind it came from his time as Deepgram's CMO, where AI workflows plus human judgment generated 3,000 continuously improving pages and helped 4x revenue in three months. His framework for AI implementation challenges the tools-first mentality plaguing most data teams.Topics DiscussedThree-role framework replacing traditional GTM engineers: process architects, input calibrators, output bar raisersContext engineering as delegation test: if an intern can't succeed with your inputs, neither will AIDeep research architecture: internal knowledge bases combined with external signal processing before draftingWorkshop revenue as product validation: charging $500 to teach eliminates assumption riskCohort analysis through narrative transformation: converting raw session data into plain English before pattern analysisPlanning layer between research and execution: evaluating task requirements against available contextPost-processing with human bar raisers: calibrating system improvements rather than fixing individual outputsForward-deployed delivery model: solving customer problems directly reveals what to automateOutput-first engineering: working backwards from deliverable to determine technical requirements

Jan 27, 2026

43m

4

Ortecha's Stephen Gatchell On The Data Governance Gap That's Blocking Your AI Production Deployment

Companies turn on Microsoft Copilot or Glean, then shut them off a month later after discovering sensitive data exposure across their environment. Stephen Gatchell, Partner and Head of AI Strategy at Ortecha, explains why this pattern keeps repeating and what it takes to actually get enterprise AI into production safely. Stephen breaks down the real blockers: unstructured data at petabyte scale that organizations never cataloged, duplicate files spreading PII across 15 different locations, and retention policies that exist on paper but never get enforced. He worked with the EDM Associates to formally define what a data product actually is, and explains why most companies assign data owners without ever telling them what their responsibilities are. His framework for moving AI from lab to production starts with cross-functional governance committees and ends with treating AI models as measurable assets with clear ROI criteria. Topics discussed: Why companies turn on enterprise copilots then shut them off within weeks The shift from structured to unstructured data as AI's primary governance challenge Shadow AI risks from employees uploading sensitive data to public LLMs Building cross-functional governance committees across security, privacy, and data teams Defining data products with owners, purpose of use, and lifecycle management Using generative AI to automate semantic layer creation and business glossary mapping Moving from large language models to small language models for specific agent tasks The production deployment framework from assessment through continuous monitoring New attack surfaces in RAG pipelines including vector databases and prompt storage Why scanning techniques evolved from metadata reading to actual data classification at scale This conversation was recorded while Stephen was Vice President, Data and AI Strategy at BigID.

Jan 13, 2026

47m

3

Bigpanda's Alexander Page On Building AI Agents That Internalize Corrections

Most AI agent demos still look great but fall apart in production. At BigPanda, Alexander Page's team solved this by building systems that internalize user corrections and improve without requiring source data fixes. The Engineering Director of Applied AI shares with Saket how his team designs production-grade AI agents for IT operations. When a user flags that step seven of a retrieved runbook is outdated, the system internalizes that correction with appropriate weighting and handles conflicts on future retrievals, even when nobody updates the original Confluence page. He argues this capability is becoming a baseline expectation: users accept that AI systems won't be perfect, but they increasingly expect systems to learn when shown the right answer. Page also breaks down multi-agent architecture decisions. When you have 100 tools, giving them all to one agent degrades tool selection. His team isolates decision-making by domain, spinning up specialized sub-agents at runtime based on user intent. For evals, they focus on tool call sequences rather than final outputs, making it easier to pinpoint where agent chains break down. Topics discussed: Internalizing user corrections when source data stays outdated Why correction capability is becoming a baseline user expectation Evaluating agent chains by tool call sequences not outputs Breaking large tool sets into domain-specific agents MCP security tradeoffs and when A2A fits better Runtime decisions on which sub-agents to spin up Maintaining a prototype shelf for future foundation model capabilities Context engineering over expanding context windows

Dec 23, 2025

41m

2

"AI build, Human verify, AI refine”: How CurieTech flipped the IT engineering workflow, with Ashish Thusoo

Most IT teams burn months integrating business systems. Ashish Thusoo's agents at CurieTech AI deliver 70-80% productivity improvements by changing one thing: the loop shifts from human build, human verify, human refine to AI build, human verify, AI refine. That compression happens because machines can now build and refine while humans focus verification energy where it matters. From co-creating Apache Hive at Facebook to General Manager of AI at AWS, Ashish brings 25 years of infrastructure experience to automating IT engineering. CurieTech targets the reality most companies face: not building software products but making CRM, ERP, and financial systems talk to each other across 1,000+ business systems. His method treats production agent development as a data problem first. Build benchmarks, run systematic error analysis across every failure, then decide whether fixes need more context, fine-tuning, or expanded knowledge bases. Topics discussed: Shifting from human build/verify/refine loops to AI build/human verify/AI refine workflow Building benchmarks and eval sets before prototyping agents for production quality Running painstaking error analysis on every agent mistake to classify root causes Choosing between fine-tuning and RAG based on knowledge stability and response speed requirements Creating synthetic datasets with statistical sampling methods for human verification loops Handling multimodal enterprise data quality with task-specific metrics per workflow Hiring engineers based on how they guide AI through problem decomposition Automating version upgrades across 1,000+ business systems with reasoning-capable agents Applying SaaS-era governance patterns to agent proliferation in enterprises Maintaining speed as core entrepreneurial skill when technology shifts monthly not yearly

Dec 9, 2025

55m

1

Databricks' Robin Sutara On Why AI Training Fails - And Persona-Based Enablement That Works

Robin Sutara is Field Chief Data Strategy Officer at Databricks, where she works with organizations facing a common problem: employees sit through AI training, check the box, then nothing changes. The issue isn't awareness. It's that a store manager, plant floor worker, and data scientist need completely different capabilities, but most organizations treat them identically. Robin breaks down why generic AI literacy programs fail to drive behavior change and how to build persona-specific enablement instead. She explains why your data teams need to sit with domain users (like riding in trucks with electric utility line workers) to understand their actual workflows, how to update performance KPIs to reinforce new behaviors, and why organizations should study their pandemic response as a template for AI transformation speed. The conversation covers JetBlue's production agentic systems, JP Morgan's executive-level AI representation, the Databricks AI Security Framework's 62 risk factors for prioritization, and the specific criteria for choosing which use cases to ship when data isn't perfect across your entire estate. Topics discussed: Persona-based enablement replacing one-size-fits-all AI literacy programs Sitting with domain users to translate AI capabilities into changed workflows Updating performance KPIs and organizational processes to reinforce AI behaviors Defining acceptable failure rates and safe experimentation spaces for pilots Applying pandemic-era instant transformation tactics to AI adoption cycles Prioritizing use cases where data foundations are ready while modernizing the rest JetBlue's agentic systems aggregating weather, maintenance, staffing, and customer data JP Morgan Chase's approach to AI representation at executive level Databricks AI Security Framework's 62 risk factors for balancing innovation and controls

Nov 25, 2025

43m