The Data Exchange with Ben Lorica cover art

All Episodes

The Data Exchange with Ben Lorica — 353 episodes

#
Title
1

The Hidden Failure Modes of AI Agents

2

The Data Stack Wasn't Built for AI — Here's What Comes Next

3

The SaaSpocalypse Is Coming — But Don't Count Out the Incumbents

4

AI Agents Are Implemented, Not Adopted

5

Why Your AI Agent Isn't Ready to Ship (And How to Know When It Is)

6

Why Foundation Models Haven’t Replaced Classical Machine Learning

7

When "Garbage In, Garbage Out" Gets It Wrong

8

As Code Generation Speeds Up, Who Tests the Output?

9

The Gap Between AI Hype and Enterprise Reality

10

Reading the Tea Leaves: What the World's Top AI Researchers Are Really Working On

11

From Web Video to Real-World Robots

12

Why Your AI Committee Might Be Your Biggest AI Problem

13

Building Mathematical Superintelligence

14

Your First AI Employee Is Already Clocking In

15

Are Multi-Agent Systems More Complex Than They Need to Be?

16

Coding Agents Meet Data Science

17

World Models Are Here—But It’s Still the GPT-2 Phase

18

The Hidden Challenges of Running AI at Scale in Production

19

What No One Tells You About Staying Employable in the AI Era

20

Adaptation: The Missing Layer Between Apps and Foundation Models

21

Securing the "YOLO" Era of AI Agents

22

Building the Open Source Alternative to AWS

23

Breaking the Memory Wall in the Age of Inference

24

Is Waymo Actually Profitable? The Real Cost of the Robotaxi Revolution

25

Beyond Vibe Coding: Building Your Entire Business with AI

26

The Rise of the Machine Identity: Securing the AI Workforce and AI Agents

27

Why Traditional Observability Falls Short for AI Agents

28

Teaching AI How to Forget

29

The Humanoid Hype Cycle: Separating “Shiny Objects” from Real Utility

30

The Junior Data Engineer is Now an AI Agent

31

The Truth About Agents in Production

32

The best books we read this year 📚

33

The Developer’s Guide to LLM Security

34

Is AI a Utility? Defining Usability and Public Trust

35

How to Build AI Copilots That Teach Rather Than Automate

36

The AI Revolution Finally Comes to Structured Data

37

Building the Knowledge Layer Your Agents Need

38

How Language Models Actually Think

39

How AI Is Reshaping Jobs, Budgets, and Data Centers

40

Making Data Engineering Safe for Automation and Agents

41

Is Your Database Ready for an Army of AI Agents?

42

Beyond the Dashboard: Collaborative Analytics in Slack

43

Stop Piloting, Start Shipping: A Playbook for Measurable AI

44

Databases for Machines, Not People

45

When AI Agents Need to Talk: Inside the A2A Protocol

46

The Infrastructure for Production AI

47

How to Make Your Data Truly AI-Ready

48

Beyond the Agent Hype

49

How to Build and Optimize AI Research Agents

50

Why Digital Work is the Perfect Training Ground for AI Agents

51

Beyond the Chatbot: What Actually Works in Enterprise AI

52

Why China's Engineering Culture Gives Them an AI Advantage

53

Predictability Beats Accuracy in Enterprise AI

54

2025 AI Governance Survey

55

The Fenic Approach to Production-Ready Data Processing

56

When AI Eats the Bottom Rung of the Career Ladder

57

From NotebookLM to Audio Companions: Why Google’s AI Team Went Startup

58

The AI-Native Notebook That Thinks Like a Spreadsheet

59

How Agentic AI is Transforming Wall Street

60

The Quantum Advantage Is Real—But Where's the Infrastructure?

61

From Human-Readable to Machine-Usable: The New API Stack

62

Why Voice Security Is Your Next Big Problem

63

Unlocking Unstructured Data with LLMs

64

Building Production-Grade RAG at Scale

65

Unlocking AI Superpowers in Your Terminal

66

From Vibe Coding to Autonomous Agents

67

How a Public-Benefit Startup Plans to Make Open Source the Default for Serious AI

68

The Highly Uncertain Future of OpenAI’s Dominance

69

Beyond Guardrails: Defending LLMs Against Sophisticated Attacks

70

Navigating the Generative AI Maze in Business

71

The Practical Realities of AI Development

72

Beyond the Demo: Building AI Systems That Actually Work

73

Vibe Coding and the Rise of AI Agents: The Future of Software Development is Here

74

2025 Artificial Intelligence Index

75

How AI is Transforming Talent Development

76

Prompts as Functions: The BAML Revolution in AI Engineering

77

Building the Operating System for AI Agents

78

Bridging the AI Agent Prototype-to-Production Chasm

79

The Evolution of Reinforcement Fine-Tuning in AI

80

Beyond GPUs: Cerebras’ Wafer-Scale Engine for Lightning-Fast AI Inference

81

The Future of AI: Regulation, Foundation Models & User Experience

82

The AI Agent Rundown: 10 Things to Know Now

83

Why ‘Structure’ Is All You Need: A Deep Dive into Next-Gen AI Retrieval

84

Why Legal Hurdles Are the Biggest Barrier to AI Adoption

85

Unlocking Spreadsheet Intelligence with AI

86

Monthly Roundup: Deregulation, Hardware, and Inference Scaling

87

What AI Teams Need to Know for 2025

88

AI Unlocked: The Data Bottleneck

89

The Data-Centric Shift in AI: Challenges, Opportunities, and Tools

90

Monthly Roundup: Semiconductors, Frontier Models, and Practical Innovations

91

Breaking the Cloud Barrier: How DBOS Transforms Application Development

92

The Essential Guide to AI Guardrails

93

Beyond ETL: How Snow Leopard Connects AI, Agents, and Live Data

94

2024 Generative AI in Healthcare Survey Results

95

Monthly Roundup: BAML, Tencent’s Hunyuan Model, AI & Kubernetes, and the Future of Voice AI

96

Building the Future of Finance: Inside AI Valuation Bots

97

Unleashing the Power of BAML in LLM Applications

98

Cracking the Code: How Enterprises Are Adopting Generative AI

99

Monthly Roundup: Ray Compiled Graphs, Llama 3.2 and Multimodal AI, and Structured Data for RAG

100

Reimagining Code: The AI-Driven Transformation of Programming and Data Analytics

101

The Security Debate: How Safe is Open-Source Software?

102

Generative AI in Voice Technology

103

Building An Experiment Tracker for Foundation Model Training

104

Monthly Roundup: AI Regulations, GenAI for Analysts, Inference Services, and Military Applications

105

Unlocking the Power of LLMs with Data Prep Kit

106

Advancing AI: Scaling, Data, Agents, Testing, and Ethical Considerations

107

Bridging the Hardware-Software Divide in AI

108

Monthly Roundup: The Economic Realities of Large Language Models

109

From Hype to Reality: The Current State of Enterprise Generative AI Adoption

110

Automating Unstructured Data Extraction with LLMs

111

Generative AI in Context: Hybrid Intelligence and Responsible Development

112

Monthly Roundup: Navigating the Peaks and Valleys of Generative AI Technology

113

From Preparation to Recovery: Mastering AI Incident Response

114

Unlocking the Power of Unstructured Data

115

Postgres: The Swiss Army Knife of Databases

116

Supercharging AI with Graphs

117

Monthly Roundup: SB 1047, GraphRAG, and AI Avatars in the Workplace

118

Fine-tuning and Preference Alignment in a Single Streamlined Process

119

TinyML, Sensor-Driven AI, and Advances in Large Language Models

120

Machine Unlearning: Techniques, Challenges, and Future Directions

121

Unleashing the Power of AI Agents

122

Monthly Roundup: Llama 3, Agents, Evaluation Metrics, Cyc, TikTok, and more

123

LLMs for Data Access: Unlocking Insights with Text-to-SQL

124

2024 Artificial Intelligence Index

125

DBRX and the Future of Open LLMs

126

Monthly Roundup: New LLMs, GTC 2024, Constraint-Driven Innovation, Model Safety, and GraphRAG

127

Automating Software Upgrades: How to Combine AI and Expert Developers

128

Generative AI in the Industrial Sphere

129

The Intersection of LLMs, Knowledge Graphs, and Query Generation

130

Unlocking the Potential of Private Data Collaboration

131

Frontiers of AI: From Text-to-Video Models to Knowledge Graphs

132

Adaptive, Specialized, and Accessible: Where AI Systems Are Heading Next

133

2024 Themes and Trends in AI

134

The AI Infrastructure Revolution: From Cloud Computing to Data Center Design

135

AI in Depth: Transforming Transportation, Enterprise, and Policy

136

Software Meets Hardware: Enabling AMD for Large Language Models

137

Incentives are Superpowers: Mastering Motivation in the AI Era

138

Synthetic Futures: The Convergence of Biology and AI

139

AI Co-Pilots in Action: Transforming Function Calling in Cybersecurity

140

Leveling Up: Tools and Techniques to Make AI Development More Accessible

141

LLMs on CPUs, Period

142

Democratizing Wealth Management With AI

143

Knowledge Graphs: Contextualizing Enterprise Data for More Accurate LLMs

144

TimeGPT: Machine Learning for Time Series, Made Accessible

145

Best Practices for Building LLM-Backed Applications

146

The Evolution of Crypto, Blockchain, and Web3

147

Open Source Data and AI: Past, Present, Future

148

Orchestration for LLM and RAG applications

149

Reflections from the First AI Conference in San Francisco

150

Kùzu: A simple, extremely fast, and embeddable graph database

151

Navigating the Nuances of Retrieval Augmented Generation

152

The Rise of Generative AI-Powered Social Media Manipulation

153

Versioning and MLOps for Generative AI

154

Navigating the Generative AI Landscape

155

Trends in Data Management: From Source to BI and Generative AI

156

AI and the Future of Speech Technologies

157

The Future of Cybersecurity: Generative AI and its Implications

158

Ivy: The One-Stop Interface for AI Model Deployment and Development

159

Navigating the Risk Landscape: A Deep Dive into Generative AI

160

Software Development with AI and LLMs

161

A Lightweight SDK for Integrating AI Models and Plugins

162

Using LLMs to Build AI Co-pilots for Knowledge Workers

163

ETL for LLMs

164

The Future of Graph Databases

165

Delivering Safe and Effective LLM and NLP Applications

166

Using Data and AI to Democratize Entity Resolution and Master Data Management

167

An Open Source Data Framework for LLMs

168

Redefining AI Infrastructure: Deploying and Developing with a Next-Generation Developer Platform

169

The Rise of Custom Foundation Models

170

The Future of Vector Databases and the Rise of Instant Updates

171

LLMs Are the Key to Unlocking the Next Generation of Search

172

Building and Deploying Foundation Models for Enterprises

173

Building Robust AI Infrastructure for Critical Solutions

174

Machine Learning for High-Risk Applications

175

Boosting Perception With Synthetic Data

176

Revolutionizing B2B: Unleashing the Power of AI and Data

177

AI Metadata

178

The 2023 AI Index

179

Custom Foundation Models

180

Uncovering and Highlighting AI Trends

181

How Data and AI Happened

182

Blazing fast bulk data transfers between any cloud

183

Exhaustion of High-Quality Data Could Slow Down AI Progress in Coming Decades

184

Generating high-fidelity and privacy-preserving synthetic data

185

How technology is disrupting the venture capital industry

186

Running Machine Learning Workloads On Any Cloud

187

2023 Trends in Data Engineering and Infrastructure

188

Preparing for the Implementation of the EU AI Act and Other AI Regulations

189

The Open Source Stack Unleashing a Game-Changing AI Hardware Shift

190

Data Science and AI in Context

191

Evaluating Language Models

192

2023 Opportunities and Trends: Data, Machine Learning, and AI

193

Exploring DALL·E 2

194

Data Science at Shopify and Stitch Fix

195

Building a data management system for unstructured data

196

A Cloud Native Vector Database Management System

197

What’s Next for Machine Learning in Time Series

198

Efficient Methods for Natural Language Processing

199

Responsible and Trustworthy AI

200

Building a premier industrial AI research and product group

201

An open source, production grade vector search engine

202

A comprehensive suite of open source tools for time series modeling

203

Building Safe and Reliable AI applications

204

A new storage engine for vectors

205

Project Lightspeed: Next-generation Spark Streaming

206

The Unreasonable Effectiveness of Speech Data

207

Machine Learning Integrity

208

Synthetic data technologies can enable more capable and ethical AI

209

Confidential Computing for Machine Learning

210

Applied NLP Research at Primer

211

Using SQL to Retrieve Data from APIs and Web Services

212

Machine Learning for Time Series Intelligence

213

Unleashing the power of large language models

214

Building production-ready machine learning pipelines

215

Machine Learning at Gong

216

Data Infrastructure for Computer Vision

217

How DALL·E works

218

Scalable, end-to-end machine learning, for everyone

219

Orchestration and Pipelines for Data Scientists

220

Dataframes at scale

221

Software-Defined Assets

222

Adversarial Machine Learning

223

Orchestrating Machine Learning Applications

224

Narrative AI

225

Machine Learning Model Observability

226

Dataflow Automation

227

Practical Machine Learning and Deep learning

228

Machine Learning for Optimization

229

Efficient Scaling of Language Models

230

Data Science at Stitch Fix

231

The 2022 AI Index

232

Why You Need A Time-Series Database

233

Data Science at Shopify

234

An AI Risk Management Framework

235

An open source and end-to-end library for causal inference

236

The Graph Intelligence Stack

237

NLP and Language Models in Healthcare and the Life Sciences

238

Delivering Continuous Intelligence at Scale

239

Imperceptible NLP Attacks

240

Evolving Data Science Training Programs

241

Building Machine Learning Infrastructure at Netflix and beyond

242

Democratizing NLP

243

Machine Learning at Discord

244

Applications of Knowledge Graphs

245

Key AI and Data Trends for 2022

246

Large Language Models

247

Data and Machine Learning Platforms at Shopify

248

What is AI Engineering?

249

NLP and AI in Financial Services

250

Modern Experimentation Platforms

251

Reinforcement Learning in Real-World Applications

252

MLOps Anti-Patterns

253

Why You Need a Modern Metadata Platform

254

Making Large Language Models Smarter

255

AI Begins With Data Quality

256

Modernizing Data Integration

257

Deploying Machine Learning Models Safely and Systematically

258

Large-scale machine learning and AI on multi-modal data

259

Machine Learning in Astronomy and Physics

260

The Unreasonable Effectiveness of Multiple Dispatch

261

How To Lead In Data Science

262

Why interest in graph databases and graph analytics are growing

263

The State of Data Journalism

264

Auditing machine learning models for discrimination, bias, and other risks

265

An oscilloscope for deep learning

266

What’s new in data engineering

267

The evolution of the data science role and of data science tools

268

Data Augmentation in Natural Language Processing

269

Storage Technologies for a Multi-cloud World

270

Building a next-generation dataflow orchestration and automation system

271

Building a flexible, intuitive, and fast forecasting library

272

Neural Models for Tabular Data

273

Training and Sharing Large Language Models

274

Questioning the Efficacy of Neural Recommendation Systems

275

Automation in Data Management and Data Labeling

276

Reinforcement Learning For the Win

277

How Companies Are Investing in AI Risk and Liability Minimization

278

The Future of Machine Learning Lies in Better Abstractions

279

Why You Should Optimize Your Deep Learning Inference Platform

280

AI Beyond Automation

281

Injecting Software Engineering Practices and Rigor into Data Governance

282

Building a data store for unstructured data and deep learning applications

283

How Technology Companies Are Using Ray

284

Data quality is key to great AI products and services

285

Machine Learning in Healthcare

286

Measuring the Impact of AI and Machine Learning Research

287

The Mathematics of Data Integration and Data Quality

288

Pricing Data Products

289

Challenges, Opportunities, and Trends in EdTech

290

Towards Simple, Interpretable, and Trustworthy AI

291

The Rise of Metadata Management Systems

292

Tools for building robust, state-of-the-art machine learning models

293

Creating Master Data at Scale with AI

294

Bringing AI and computing closer to data sources

295

Deep Learning in the Sciences

296

Taking business intelligence and analyst tools to the next level

297

Data exchanges and their applications in healthcare and the life sciences

298

Key AI and Data Trends for 2021

299

A Unified Management Model for Successful Data-Focused Teams

300

Security and privacy for the disoriented

301

The State of Responsible AI

302

Improving the robustness of natural language applications

303

End-to-end deep learning models for speech applications

304

Securing machine learning applications

305

Testing Natural Language Models

306

Detecting Fake News

307

The Computational Limits of Deep Learning

308

Making deep learning accessible

309

Building and deploying knowledge graphs

310

Financial Time Series Forecasting with Deep Learning

311

A programming language for scientific machine learning and differentiable programming

312

Using machine learning to modernize medical triage and monitoring systems

313

Connecting Reinforcement Learning to Simulation Software

314

Using machine learning to detect shifts in government policy

315

What is AI Assurance?

316

Best practices for building conversational AI applications

317

Tools for scaling machine learning

318

From Python beginner to seasoned software engineer

319

Assessing Models and Simulations of Epidemic Infectious Diseases

320

Improving the hiring pipeline for software engineers

321

How to build state-of-the-art chatbots

322

Democratizing machine learning

323

How graph technologies are being used to solve complex business problems

324

Machines for unlocking the deluge of COVID-19 papers, articles, and conversations

325

Designing machine learning models for both consumer and industrial applications

326

Building open source developer tools for language applications

327

Viewing machine learning and data science applications as sociotechnical systems

328

Identifying and mitigating liabilities and risks associated with AI

329

How machine learning is being used in quantitative finance

330

Understanding machine learning model governance

331

Improving performance and scalability of data science libraries

332

Why TinyML will be huge

333

An open source platform for training deep learning models

334

Algorithms that continually invent both problems and solutions

335

Computational Models and Simulations of Epidemic Infectious Diseases

336

Human-in-the-loop machine learning

337

Next-generation simulation software will incorporate deep reinforcement learning

338

Business at the speed of AI: Lessons from Shopify

339

How deep learning is being used in search and information retrieval

340

The responsible development, deployment and operation of machine learning systems

341

Hyperscaling natural language processing

342

What businesses need to know about model explainability

343

Scalable Machine Learning, Scalable Python, For Everyone

344

Computational humanness, analogy and innovation, and soft concepts

345

Building domain specific natural language applications

346

The state of privacy-preserving machine learning

347

Taking messaging and data ingestion systems to the next level

348

Business at the speed of AI: Lessons from Rakuten

349

The combination of the right software and commodity hardware will prove capable of handling most machine learning tasks

350

Key AI and Data Trends for 2020

351

The evolution of TensorFlow and of machine learning infrastructure

352

Building large-scale, real-time computer vision applications

353

Taking stock of foundational tools for analytics and machine learning