The Data Exchange with Ben Lorica cover art

All Episodes

The Data Exchange with Ben Lorica — 345 episodes

#
Title
1

The Gap Between AI Hype and Enterprise Reality

2

Reading the Tea Leaves: What the World's Top AI Researchers Are Really Working On

3

From Web Video to Real-World Robots

4

Why Your AI Committee Might Be Your Biggest AI Problem

5

Building Mathematical Superintelligence

6

Your First AI Employee Is Already Clocking In

7

Are Multi-Agent Systems More Complex Than They Need to Be?

8

Coding Agents Meet Data Science

9

World Models Are Here—But It’s Still the GPT-2 Phase

10

The Hidden Challenges of Running AI at Scale in Production

11

What No One Tells You About Staying Employable in the AI Era

12

Adaptation: The Missing Layer Between Apps and Foundation Models

13

Securing the "YOLO" Era of AI Agents

14

Building the Open Source Alternative to AWS

15

Breaking the Memory Wall in the Age of Inference

16

Is Waymo Actually Profitable? The Real Cost of the Robotaxi Revolution

17

Beyond Vibe Coding: Building Your Entire Business with AI

18

The Rise of the Machine Identity: Securing the AI Workforce and AI Agents

19

Why Traditional Observability Falls Short for AI Agents

20

Teaching AI How to Forget

21

The Humanoid Hype Cycle: Separating “Shiny Objects” from Real Utility

22

The Junior Data Engineer is Now an AI Agent

23

The Truth About Agents in Production

24

The best books we read this year 📚

25

The Developer’s Guide to LLM Security

26

Is AI a Utility? Defining Usability and Public Trust

27

How to Build AI Copilots That Teach Rather Than Automate

28

The AI Revolution Finally Comes to Structured Data

29

Building the Knowledge Layer Your Agents Need

30

How Language Models Actually Think

31

How AI Is Reshaping Jobs, Budgets, and Data Centers

32

Making Data Engineering Safe for Automation and Agents

33

Is Your Database Ready for an Army of AI Agents?

34

Beyond the Dashboard: Collaborative Analytics in Slack

35

Stop Piloting, Start Shipping: A Playbook for Measurable AI

36

Databases for Machines, Not People

37

When AI Agents Need to Talk: Inside the A2A Protocol

38

The Infrastructure for Production AI

39

How to Make Your Data Truly AI-Ready

40

Beyond the Agent Hype

41

How to Build and Optimize AI Research Agents

42

Why Digital Work is the Perfect Training Ground for AI Agents

43

Beyond the Chatbot: What Actually Works in Enterprise AI

44

Why China's Engineering Culture Gives Them an AI Advantage

45

Predictability Beats Accuracy in Enterprise AI

46

2025 AI Governance Survey

47

The Fenic Approach to Production-Ready Data Processing

48

When AI Eats the Bottom Rung of the Career Ladder

49

From NotebookLM to Audio Companions: Why Google’s AI Team Went Startup

50

The AI-Native Notebook That Thinks Like a Spreadsheet

51

How Agentic AI is Transforming Wall Street

52

The Quantum Advantage Is Real—But Where's the Infrastructure?

53

From Human-Readable to Machine-Usable: The New API Stack

54

Why Voice Security Is Your Next Big Problem

55

Unlocking Unstructured Data with LLMs

56

Building Production-Grade RAG at Scale

57

Unlocking AI Superpowers in Your Terminal

58

From Vibe Coding to Autonomous Agents

59

How a Public-Benefit Startup Plans to Make Open Source the Default for Serious AI

60

The Highly Uncertain Future of OpenAI’s Dominance

61

Beyond Guardrails: Defending LLMs Against Sophisticated Attacks

62

Navigating the Generative AI Maze in Business

63

The Practical Realities of AI Development

64

Beyond the Demo: Building AI Systems That Actually Work

65

Vibe Coding and the Rise of AI Agents: The Future of Software Development is Here

66

2025 Artificial Intelligence Index

67

How AI is Transforming Talent Development

68

Prompts as Functions: The BAML Revolution in AI Engineering

69

Building the Operating System for AI Agents

70

Bridging the AI Agent Prototype-to-Production Chasm

71

The Evolution of Reinforcement Fine-Tuning in AI

72

Beyond GPUs: Cerebras’ Wafer-Scale Engine for Lightning-Fast AI Inference

73

The Future of AI: Regulation, Foundation Models & User Experience

74

The AI Agent Rundown: 10 Things to Know Now

75

Why ‘Structure’ Is All You Need: A Deep Dive into Next-Gen AI Retrieval

76

Why Legal Hurdles Are the Biggest Barrier to AI Adoption

77

Unlocking Spreadsheet Intelligence with AI

78

Monthly Roundup: Deregulation, Hardware, and Inference Scaling

79

What AI Teams Need to Know for 2025

80

AI Unlocked: The Data Bottleneck

81

The Data-Centric Shift in AI: Challenges, Opportunities, and Tools

82

Monthly Roundup: Semiconductors, Frontier Models, and Practical Innovations

83

Breaking the Cloud Barrier: How DBOS Transforms Application Development

84

The Essential Guide to AI Guardrails

85

Beyond ETL: How Snow Leopard Connects AI, Agents, and Live Data

86

2024 Generative AI in Healthcare Survey Results

87

Monthly Roundup: BAML, Tencent’s Hunyuan Model, AI & Kubernetes, and the Future of Voice AI

88

Building the Future of Finance: Inside AI Valuation Bots

89

Unleashing the Power of BAML in LLM Applications

90

Cracking the Code: How Enterprises Are Adopting Generative AI

91

Monthly Roundup: Ray Compiled Graphs, Llama 3.2 and Multimodal AI, and Structured Data for RAG

92

Reimagining Code: The AI-Driven Transformation of Programming and Data Analytics

93

The Security Debate: How Safe is Open-Source Software?

94

Generative AI in Voice Technology

95

Building An Experiment Tracker for Foundation Model Training

96

Monthly Roundup: AI Regulations, GenAI for Analysts, Inference Services, and Military Applications

97

Unlocking the Power of LLMs with Data Prep Kit

98

Advancing AI: Scaling, Data, Agents, Testing, and Ethical Considerations

99

Bridging the Hardware-Software Divide in AI

100

Monthly Roundup: The Economic Realities of Large Language Models

101

From Hype to Reality: The Current State of Enterprise Generative AI Adoption

102

Automating Unstructured Data Extraction with LLMs

103

Generative AI in Context: Hybrid Intelligence and Responsible Development

104

Monthly Roundup: Navigating the Peaks and Valleys of Generative AI Technology

105

From Preparation to Recovery: Mastering AI Incident Response

106

Unlocking the Power of Unstructured Data

107

Postgres: The Swiss Army Knife of Databases

108

Supercharging AI with Graphs

109

Monthly Roundup: SB 1047, GraphRAG, and AI Avatars in the Workplace

110

Fine-tuning and Preference Alignment in a Single Streamlined Process

111

TinyML, Sensor-Driven AI, and Advances in Large Language Models

112

Machine Unlearning: Techniques, Challenges, and Future Directions

113

Unleashing the Power of AI Agents

114

Monthly Roundup: Llama 3, Agents, Evaluation Metrics, Cyc, TikTok, and more

115

LLMs for Data Access: Unlocking Insights with Text-to-SQL

116

2024 Artificial Intelligence Index

117

DBRX and the Future of Open LLMs

118

Monthly Roundup: New LLMs, GTC 2024, Constraint-Driven Innovation, Model Safety, and GraphRAG

119

Automating Software Upgrades: How to Combine AI and Expert Developers

120

Generative AI in the Industrial Sphere

121

The Intersection of LLMs, Knowledge Graphs, and Query Generation

122

Unlocking the Potential of Private Data Collaboration

123

Frontiers of AI: From Text-to-Video Models to Knowledge Graphs

124

Adaptive, Specialized, and Accessible: Where AI Systems Are Heading Next

125

2024 Themes and Trends in AI

126

The AI Infrastructure Revolution: From Cloud Computing to Data Center Design

127

AI in Depth: Transforming Transportation, Enterprise, and Policy

128

Software Meets Hardware: Enabling AMD for Large Language Models

129

Incentives are Superpowers: Mastering Motivation in the AI Era

130

Synthetic Futures: The Convergence of Biology and AI

131

AI Co-Pilots in Action: Transforming Function Calling in Cybersecurity

132

Leveling Up: Tools and Techniques to Make AI Development More Accessible

133

LLMs on CPUs, Period

134

Democratizing Wealth Management With AI

135

Knowledge Graphs: Contextualizing Enterprise Data for More Accurate LLMs

136

TimeGPT: Machine Learning for Time Series, Made Accessible

137

Best Practices for Building LLM-Backed Applications

138

The Evolution of Crypto, Blockchain, and Web3

139

Open Source Data and AI: Past, Present, Future

140

Orchestration for LLM and RAG applications

141

Reflections from the First AI Conference in San Francisco

142

Kùzu: A simple, extremely fast, and embeddable graph database

143

Navigating the Nuances of Retrieval Augmented Generation

144

The Rise of Generative AI-Powered Social Media Manipulation

145

Versioning and MLOps for Generative AI

146

Navigating the Generative AI Landscape

147

Trends in Data Management: From Source to BI and Generative AI

148

AI and the Future of Speech Technologies

149

The Future of Cybersecurity: Generative AI and its Implications

150

Ivy: The One-Stop Interface for AI Model Deployment and Development

151

Navigating the Risk Landscape: A Deep Dive into Generative AI

152

Software Development with AI and LLMs

153

A Lightweight SDK for Integrating AI Models and Plugins

154

Using LLMs to Build AI Co-pilots for Knowledge Workers

155

ETL for LLMs

156

The Future of Graph Databases

157

Delivering Safe and Effective LLM and NLP Applications

158

Using Data and AI to Democratize Entity Resolution and Master Data Management

159

An Open Source Data Framework for LLMs

160

Redefining AI Infrastructure: Deploying and Developing with a Next-Generation Developer Platform

161

The Rise of Custom Foundation Models

162

The Future of Vector Databases and the Rise of Instant Updates

163

LLMs Are the Key to Unlocking the Next Generation of Search

164

Building and Deploying Foundation Models for Enterprises

165

Building Robust AI Infrastructure for Critical Solutions

166

Machine Learning for High-Risk Applications

167

Boosting Perception With Synthetic Data

168

Revolutionizing B2B: Unleashing the Power of AI and Data

169

AI Metadata

170

The 2023 AI Index

171

Custom Foundation Models

172

Uncovering and Highlighting AI Trends

173

How Data and AI Happened

174

Blazing fast bulk data transfers between any cloud

175

Exhaustion of High-Quality Data Could Slow Down AI Progress in Coming Decades

176

Generating high-fidelity and privacy-preserving synthetic data

177

How technology is disrupting the venture capital industry

178

Running Machine Learning Workloads On Any Cloud

179

2023 Trends in Data Engineering and Infrastructure

180

Preparing for the Implementation of the EU AI Act and Other AI Regulations

181

The Open Source Stack Unleashing a Game-Changing AI Hardware Shift

182

Data Science and AI in Context

183

Evaluating Language Models

184

2023 Opportunities and Trends: Data, Machine Learning, and AI

185

Exploring DALL·E 2

186

Data Science at Shopify and Stitch Fix

187

Building a data management system for unstructured data

188

A Cloud Native Vector Database Management System

189

What’s Next for Machine Learning in Time Series

190

Efficient Methods for Natural Language Processing

191

Responsible and Trustworthy AI

192

Building a premier industrial AI research and product group

193

An open source, production grade vector search engine

194

A comprehensive suite of open source tools for time series modeling

195

Building Safe and Reliable AI applications

196

A new storage engine for vectors

197

Project Lightspeed: Next-generation Spark Streaming

198

The Unreasonable Effectiveness of Speech Data

199

Machine Learning Integrity

200

Synthetic data technologies can enable more capable and ethical AI

201

Confidential Computing for Machine Learning

202

Applied NLP Research at Primer

203

Using SQL to Retrieve Data from APIs and Web Services

204

Machine Learning for Time Series Intelligence

205

Unleashing the power of large language models

206

Building production-ready machine learning pipelines

207

Machine Learning at Gong

208

Data Infrastructure for Computer Vision

209

How DALL·E works

210

Scalable, end-to-end machine learning, for everyone

211

Orchestration and Pipelines for Data Scientists

212

Dataframes at scale

213

Software-Defined Assets

214

Adversarial Machine Learning

215

Orchestrating Machine Learning Applications

216

Narrative AI

217

Machine Learning Model Observability

218

Dataflow Automation

219

Practical Machine Learning and Deep learning

220

Machine Learning for Optimization

221

Efficient Scaling of Language Models

222

Data Science at Stitch Fix

223

The 2022 AI Index

224

Why You Need A Time-Series Database

225

Data Science at Shopify

226

An AI Risk Management Framework

227

An open source and end-to-end library for causal inference

228

The Graph Intelligence Stack

229

NLP and Language Models in Healthcare and the Life Sciences

230

Delivering Continuous Intelligence at Scale

231

Imperceptible NLP Attacks

232

Evolving Data Science Training Programs

233

Building Machine Learning Infrastructure at Netflix and beyond

234

Democratizing NLP

235

Machine Learning at Discord

236

Applications of Knowledge Graphs

237

Key AI and Data Trends for 2022

238

Large Language Models

239

Data and Machine Learning Platforms at Shopify

240

What is AI Engineering?

241

NLP and AI in Financial Services

242

Modern Experimentation Platforms

243

Reinforcement Learning in Real-World Applications

244

MLOps Anti-Patterns

245

Why You Need a Modern Metadata Platform

246

Making Large Language Models Smarter

247

AI Begins With Data Quality

248

Modernizing Data Integration

249

Deploying Machine Learning Models Safely and Systematically

250

Large-scale machine learning and AI on multi-modal data

251

Machine Learning in Astronomy and Physics

252

The Unreasonable Effectiveness of Multiple Dispatch

253

How To Lead In Data Science

254

Why interest in graph databases and graph analytics are growing

255

The State of Data Journalism

256

Auditing machine learning models for discrimination, bias, and other risks

257

An oscilloscope for deep learning

258

What’s new in data engineering

259

The evolution of the data science role and of data science tools

260

Data Augmentation in Natural Language Processing

261

Storage Technologies for a Multi-cloud World

262

Building a next-generation dataflow orchestration and automation system

263

Building a flexible, intuitive, and fast forecasting library

264

Neural Models for Tabular Data

265

Training and Sharing Large Language Models

266

Questioning the Efficacy of Neural Recommendation Systems

267

Automation in Data Management and Data Labeling

268

Reinforcement Learning For the Win

269

How Companies Are Investing in AI Risk and Liability Minimization

270

The Future of Machine Learning Lies in Better Abstractions

271

Why You Should Optimize Your Deep Learning Inference Platform

272

AI Beyond Automation

273

Injecting Software Engineering Practices and Rigor into Data Governance

274

Building a data store for unstructured data and deep learning applications

275

How Technology Companies Are Using Ray

276

Data quality is key to great AI products and services

277

Machine Learning in Healthcare

278

Measuring the Impact of AI and Machine Learning Research

279

The Mathematics of Data Integration and Data Quality

280

Pricing Data Products

281

Challenges, Opportunities, and Trends in EdTech

282

Towards Simple, Interpretable, and Trustworthy AI

283

The Rise of Metadata Management Systems

284

Tools for building robust, state-of-the-art machine learning models

285

Creating Master Data at Scale with AI

286

Bringing AI and computing closer to data sources

287

Deep Learning in the Sciences

288

Taking business intelligence and analyst tools to the next level

289

Data exchanges and their applications in healthcare and the life sciences

290

Key AI and Data Trends for 2021

291

A Unified Management Model for Successful Data-Focused Teams

292

Security and privacy for the disoriented

293

The State of Responsible AI

294

Improving the robustness of natural language applications

295

End-to-end deep learning models for speech applications

296

Securing machine learning applications

297

Testing Natural Language Models

298

Detecting Fake News

299

The Computational Limits of Deep Learning

300

Making deep learning accessible

301

Building and deploying knowledge graphs

302

Financial Time Series Forecasting with Deep Learning

303

A programming language for scientific machine learning and differentiable programming

304

Using machine learning to modernize medical triage and monitoring systems

305

Connecting Reinforcement Learning to Simulation Software

306

Using machine learning to detect shifts in government policy

307

What is AI Assurance?

308

Best practices for building conversational AI applications

309

Tools for scaling machine learning

310

From Python beginner to seasoned software engineer

311

Assessing Models and Simulations of Epidemic Infectious Diseases

312

Improving the hiring pipeline for software engineers

313

How to build state-of-the-art chatbots

314

Democratizing machine learning

315

How graph technologies are being used to solve complex business problems

316

Machines for unlocking the deluge of COVID-19 papers, articles, and conversations

317

Designing machine learning models for both consumer and industrial applications

318

Building open source developer tools for language applications

319

Viewing machine learning and data science applications as sociotechnical systems

320

Identifying and mitigating liabilities and risks associated with AI

321

How machine learning is being used in quantitative finance

322

Understanding machine learning model governance

323

Improving performance and scalability of data science libraries

324

Why TinyML will be huge

325

An open source platform for training deep learning models

326

Algorithms that continually invent both problems and solutions

327

Computational Models and Simulations of Epidemic Infectious Diseases

328

Human-in-the-loop machine learning

329

Next-generation simulation software will incorporate deep reinforcement learning

330

Business at the speed of AI: Lessons from Shopify

331

How deep learning is being used in search and information retrieval

332

The responsible development, deployment and operation of machine learning systems

333

Hyperscaling natural language processing

334

What businesses need to know about model explainability

335

Scalable Machine Learning, Scalable Python, For Everyone

336

Computational humanness, analogy and innovation, and soft concepts

337

Building domain specific natural language applications

338

The state of privacy-preserving machine learning

339

Taking messaging and data ingestion systems to the next level

340

Business at the speed of AI: Lessons from Rakuten

341

The combination of the right software and commodity hardware will prove capable of handling most machine learning tasks

342

Key AI and Data Trends for 2020

343

The evolution of TensorFlow and of machine learning infrastructure

344

Building large-scale, real-time computer vision applications

345

Taking stock of foundational tools for analytics and machine learning