PodParley - Discover, Search, and Explore Podcasts

1

How We Built a Per-Plant CO2 Dataset for 4,551 Power Stations Worldwide

Jun 25, 2026

4:58

2

Eliminating Data Latency with Event-Driven Pipelines at Enterprise Scale

Jun 25, 2026

19:44

3

Scaling Self-Service Analytics in Regulated Banking With Metadata-Driven Design

Jun 23, 2026

6:38

4

How to Rotate Proxies Without Breaking Login Sessions

Jun 23, 2026

8:17

5

I Built an Open-Source Firebase Analytics Alternative Because I Hit 1M Events/Day Once Too Many

Jun 20, 2026

10:03

6

Your Redshift Cluster Is Probably Idle 85% of the Time — And You're Paying for All of It

Jun 20, 2026

11:34

7

What the Real Operating Data on AI Agents Tells Me as an Investor

Jun 18, 2026

4:56

8

Building Data Quality Into the Pipeline Instead of Cleaning Up After It

Jun 17, 2026

10:43

9

Why Speed Matters: How Performance in Analytics Saves Business from "Digital Paralysis"

Jun 17, 2026

18:26

10

Open Data Is Not a Product. Here's What It Takes to Make It One.

Jun 12, 2026

8:09

11

Why Scrapers Fail: Headers, Sessions, IP Reputation, and Request Patterns

Jun 11, 2026

13:54

12

I Built an AI-Assisted Data Quality Layer for Operations Dashboards

Jun 3, 2026

11:49

13

The Source Code Isn't Hidden - You Just Gotta Refocus Your Lens

Jun 3, 2026

4:44

14

Why Your Data Governance Framework Is Failing (And What You Can Do About It)

Jun 2, 2026

12:17

15

The Cloud Data Leak: Architecting SQL to Stop Financial Bleeding

Jun 2, 2026

7:28

16

Principal Components Analysis in TypeScript (Part 4): Turning PCA Into Interpretable Factor Analysis

May 30, 2026

5:25

17

Data Engineering Teams Need a Different Version of Agile

May 28, 2026

12:45

18

The LLM Veneer: When AI Sounds Smart but Has Nothing Real to Reason Over

May 27, 2026

6:45

19

Bad Ingestion Architecture Generates Million Dollar Snowflake and Databricks Bills

May 22, 2026

9:57

20

Optimizing Distributed Data Processing for ML at Scale

May 21, 2026

7:03

21

Why Finance Data Quality Needs Rule Engines, Not ML Hype

May 21, 2026

14:41

22

156 Blog Posts To Learn About Business Intelligence

May 20, 2026

37:38

23

Why Your Marketplace Scraper Keeps Getting Blocked (And Why It’s Not a Code Problem)

May 19, 2026

11:03

24

How I Decoded My Apple Watch Metrics: Taking a Look At The Raw Numbers (Part 2)

May 9, 2026

3:39

25

Why AI Agents Are Creating a New Kind of Data Engineer

May 9, 2026

13:43

26

The Architectural Limits of Data Lakes and the Rise of Lakehouses

May 8, 2026

9:03

27

The Economic Case for Investing in Youth Education

May 7, 2026

18:48

28

HiveMQ and TimescaleDB: It Just Works!

May 7, 2026

3:57

29

102 Blog Posts To Learn About Datasets

May 6, 2026

26:26

30

Why More Data Doesn’t Guarantee Better Insights in Modern Data Systems

May 6, 2026

8:42

31

500 Blog Posts To Learn About Data

May 5, 2026

120:33

32

228 Blog Posts To Learn About Data Visualization

May 5, 2026

55:13

33

The Hard Lessons of Managing a Data Science Team

May 4, 2026

12:42

34

95 Blog Posts To Learn About Data Storage

May 4, 2026

22:43

35

70 Blog Posts To Learn About Data Scraping

May 3, 2026

20:07

36

500 Blog Posts To Learn About Data Science

May 3, 2026

130:38

37

110 Blog Posts To Learn About Data Management

May 2, 2026

26:25

38

402 Blog Posts To Learn About Data Analytics

May 1, 2026

95:23

39

50 Blog Posts To Learn About Data Collection

May 1, 2026

12:49

40

427 Blog Posts To Learn About Data Analysis

Apr 30, 2026

104:16

41

Your Dashboard Isn’t Wrong - Your KPI Logic Is

Apr 29, 2026

5:51

42

The Hidden Cost of Scraping Everything (and Why Datasets Win)

Apr 28, 2026

12:26

43

500 Blog Posts To Learn About Big Data

Apr 28, 2026

127:06

44

263 Blog Posts To Learn About Analytics

Apr 27, 2026

70:40

45

They Got Lost in the Transformer, Episode 1: What Even Is an Embedding?

Apr 24, 2026

5:58

46

Kafka vs Azure Event Hubs: The Tradeoffs You Only See in Production

Apr 24, 2026

5:48

47

Clarifying the Difference Between Data Strategy, Analytics, and AI Governance

Feb 6, 2026

7:50

48

The “Store Everything” Cloud Model Is Breaking Under Modern AI Workloads

Feb 6, 2026

10:32

49

AI Belongs Inside DataOps, Not Just at the End of the Pipeline

Feb 5, 2026

5:19

50

Stop Torturing Your Data: How to Automate Rigor With AI

Feb 4, 2026

3:40

51

Minimum Incident Lineage (MIL): A Run-Level Evidence Standard for Reproducible Data Incidents

Feb 4, 2026

8:49

52

5 Ways Spark 4.1 Moves Data Engineering From Manual Pipelines to Intent-Driven Design

Feb 3, 2026

7:17

53

Beyond Prediction: Econometric Data Science for Measuring True Business Impact

Feb 3, 2026

4:34

54

Designing Economic Intelligence: Econometrics-First Approaches in Data Science

Jan 31, 2026

4:28

55

From Forecasting to BI: Inside Shravanthi Ashwin Kumar’s Data-Driven Finance Playbook

Jan 30, 2026

9:06

56

Causal Thinking in the Age of Big Data: Modern Econometrics for Data Scientists

Jan 27, 2026

5:06

57

Data Pipeline Testing: The 3 Levels Most Teams Miss

Jan 27, 2026

7:40

58

HSM: The Original Tiering Engine Behind Mainframes, Cloud, and S3

Jan 25, 2026

59:33

59

Navigating Architectural Trade-offs at Scale to Meet AI Goals in 2026

Jan 23, 2026

6:35

60

Will AI Take Your Job? The Data Tells a Very Different Story

Jan 23, 2026

21:46

61

You Don’t Need an API for Everything (Sometimes Scraping Is Enough)

Jan 22, 2026

2:59

62

How to Use Propensity Score Matching to Measure Down Stream Causal Impact of an Event

Jan 22, 2026

24:50

63

How to Analyze Call Sentiment With Open-Source NLP Libraries

Jan 21, 2026

6:26

64

How Bayesian Tail-Risk Modeling can save your Retail Business Marketing Budget

Jan 20, 2026

19:29

65

Architecting Trustworthy Healthcare Data Platforms Using Declarative Pipelines

Jan 20, 2026

9:05

66

When A/B Tests Aren’t Possible, Causal Inference Can Still Measure Marketing Impact

Jan 14, 2026

7:20

67

Why Data Quality Is Becoming a Core Developer Experience Metric

Jan 13, 2026

7:44

68

Why “Accuracy” Fails for Uplift Models (and What to Use Instead)

Jan 11, 2026

5:18

69

Turning Your Data Swamp into Gold: A Developer’s Guide to NLP on Legacy Logs

Dec 18, 2025

4:30

70

Data Monetization Strategies in Government Digital Platforms

Dec 17, 2025

5:40

71

Why Partner Data Became My Toughest Engineering Problem

Dec 16, 2025

8:43

72

PBIX Is Not Going Away - But PowerBI Will Never Work the Same Again

Dec 16, 2025

9:40

73

Smart Fire Protection: How AI Is Changing Preventive Maintenance Forever

Dec 6, 2025

6:16

74

Why More VARs and SIs Are Embedding Melissa Into Their Enterprise Solutions

Dec 6, 2025

8:14

75

Big Data as the New Compass of Competition

Dec 4, 2025

9:40

76

Srilatha Samala’s Agile Intelligence Approach to Enterprise Reporting as a Strategic Asset

Dec 3, 2025

4:40

77

The Hidden Cost of Bad Data: Why It’s Undermining Your AI Strategy

Dec 3, 2025

18:13

78

Data Platform as a Service: A Three-Pillar Model for Scaling Enterprise Data Systems

Nov 20, 2025

4:22

79

How RAG Improves Database Management

Nov 20, 2025

12:04

80

How To Power AI, Analytics, and Microservices Using the Same Data

Nov 19, 2025

8:51

81

From Data Fragmentation to Billion-Dollar Insights: The Vision of Manish Ravindra Sharath

Oct 30, 2025

7:19

82

Building a Layered Defense Against Web Scraping

Oct 30, 2025

8:43

83

Cosmo: The Graph Visualization Tool Built for Your Terminal

Oct 23, 2025

2:56

84

How Businesses Are Turning Space Data into a Tool for Risk, Resilience, and Sustainability

Oct 15, 2025

6:06

85

How Data Innovation Changed a State’s Infrastructure Engine

Oct 10, 2025

7:44

86

How to Optimize Your Marketing Budget Using Just Three Letters: MMM

Sep 25, 2025

7:26

87

Here's How ShareChat Scaled Their ML Feature Store 1000X Without Scaling the Database

Sep 25, 2025

12:42

88

Why You Shouldn’t Judge by PnL Alone

Sep 24, 2025

13:23

89

From "Decentralized" to "Unified": SUPCON Uses SeaTunnel to Build an Efficient Data Collection Frame

Sep 23, 2025

16:17

90

Enterprise Data Pipeline Revolution: Suresh Palli's Metadata-Driven Automation Success

Sep 19, 2025

7:50

91

Unified Data, Smarter Agents—Is Your Architecture Future-Proof?

Sep 18, 2025

7:51

92

Data-Driven Decisions at Scale: A/B Testing Best Practices for Engineering & Data Science Teams

Sep 18, 2025

5:59

93

Why You Should (Almost) Always Choose Sync Gunicorn Workers

Sep 17, 2025

6:09

94

Beyond the Ten Blue Links: How Generative AI Rewires Our Brains for Search

Sep 16, 2025

7:26

95

Need Web Data? Here Are the 3 Methods Everyone’s Using

Sep 16, 2025

10:09

96

Applying Transitive Closure to Sort Products Into Categories, Considering Nesting and Overlaps

Sep 15, 2025

15:50

97

98% of Data Strategies Fail: Let's Fix It

Aug 2, 2024

11:24

98

How To Measure The Results Of In-App Events When Onelinks Don’t Work

Jul 30, 2024

5:59

99

How AI-Powered Data Mapping is Democratizing Data Management

Jul 27, 2024

8:10

100

Data Engineering: What’s the Value of API Security in the Generative AI Era?

Jul 27, 2024

5:47

101

Say Goodbye to Outdated Diagrams: Automate Your Infrastructure Visualization

Jul 25, 2024

7:15

102

Why C-Suite Executives Won’t Cut it Without Data Skills Anymore

Jul 25, 2024

6:45

103

Meet New & Improved BigQuery: Single, Unified AI-Ready Data Platform

Jul 20, 2024

10:20

104

Decoding Transformers' Superiority over RNNs in NLP Tasks

Jul 19, 2024

9:38

105

How to Enable Auto-Start for Apache DolphinScheduler

Jul 14, 2024

4:18

106

Benchmarking Apache Kafka: Performance-per-price

Jul 13, 2024

13:11

107

When and When Not to Use Apache Kafka as a Database

Jul 9, 2024

9:26

108

A Leader's Guide to Data-Driven Success

Jul 6, 2024

7:35

109

Seamlessly Migrate Your On-Premise Data Pipeline to Azure with These Key Steps

Jul 1, 2024

12:35

110

Data Collection for Product Managers

Jun 29, 2024

7:55

111

Data Collection for Product Managers

Jun 29, 2024

7:55

112

Leveraging Data Granularity, Distribution, and Modeling for Effective Product Management

Jun 28, 2024

11:39

113

How Vectors, Rag and Llama 3 Are Changing First-Party Data

Jun 28, 2024

7:59

114

16 Best Sklearn Datasets for Building Machine Learning Models

Jun 27, 2024

21:22

115

Enhancing Audit Processes With Advanced Analytical Tools

Jun 26, 2024

5:01

116

Go Clean to Be Lean: Data Optimization for Improved Business Efficiency

Jun 22, 2024

11:30

117

Efficient Data Management and Workflow Orchestration with Apache Doris Job Scheduler

Jun 21, 2024

7:26

118

Scaling Ethereum: Data Bloat, Data Availability, and the Cloudless Solution

Jun 13, 2024

17:12

119

What Frontend Devs Want (From Backend Devs)

Jun 11, 2024

5:42

120

How to Build an AI Chatbot with Python and Gemini API

Jun 11, 2024

6:04

121

How to Set Up a Local DNS Server With Python

Jun 9, 2024

4:13

122

The Collective Loves Data: How Big Data Is Shaping and Predicting Our Future

Jun 7, 2024

8:11

123

Apache Doris for Log and Time Series Data Analysis in NetEase: Why Not Elasticsearch and InfluxDB?

Jun 6, 2024

12:01

124

Unlocking the Power of Data Lakes for Embedded Analytics in Multi-Tenant SaaS

Jun 4, 2024

15:16

125

The LinkedIn Nanotargeting Experiment that Broke All the Rules

May 31, 2024

10:50

126

Data Science Interview Question: Creating ROC & Precision Recall Curves From Scratch

May 31, 2024

8:59

127

Why Should Companies Outsource Data Processing?

May 28, 2024

6:05

128

The Role of Big Data in Developing New Medicines

May 28, 2024

6:30

129

Building CI Pipeline with Databricks Asset Bundle and GitLab

May 26, 2024

10:59

130

How I'm Building an AI for Analytics Service

May 24, 2024

7:12

131

Real-Time Anomaly Detection in Underwater Gliders: Experimental Evaluation

May 23, 2024

10:13

132

Real-Time Anomaly Detection in Underwater Gliders: Abstract and Intro

May 23, 2024

7:22

133

The Power of Universal Semantic Layers: Insights from Cube Co-founder Artyom Keydunov

May 22, 2024

9:37

134

A Comprehensive Guide to Building DolphinScheduler 3.2.0 Production-Grade Cluster Deployment

May 18, 2024

9:04

135

Why Monitoring a Distributed Database is More Complex Than You Might Expect

May 18, 2024

20:18

136

Outlier Detection: What You Need to Know

May 11, 2024

2:43

137

Instrument Variables and AB Testing – Part 1

May 10, 2024

1:28

138

Using Arrow Flight SQL Protocol in Apache Doris 2.1 For Super Fast Data Transfer

May 9, 2024

8:00

139

Data Science for Portfolio Optimization: Markowitz Mean-Variance Theory

May 1, 2024

5:47

140

10 Best Datasets for Time Series Analysis

Apr 28, 2024

8:23

All Episodes