1

Mateusz Gienieczko | AnyBlox: A Framework for Self-Decoding Datasets | #69

Mar 17, 2026

62:28

2

Xiangyao Yu | Disaggregation: A New Architecture for Cloud Databases | #68

Nov 27, 2025

42:12

3

Navid Eslami | Diva: Dynamic Range Filter for Var-Length Keys and Queries | #67

Nov 13, 2025

46:50

4

Adaptive Factorization in DuckDB with Paul Groß

Nov 6, 2025

51:15

5

Parachute: Rethinking Query Execution and Bidirectional Information Flow in DuckDB - with Mihail Stoian

Oct 30, 2025

36:34

6

Anarchy in the Database: Abigale Kim on DuckDB and DBMS Extensibility

Oct 23, 2025

46:24

7

Recursive CTEs, Trampolines, and Teaching Databases with DuckDB - with Prof. Torsten Grust

Oct 16, 2025

51:05

8

DuckDB in Research S2 Coming Soon!

Oct 16, 2025

2:06

9

Rohan Padhye & Ao Li | Fray: An Efficient General-Purpose Concurrency JVM Testing Platform | #66

Oct 6, 2025

58:45

10

Shrey Tiwari | It's About Time: A Study of Date and Time Bugs in Python Software | #65

Sep 23, 2025

65:29

11

Lessons Learned from Five Years of Artifact Evaluations at EuroSys | #64

Jul 30, 2025

43:48

12

Dominik Winterer | Validating SMT Solvers for Correctness and Performance via Grammar-based Enumeration | #63

Jul 25, 2025

43:38

13

Haralampos Gavriilidis | Fast and Scalable Data Transfer across Data Systems | #62

Jun 16, 2025

56:46

14

Haralampos Gavriilidis | SheetReader: Efficient spreadsheet parsing

Apr 17, 2025

40:53

15

Arjen P. de Vries | faiss: An extension for vector data & search

Apr 10, 2025

46:14

16

David Justen | POLAR: Adaptive and non-invasive join order selection via plans of least resistance

Apr 3, 2025

51:08

17

Daniël ten Wolde | DuckPGQ: A graph extension supporting SQL/PGQ

Mar 20, 2025

48:38

18

Till Döhmen | DuckDQ: A Python library for data quality checks in ML pipelines

Mar 13, 2025

58:12

19

Disseminate x DuckDB Coming Soon...

Mar 6, 2025

2:40

20

High Impact in Databases with... Anastasia Ailamaki

Mar 3, 2025

46:17

21

Anastasiia Kozar | Fault Tolerance Placement in the Internet of Things | #61

Dec 16, 2024

49:02

22

Liana Patel | ACORN: Performant and Predicate-Agnostic Hybrid Search | #60

Nov 11, 2024

52:49

23

High Impact in Databases with... David Maier

Nov 4, 2024

62:24

24

Raunak Shah | R2D2: Reducing Redundancy and Duplication in Data Lakes | #59

Oct 28, 2024

31:09

25

High Impact in Databases with... Aditya Parameswaran

Oct 21, 2024

58:57

26

Marco Costa | Taming Adversarial Queries with Optimal Range Filters | #58

Oct 14, 2024

37:07

27

High Impact in Databases with... Ali Dasdan

Oct 8, 2024

63:02

28

Matt Perron | Analytical Workload Cost and Performance Stability With Elastic Pools | #57

Jul 22, 2024

52:10

29

High Impact in Databases with... Andreas Kipf

Jul 15, 2024

53:06

30

Marvin Wyrich & Justus Bogner | How Software Engineering Research Is Discussed on LinkedIn | #56

Jul 8, 2024

47:53

31

High Impact in Databases with... Joe Hellerstein

Jul 1, 2024

52:56

32

Harry Goldstein | Property-Based Testing | #55

Jun 25, 2024

49:13

33

High Impact in Databases with... Raghu Ramakrishnan

Jun 17, 2024

23:56

34

Gina Yuan | In-Network Assistance With Sidekick Protocols | #54

Jun 10, 2024

55:25

35

High Impact in Databases with... Moshe Vardi

Jun 3, 2024

47:39

36

Tammy Sukprasert | Move Your Workloads To Sweden! | #53

May 27, 2024

32:50

37

High Impact in Databases with... Ryan Marcus

May 20, 2024

59:52

38

Yazhuo Zhang | SIEVE is Simpler than LRU | #52

May 13, 2024

43:10

39

Introducing the High Impact Series...

May 6, 2024

2:40

40

Eleni Zapridou | Oligolithic Cross-task Optimizations across Isolated Workloads | #51

Apr 29, 2024

38:42

41

Pat Helland | Scalable OLTP in the Cloud: What’s the BIG DEAL? | #50

Apr 15, 2024

80:03

42

Rui Liu | Towards Resource-adaptive Query Execution in Cloud Native Databases | #49

Apr 1, 2024

53:52

43

Yifei Yang | Predicate Transfer: Efficient Pre-Filtering on Multi-Join Queries | #48

Mar 18, 2024

47:37

44

Vikramank Singh | Panda: Performance Debugging for Databases using LLM Agents | #47

Mar 4, 2024

68:12

45

Tamer Eldeeb | Chablis: Fast and General Transactions in Geo-Distributed Systems | #46

Feb 12, 2024

62:27

46

Matt Butrovich | Tigger: A Database Proxy That Bounces With User-Bypass | #45

Dec 18, 2023

63:55

47

Gábor Szárnyas | The LDBC Social Network Benchmark: Business Intelligence Workload | #44

Dec 4, 2023

46:34

48

Thaleia Doudali | Is Machine Learning Necessary for Cloud Resource Usage Forecasting? | #43

Nov 20, 2023

49:13

49

Jinkun Geng | Nezha: Deployable and High-Performance Consensus Using Synchronized Clocks | #42

Oct 23, 2023

55:09

50

Dimitris Koutsoukos | NVM: Is it Not Very Meaningful for Databases? | #41

Oct 9, 2023

48:57

51

Mohamed Alzayat | Groundhog: Efficient Request Isolation in FaaS | #40

Sep 11, 2023

42:46

52

Cuong Nguyen | Detock: High Performance Multi-region Transactions at Scale | #39

Aug 28, 2023

37:28

53

Bogdan Stoica | WAFFLE: Exposing Memory Ordering Bugs Efficiently with Active Delay Injection | #38

Aug 14, 2023

55:57

54

Roger Waleffe | MariusGNN: Resource-Efficient Out-of-Core Training of Graph Neural Networks | #37

Jul 31, 2023

73:06

55

Madelon Hulsebos | GitTables: A Large-Scale Corpus of Relational Tables | #36

Jul 17, 2023

45:54

56

Tarikul Islam Papon | ACEing the Bufferpool Management Paradigm for Modern Storage Devices | #35

Jun 20, 2023

47:18

57

Jian Zhang | VIPER: A Fast Snapshot Isolation Checker | #34

Jun 9, 2023

42:34

58

Ahmed Sayed | REFL: Resource Efficient Federated Learning | #33

May 26, 2023

58:53

59

Subhadeep Sarkar | Log-structured Merge Trees | #32

May 11, 2023

59:27

60

Andra Ionescu | Topio: The Geodata Marketplace | #31

Apr 25, 2023

46:25

61

Laurens Kuiper | These Rows Are Made For Sorting | #30

Apr 12, 2023

55:01

62

Semih Salihoğlu | Kùzu Graph Database Management System | #29

Apr 3, 2023

77:06

63

Lukas Vogel | Data Pipes: Declarative Control over Data Movement | #28

Mar 28, 2023

50:27

64

Haralampos Gavriilidis | In-Situ Cross-Database Query Processing | #27

Mar 20, 2023

60:53

65

Paras Jain & Sarah Wooders | Skyplane: Fast Data Transfers Between Any Cloud | #26

Mar 13, 2023

46:21

66

Yang Wang | Rethinking Concurrency Control in Databases | #25

Mar 6, 2023

55:56

67

Suyash Gupta | Chemistry behind Agreement | #24

Feb 27, 2023

63:51

68

Tobias Ziegler | Is Scalable OLTP in the Cloud a Solved Problem? | #23

Feb 20, 2023

55:25

69

Hamish Nicholson | HetCache: Synergising NVMe Storage and GPU acceleration for Memory-Efficient Analytics | #22

Feb 13, 2023

50:56

70

Immanuel Haffner | mutable: A Modern DBMS for Research and Fast Prototyping | #21

Feb 6, 2023

88:13

71

Konstantinos Kallas | Practically Correct, Just-in-Time Shell Script Parallelization | #20

Jan 30, 2023

57:48

72

Vasily Sartakov | CAP-VMs: Capability-Based Isolation and Sharing in the Cloud #19

Jan 23, 2023

36:10

73

Haoran Ma | MemLiner: Lining up Tracing and Application for a Far-Memory-Friendly Runtime | #18

Jan 16, 2023

44:25

74

Lexiang Huang | Metastable Failures in the Wild | #17

Jan 9, 2023

53:18

75

Andrew Quinn | Debugging the OmniTable Way | #16

Jan 2, 2023

57:58

76

Audrey Cheng | TAOBench: An End-to-End Benchmark for Social Network Workloads | #15

Dec 12, 2022

52:43

77

George Konstantinidis | Enabling Personal Consent in Databases | #14

Dec 5, 2022

55:36

78

Per Fuchs | Sortledton: a Universal, Transactional Graph Data Structure | #13

Nov 28, 2022

41:21

79

George Theodorakis | Scabbard: Single-Node Fault-Tolerant Stream Processing | #12

Nov 21, 2022

45:36

80

Kevin Gaffney | SQLite: Past, Present, and Future | #11

Nov 14, 2022

48:18

81

Matthias Jasny | P4DB - The Case for In-Network OLTP | #10

Aug 8, 2022

27:20

82

Tobias Ziegler | ScaleStore: A Fast and Cost-Efficient Storage Engine using DRAM, NVMe, and RDMA | #9

Aug 1, 2022

23:08

83

Chuzhe Tang | Ad Hoc Transactions in Web Applications: The Good, the Bad, and the Ugly | #8

Jul 25, 2022

32:15

84

Michael Abebe | Proteus: Autonomous Adaptive Storage for Mixed Workloads | #7

Jul 18, 2022

27:57

85

Hani Al-Sayeh | Juggler: Autonomous Cost Optimization and Performance Prediction of Big Data Applications | #6

Jul 11, 2022

32:00

86

Thomas Hütter | JEDI: These aren’t the JSON documents you’re looking for | #4

Jul 8, 2022

11:50

87

Sainyam Galhotra | Causal Feature Selection for Algorithmic Fairness | #5

Jul 8, 2022

12:06

88

Draco Xu | TSUBASA: Climate Network Construction on Historical and Real-Time Data | #3

Jul 4, 2022

17:14

89

Felix S Campbell | Efficient Answering of Historical What-if Queries | #2

Jul 1, 2022

19:21

90

Alex Isenko | Where Is My Training Bottleneck? Hidden Trade-Offs in Deep Learning Preprocessing Pipelines | #1

Jun 27, 2022

24:32

91

Coming Soon | ACM SIGMOD/PODS 2022 | #0

Jun 3, 2022

1:30

All Episodes