All Episodes
Disseminate: The Computer Science Research Podcast — 91 episodes
Mateusz Gienieczko | AnyBlox: A Framework for Self-Decoding Datasets | #69
Xiangyao Yu | Disaggregation: A New Architecture for Cloud Databases | #68
Navid Eslami | Diva: Dynamic Range Filter for Var-Length Keys and Queries | #67
Adaptive Factorization in DuckDB with Paul Groß
Parachute: Rethinking Query Execution and Bidirectional Information Flow in DuckDB - with Mihail Stoian
Anarchy in the Database: Abigale Kim on DuckDB and DBMS Extensibility
Recursive CTEs, Trampolines, and Teaching Databases with DuckDB - with Prof. Torsten Grust
DuckDB in Research S2 Coming Soon!
Rohan Padhye & Ao Li | Fray: An Efficient General-Purpose Concurrency JVM Testing Platform | #66
Shrey Tiwari | It's About Time: A Study of Date and Time Bugs in Python Software | #65
Lessons Learned from Five Years of Artifact Evaluations at EuroSys | #64
Dominik Winterer | Validating SMT Solvers for Correctness and Performance via Grammar-based Enumeration | #63
Haralampos Gavriilidis | Fast and Scalable Data Transfer across Data Systems | #62
Haralampos Gavriilidis | SheetReader: Efficient spreadsheet parsing
Arjen P. de Vries | faiss: An extension for vector data & search
David Justen | POLAR: Adaptive and non-invasive join order selection via plans of least resistance
Daniël ten Wolde | DuckPGQ: A graph extension supporting SQL/PGQ
Till Döhmen | DuckDQ: A Python library for data quality checks in ML pipelines
Disseminate x DuckDB Coming Soon...
High Impact in Databases with... Anastasia Ailamaki
Anastasiia Kozar | Fault Tolerance Placement in the Internet of Things | #61
Liana Patel | ACORN: Performant and Predicate-Agnostic Hybrid Search | #60
High Impact in Databases with... David Maier
Raunak Shah | R2D2: Reducing Redundancy and Duplication in Data Lakes | #59
High Impact in Databases with... Aditya Parameswaran
Marco Costa | Taming Adversarial Queries with Optimal Range Filters | #58
High Impact in Databases with... Ali Dasdan
Matt Perron | Analytical Workload Cost and Performance Stability With Elastic Pools | #57
High Impact in Databases with... Andreas Kipf
Marvin Wyrich & Justus Bogner | How Software Engineering Research Is Discussed on LinkedIn | #56
High Impact in Databases with... Joe Hellerstein
Harry Goldstein | Property-Based Testing | #55
High Impact in Databases with... Raghu Ramakrishnan
Gina Yuan | In-Network Assistance With Sidekick Protocols | #54
High Impact in Databases with... Moshe Vardi
Tammy Sukprasert | Move Your Workloads To Sweden! | #53
High Impact in Databases with... Ryan Marcus
Yazhuo Zhang | SIEVE is Simpler than LRU | #52
Introducing the High Impact Series...
Eleni Zapridou | Oligolithic Cross-task Optimizations across Isolated Workloads | #51
Pat Helland | Scalable OLTP in the Cloud: What’s the BIG DEAL? | #50
Rui Liu | Towards Resource-adaptive Query Execution in Cloud Native Databases | #49
Yifei Yang | Predicate Transfer: Efficient Pre-Filtering on Multi-Join Queries | #48
Vikramank Singh | Panda: Performance Debugging for Databases using LLM Agents | #47
Tamer Eldeeb | Chablis: Fast and General Transactions in Geo-Distributed Systems | #46
Matt Butrovich | Tigger: A Database Proxy That Bounces With User-Bypass | #45
Gábor Szárnyas | The LDBC Social Network Benchmark: Business Intelligence Workload | #44
Thaleia Doudali | Is Machine Learning Necessary for Cloud Resource Usage Forecasting? | #43
Jinkun Geng | Nezha: Deployable and High-Performance Consensus Using Synchronized Clocks | #42
Dimitris Koutsoukos | NVM: Is it Not Very Meaningful for Databases? | #41
Mohamed Alzayat | Groundhog: Efficient Request Isolation in FaaS | #40
Cuong Nguyen | Detock: High Performance Multi-region Transactions at Scale | #39
Bogdan Stoica | WAFFLE: Exposing Memory Ordering Bugs Efficiently with Active Delay Injection | #38
Roger Waleffe | MariusGNN: Resource-Efficient Out-of-Core Training of Graph Neural Networks | #37
Madelon Hulsebos | GitTables: A Large-Scale Corpus of Relational Tables | #36
Tarikul Islam Papon | ACEing the Bufferpool Management Paradigm for Modern Storage Devices | #35
Jian Zhang | VIPER: A Fast Snapshot Isolation Checker | #34
Ahmed Sayed | REFL: Resource Efficient Federated Learning | #33
Subhadeep Sarkar | Log-structured Merge Trees | #32
Andra Ionescu | Topio: The Geodata Marketplace | #31
Laurens Kuiper | These Rows Are Made For Sorting | #30
Semih Salihoğlu | Kùzu Graph Database Management System | #29
Lukas Vogel | Data Pipes: Declarative Control over Data Movement | #28
Haralampos Gavriilidis | In-Situ Cross-Database Query Processing | #27
Paras Jain & Sarah Wooders | Skyplane: Fast Data Transfers Between Any Cloud | #26
Yang Wang | Rethinking Concurrency Control in Databases | #25
Suyash Gupta | Chemistry behind Agreement | #24
Tobias Ziegler | Is Scalable OLTP in the Cloud a Solved Problem? | #23
Hamish Nicholson | HetCache: Synergising NVMe Storage and GPU acceleration for Memory-Efficient Analytics | #22
Immanuel Haffner | mutable: A Modern DBMS for Research and Fast Prototyping | #21
Konstantinos Kallas | Practically Correct, Just-in-Time Shell Script Parallelization | #20
Vasily Sartakov | CAP-VMs: Capability-Based Isolation and Sharing in the Cloud #19
Haoran Ma | MemLiner: Lining up Tracing and Application for a Far-Memory-Friendly Runtime | #18
Lexiang Huang | Metastable Failures in the Wild | #17
Andrew Quinn | Debugging the OmniTable Way | #16
Audrey Cheng | TAOBench: An End-to-End Benchmark for Social Network Workloads | #15
George Konstantinidis | Enabling Personal Consent in Databases | #14
Per Fuchs | Sortledton: a Universal, Transactional Graph Data Structure | #13
George Theodorakis | Scabbard: Single-Node Fault-Tolerant Stream Processing | #12
Kevin Gaffney | SQLite: Past, Present, and Future | #11
Matthias Jasny | P4DB - The Case for In-Network OLTP | #10
Tobias Ziegler | ScaleStore: A Fast and Cost-Efficient Storage Engine using DRAM, NVMe, and RDMA | #9
Chuzhe Tang | Ad Hoc Transactions in Web Applications: The Good, the Bad, and the Ugly | #8
Michael Abebe | Proteus: Autonomous Adaptive Storage for Mixed Workloads | #7
Hani Al-Sayeh | Juggler: Autonomous Cost Optimization and Performance Prediction of Big Data Applications | #6
Thomas Hütter | JEDI: These aren’t the JSON documents you’re looking for | #4
Sainyam Galhotra | Causal Feature Selection for Algorithmic Fairness | #5
Draco Xu | TSUBASA: Climate Network Construction on Historical and Real-Time Data | #3
Felix S Campbell | Efficient Answering of Historical What-if Queries | #2
Alex Isenko | Where Is My Training Bottleneck? Hidden Trade-Offs in Deep Learning Preprocessing Pipelines | #1
Coming Soon | ACM SIGMOD/PODS 2022 | #0