#
Title
1

Mateusz Gienieczko | AnyBlox: A Framework for Self-Decoding Datasets | #69

2

Xiangyao Yu | Disaggregation: A New Architecture for Cloud Databases | #68

3

Navid Eslami | Diva: Dynamic Range Filter for Var-Length Keys and Queries | #67

4

Adaptive Factorization in DuckDB with Paul Groß

5

Parachute: Rethinking Query Execution and Bidirectional Information Flow in DuckDB - with Mihail Stoian

6

Anarchy in the Database: Abigale Kim on DuckDB and DBMS Extensibility

7

Recursive CTEs, Trampolines, and Teaching Databases with DuckDB - with Prof. Torsten Grust

8

DuckDB in Research S2 Coming Soon!

9

Rohan Padhye & Ao Li | Fray: An Efficient General-Purpose Concurrency JVM Testing Platform | #66

10

Shrey Tiwari | It's About Time: A Study of Date and Time Bugs in Python Software | #65

11

Lessons Learned from Five Years of Artifact Evaluations at EuroSys | #64

12

Dominik Winterer | Validating SMT Solvers for Correctness and Performance via Grammar-based Enumeration | #63

13

Haralampos Gavriilidis | Fast and Scalable Data Transfer across Data Systems | #62

14

Haralampos Gavriilidis | SheetReader: Efficient spreadsheet parsing

15

Arjen P. de Vries | faiss: An extension for vector data & search

16

David Justen | POLAR: Adaptive and non-invasive join order selection via plans of least resistance

17

Daniël ten Wolde | DuckPGQ: A graph extension supporting SQL/PGQ

18

Till Döhmen | DuckDQ: A Python library for data quality checks in ML pipelines

19

Disseminate x DuckDB Coming Soon...

20

High Impact in Databases with... Anastasia Ailamaki

21

Anastasiia Kozar | Fault Tolerance Placement in the Internet of Things | #61

22

Liana Patel | ACORN: Performant and Predicate-Agnostic Hybrid Search | #60

23

High Impact in Databases with... David Maier

24

Raunak Shah | R2D2: Reducing Redundancy and Duplication in Data Lakes | #59

25

High Impact in Databases with... Aditya Parameswaran

26

Marco Costa | Taming Adversarial Queries with Optimal Range Filters | #58

27

High Impact in Databases with... Ali Dasdan

28

Matt Perron | Analytical Workload Cost and Performance Stability With Elastic Pools | #57

29

High Impact in Databases with... Andreas Kipf

30

Marvin Wyrich & Justus Bogner | How Software Engineering Research Is Discussed on LinkedIn | #56

31

High Impact in Databases with... Joe Hellerstein

32

Harry Goldstein | Property-Based Testing | #55

33

High Impact in Databases with... Raghu Ramakrishnan

34

Gina Yuan | In-Network Assistance With Sidekick Protocols | #54

35

High Impact in Databases with... Moshe Vardi

36

Tammy Sukprasert | Move Your Workloads To Sweden! | #53

37

High Impact in Databases with... Ryan Marcus

38

Yazhuo Zhang | SIEVE is Simpler than LRU | #52

39

Introducing the High Impact Series...

40

Eleni Zapridou | Oligolithic Cross-task Optimizations across Isolated Workloads | #51

41

Pat Helland | Scalable OLTP in the Cloud: What’s the BIG DEAL? | #50

42

Rui Liu | Towards Resource-adaptive Query Execution in Cloud Native Databases | #49

43

Yifei Yang | Predicate Transfer: Efficient Pre-Filtering on Multi-Join Queries | #48

44

Vikramank Singh | Panda: Performance Debugging for Databases using LLM Agents | #47

45

Tamer Eldeeb | Chablis: Fast and General Transactions in Geo-Distributed Systems | #46

46

Matt Butrovich | Tigger: A Database Proxy That Bounces With User-Bypass | #45

47

Gábor Szárnyas | The LDBC Social Network Benchmark: Business Intelligence Workload | #44

48

Thaleia Doudali | Is Machine Learning Necessary for Cloud Resource Usage Forecasting? | #43

49

Jinkun Geng | Nezha: Deployable and High-Performance Consensus Using Synchronized Clocks | #42

50

Dimitris Koutsoukos | NVM: Is it Not Very Meaningful for Databases? | #41

51

Mohamed Alzayat | Groundhog: Efficient Request Isolation in FaaS | #40

52

Cuong Nguyen | Detock: High Performance Multi-region Transactions at Scale | #39

53

Bogdan Stoica | WAFFLE: Exposing Memory Ordering Bugs Efficiently with Active Delay Injection | #38

54

Roger Waleffe | MariusGNN: Resource-Efficient Out-of-Core Training of Graph Neural Networks | #37

55

Madelon Hulsebos | GitTables: A Large-Scale Corpus of Relational Tables | #36

56

Tarikul Islam Papon | ACEing the Bufferpool Management Paradigm for Modern Storage Devices | #35

57

Jian Zhang | VIPER: A Fast Snapshot Isolation Checker | #34

58

Ahmed Sayed | REFL: Resource Efficient Federated Learning | #33

59

Subhadeep Sarkar | Log-structured Merge Trees | #32

60

Andra Ionescu | Topio: The Geodata Marketplace | #31

61

Laurens Kuiper | These Rows Are Made For Sorting | #30

62

Semih Salihoğlu | Kùzu Graph Database Management System | #29

63

Lukas Vogel | Data Pipes: Declarative Control over Data Movement | #28

64

Haralampos Gavriilidis | In-Situ Cross-Database Query Processing | #27

65

Paras Jain & Sarah Wooders | Skyplane: Fast Data Transfers Between Any Cloud | #26

66

Yang Wang | Rethinking Concurrency Control in Databases | #25

67

Suyash Gupta | Chemistry behind Agreement | #24

68

Tobias Ziegler | Is Scalable OLTP in the Cloud a Solved Problem? | #23

69

Hamish Nicholson | HetCache: Synergising NVMe Storage and GPU acceleration for Memory-Efficient Analytics | #22

70

Immanuel Haffner | mutable: A Modern DBMS for Research and Fast Prototyping | #21

71

Konstantinos Kallas | Practically Correct, Just-in-Time Shell Script Parallelization | #20

72

Vasily Sartakov | CAP-VMs: Capability-Based Isolation and Sharing in the Cloud #19

73

Haoran Ma | MemLiner: Lining up Tracing and Application for a Far-Memory-Friendly Runtime | #18

74

Lexiang Huang | Metastable Failures in the Wild | #17

75

Andrew Quinn | Debugging the OmniTable Way | #16

76

Audrey Cheng | TAOBench: An End-to-End Benchmark for Social Network Workloads | #15

77

George Konstantinidis | Enabling Personal Consent in Databases | #14

78

Per Fuchs | Sortledton: a Universal, Transactional Graph Data Structure | #13

79

George Theodorakis | Scabbard: Single-Node Fault-Tolerant Stream Processing | #12

80

Kevin Gaffney | SQLite: Past, Present, and Future | #11

81

Matthias Jasny | P4DB - The Case for In-Network OLTP | #10

82

Tobias Ziegler | ScaleStore: A Fast and Cost-Efficient Storage Engine using DRAM, NVMe, and RDMA | #9

83

Chuzhe Tang | Ad Hoc Transactions in Web Applications: The Good, the Bad, and the Ugly | #8

84

Michael Abebe | Proteus: Autonomous Adaptive Storage for Mixed Workloads | #7

85

Hani Al-Sayeh | Juggler: Autonomous Cost Optimization and Performance Prediction of Big Data Applications | #6

86

Thomas Hütter | JEDI: These aren’t the JSON documents you’re looking for | #4

87

Sainyam Galhotra | Causal Feature Selection for Algorithmic Fairness | #5

88

Draco Xu | TSUBASA: Climate Network Construction on Historical and Real-Time Data | #3

89

Felix S Campbell | Efficient Answering of Historical What-if Queries | #2

90

Alex Isenko | Where Is My Training Bottleneck? Hidden Trade-Offs in Deep Learning Preprocessing Pipelines | #1

91

Coming Soon | ACM SIGMOD/PODS 2022 | #0