EPISODE · Sep 20, 2025 · 17 MIN
Building and maintaining large-scale data leak databases (DS2025)
from Chaos Computer Club - recent audio-only feed · host Yaroslav Harahuts
Technical and organizational insights into building and operating large-scale leak databases, using reveng.ee as a case study. Reveng.ee is a private database indexing extensive leaks of personal data from Russian sources, used by investigative journalists, military analysts, and law enforcement.

Possible formats:
- Workshop: a guided exploration of how to search and work with leak data using the real interface. This format has limited value because reveng.ee is a closed, paid service and will not be accessible to all participants.
- Talk: an in-depth overview of where such data comes from, how it is processed and stored, how search indexing works for messy, heterogeneous datasets, and operational considerations (security, legal, ethical). This would emphasize methodology and infrastructure rather than promoting or granting access to the actual platform.

Either format would take about 20–25 minutes, followed by Q&A.

Licensed to the public under https://creativecommons.org/licenses/by/4.0/de/
About this event: https://talks.datenspuren.de/ds25/talk/SG7CRZ/