Hot and cold data with Apache Kafka, Tiered Storage, and Iceberg episode artwork

EPISODE · Jul 16, 2024 · 48 MIN

Hot and cold data with Apache Kafka, Tiered Storage, and Iceberg

from Data (R)evolution · host Aiven

Utilizing the true potential of data streaming is key to business success. In this Data (R)evolution episode, we're joined by Josep Prat and Filip Yonov to dive into the transformative features of Apache Kafka and its evolving role in data architecture. They discuss the critical importance of collaboration and feedback in enhancing Kafka's capabilities, the future of "lake house" technology, exciting updates from the Open Source Program Office (OSPO), and the importance of Kafka's readiness to support evolving data formats—making it a backbone for modern data ecosystems.Key Takeaways:Community collaboration and contribution are essential for the continuous improvement and testing of Apache Kafka's capabilitiesThe evolution of Apache Kafka into a more versatile platform, combined with object storage and open table formats, can significantly enhance real-time data streaming, analytics, and the future of "lake house" technologyTiered storage in Kafka facilitates more efficient and cost-effective data management by decoupling storage from computingResources:Watch the full interview on our YouTube: https://www.youtube.com/@Aiven_ioCheck out our website for more information: https://aiven.io/Check out Aiven AI Database Optimizer Want to be on our mailing list? Sign up here: https://aiven.io/resourcesFollow us on LinkedIn: https://www.linkedin.com/company/aiven/Sign up for our newsletter for more insights on this topic: https://aiven.io/newsletterConnect with Filip Yonov on LinkedIn: https://www.linkedin.com/in/filipyonov/Connect with Josep Prat on LinkedIn: https://www.linkedin.com/in/jlprat/Timestamps:[05:49] Kafka servers have theoretical storage limits[09:29] Test storage proposal process for Apache Kafka[17:38] LinkedIn conducted an experiment merging Xcode versions[22:11] Data lake evolving into lake house architectures[25:00] Broker pushes data to remote storage, plugin handles retrieval and format translation[26:40] Kafka excels at high-speed, high-volume data[32:18] Kafka data consumption evolving with new options[40:19] Managing metadata for conversion on community level[47:45] Kafka's potential as a widely used API

Utilizing the true potential of data streaming is key to business success. In this Data (R)evolution episode, we're joined by Josep Prat and Filip Yonov to dive into the transformative features of Apache Kafka and its evolving role in data architecture. They discuss the critical importance of collaboration and feedback in enhancing Kafka's capabilities, the future of "lake house" technology, exciting updates from the Open Source Program Office (OSPO), and the importance of Kafka's readiness to support evolving data formats—making it a backbone for modern data ecosystems.Key Takeaways:Community collaboration and contribution are essential for the continuous improvement and testing of Apache Kafka's capabilitiesThe evolution of Apache Kafka into a more versatile platform, combined with object storage and open table formats, can significantly enhance real-time data streaming, analytics, and the future of "lake house" technologyTiered storage in Kafka facilitates more efficient and cost-effective data management by decoupling storage from computingResources:Watch the full interview on our YouTube: https://www.youtube.com/@Aiven_ioCheck out our website for more information: https://aiven.io/Check out Aiven AI Database Optimizer Want to be on our mailing list? Sign up here: https://aiven.io/resourcesFollow us on LinkedIn: https://www.linkedin.com/company/aiven/Sign up for our newsletter for more insights on this topic: https://aiven.io/newsletterConnect with Filip Yonov on LinkedIn: https://www.linkedin.com/in/filipyonov/Connect with Josep Prat on LinkedIn: https://www.linkedin.com/in/jlprat/Timestamps:[05:49] Kafka servers have theoretical storage limits[09:29] Test storage proposal process for Apache Kafka[17:38] LinkedIn conducted an experiment merging Xcode versions[22:11] Data lake evolving into lake house architectures[25:00] Broker pushes data to remote storage, plugin handles retrieval and format translation[26:40] Kafka excels at high-speed, high-volume data[32:18] Kafka data consumption evolving with new options[40:19] Managing metadata for conversion on community level[47:45] Kafka's potential as a widely used API

NOW PLAYING

Hot and cold data with Apache Kafka, Tiered Storage, and Iceberg

0:00 48:58

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

CISO Perspectives (public) N2K Networks This season on CISO Perspectives, host Kim Jones explores some of the challenges of leading through uncertainty. We explore the complexity of the changing nature of regulation and working with the federal government, the evolution of privacy and fraud, and how emerging technologies like AI and quantum computing are changing cyber. When you don’t know what questions to ask, you’re afraid to ask, or don’t know who to ask, CISO Perspectives provides the foundation for learning in this brave new world. NEWMORROW SESSIONS - A PodCast Series on the Future of Hospitality Mario C. Bauer, Florian Schneider, Axel Weber & Dr. Tillman Bardt The Newmorrow PodCast is more than a podcast — it's a platform for open dialog on the future of our business, a platform for those building what doesn’t exist yet. Here, we share and embrace our passion for the hospitality industry, but we won’t romanticize the journey. We ask the tough questions, confront uncomfortable truths, and prepare for a future that resists easy answers. We believe that the tougher and wilder times become, the more openly, honestly and humanely people need to talk to each other and act together. We believe, openness, togetherness, and truthfulness should also be cornerstones of a professional community to develop our utopian idea of „open source“. This is a space where visionaries don’t just imagine the future — they wrestle with the paradoxes that shape it: success vs. happiness, data vs. instinct, stability vs. reinvention. Join leaders, entrepreneurs, and thinkers as they share not what made them — but what’s actively shaping them, now and next. So tune in Hyperfluent Hypio Hyperfluent transmits straight from the heart of Hyperliquid, where culture, creativity, and capital converge. Anchored by the architects of Hypio—the decentralized cultural virus—each episode archives the minds engineering the blockchain built to house all finance. These conversations are traceable artifacts in HyperEVM’s evolution: not just what’s being built, but why it matters, how it mutates, and where it’s taking us next. Listen in for the blueprints, the blind spots, and the narrative weapons shaping tomorrow’s markets.Hyperfluent: learn the language, ride the wave, spread the strain. The Health Odyssey: Navigating Tomorrow's Medicine Podcast Welcome to 'The Health Odyssey: Navigating Tomorrow's Medicine,' where we embark on an adventurous journey through the ever-evolving world of healthcare. Each episode is like a treasure map, guiding you through the rich tapestry of ancient healing arts mixed with futuristic tech wizardry. We’ll chat about the wild west of health data privacy, the corporate giants reshaping our care, and the mind-bending potential of psychedelics for mental wellness. Think of us as your trusty sidekicks, unraveling the mysteries of modern medicine while keeping it real and relatable. Let’s dive into the stories, the science, and the soul of healthcare, paving the way for a healthier tomorrow.

Frequently Asked Questions

How long is this episode of Data (R)evolution?

This episode is 48 minutes long.

When was this Data (R)evolution episode published?

This episode was published on July 16, 2024.

What is this episode about?

Utilizing the true potential of data streaming is key to business success. In this Data (R)evolution episode, we're joined by Josep Prat and Filip Yonov to dive into the transformative features of Apache Kafka and its evolving role in data...

Can I download this Data (R)evolution episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!