EPISODE · May 12, 2026 · 24 MIN
Practical Data Science: A Guide to Building the Technology Stack for Turning Data Lakes into Business Assets
from CyberSecurity Summary · host CyberSecurity Summary
A comprehensive guide by Andreas François Vermeulen designed to help organizations convert raw data lakes into valuable business assets. It outlines a sophisticated Data Science Technology Stack that includes powerful processing and storage tools like Apache Spark, Kafka, and Cassandra, alongside programming languages such as R, Python, and Scala. The author presents a structured layered framework and the HORUS methodology to streamline data transformation through a hub-and-spoke approach. To ground these technical concepts, the text establishes a fictional corporate group, VKHCG, providing realistic datasets across sectors like logistics, media, and finance. This framework emphasizes moving beyond simple data wrangling toward a Center of Excellence model that ensures scalability and operational efficiency. Ultimately, the sources serve as both a theoretical roadmap and a practical manual for mastering the end-to-end data-to-knowledge cycle.You can listen and download our episodes for free on more than 10 different platforms:https://linktr.ee/cyber_security_summaryGet the Book now from Amazon:https://www.amazon.com/Practical-Data-Science-Building-Technology/dp/1484230531?&linkCode=ll2&tag=cvthunderx-20&linkId=41e96f1f6d23f742302cb82466c28372&language=en_US&ref_=as_li_ss_tlDiscover our free courses in tech and cybersecurity, Start learning today:https://linktr.ee/cybercode_academy
What this episode covers
A comprehensive guide by Andreas François Vermeulen designed to help organizations convert raw data lakes into valuable business assets. It outlines a sophisticated Data Science Technology Stack that includes powerful processing and storage tools like Apache Spark, Kafka, and Cassandra, alongside programming languages such as R, Python, and Scala. The author presents a structured layered framework and the HORUS methodology to streamline data transformation through a hub-and-spoke approach. To ground these technical concepts, the text establishes a fictional corporate group, VKHCG, providing realistic datasets across sectors like logistics, media, and finance. This framework emphasizes moving beyond simple data wrangling toward a Center of Excellence model that ensures scalability and operational efficiency. Ultimately, the sources serve as both a theoretical roadmap and a practical manual for mastering the end-to-end data-to-knowledge cycle.You can listen and download our episodes for free on more than 10 different platforms:https://linktr.ee/cyber_security_summaryGet the Book now from Amazon:https://www.amazon.com/Practical-Data-Science-Building-Technology/dp/1484230531?&linkCode=ll2&tag=cvthunderx-20&linkId=41e96f1f6d23f742302cb82466c28372&language=en_US&ref_=as_li_ss_tlDiscover our free courses in tech and cybersecurity, Start learning today:https://linktr.ee/cybercode_academy
NOW PLAYING
Practical Data Science: A Guide to Building the Technology Stack for Turning Data Lakes into Business Assets
No transcript for this episode yet
Similar Episodes
Jun 13, 2025 ·17m
May 7, 2025 ·14m
Mar 26, 2025 ·23m
Feb 22, 2025 ·13m
Jan 7, 2025 ·10m
Dec 31, 2024 ·19m