Fundamentals of Apache Spark | Think Tech #14
Episode 48 of the Think Tech podcast, hosted by Shivam Mohan, titled "Fundamentals of Apache Spark | Think Tech #14" was published on July 24, 2023 and runs 21 minutes.
July 24, 2023 ·21m · Think Tech
Summary
Join me in this engaging and comprehensive episode of Think Tech that explores the powerful distributed computing system, Apache Spark, designed for big data processing. The episode covers the core concepts and inner workings of Spark, emphasizing its memory-centric architecture that enables lightning-fast processing and real-time or near-real-time capabilities. Listeners gain insights into Spark's fault-tolerant master/worker model, the significance of partitions for parallel processing, and the three essential data abstractions - RDD, Dataframe, and Dataset. The podcast also delves into Actions and Transformations, explaining their roles in optimizing data processing workflows. Additionally, the Spark Session as the entry point and the execution modes (Client, Cluster, and Local) for different scenarios are highlighted. Overall, the episode serves as an essential guide for understanding Apache Spark and its groundbreaking contributions to the world of big data processing.
Episode Description
Join me in this engaging and comprehensive episode of Think Tech that explores the powerful distributed computing system, Apache Spark, designed for big data processing. The episode covers the core concepts and inner workings of Spark, emphasizing its memory-centric architecture that enables lightning-fast processing and real-time or near-real-time capabilities. Listeners gain insights into Spark's fault-tolerant master/worker model, the significance of partitions for parallel processing, and the three essential data abstractions - RDD, Dataframe, and Dataset. The podcast also delves into Actions and Transformations, explaining their roles in optimizing data processing workflows. Additionally, the Spark Session as the entry point and the execution modes (Client, Cluster, and Local) for different scenarios are highlighted. Overall, the episode serves as an essential guide for understanding Apache Spark and its groundbreaking contributions to the world of big data processing.
Similar Episodes
May 14, 2025 ·40m
Mar 12, 2025 ·37m
Feb 26, 2025 ·40m
Jan 2, 2025 ·13m