EPISODE · Nov 26, 2024 · 43 MIN
Data Pipelines with Apache Airflow
from CyberSecurity Summary · host CyberSecurity Summary
This Book provides a comprehensive guide to Apache Airflow, a powerful open-source workflow management platform commonly used in data-intensive environments. It covers the fundamentals of Airflow, including defining data pipelines as directed acyclic graphs (DAGs), scheduling and executing these pipelines, monitoring their performance, and handling failures. The book also explores advanced topics such as templating tasks, building custom components, integrating with external systems, and designing tests for your pipelines. The authors then demonstrate how to deploy and operate Airflow in production environments, including securing the system, managing resources efficiently, and collecting metrics for monitoring. Finally, the book includes detailed guidance on deploying Airflow in various cloud environments, including AWS, Azure, and GCP.You can listen and download our episodes for free on more than 10 different platforms:https://linktr.ee/cyber_security_summaryGet the Book now from Amazon:https://www.amazon.com/Data-Pipelines-Apache-Airflow-Harenslak/dp/1617296902?&linkCode=ll1&tag=cvthunderx-20&linkId=39a43518fff3b8fca733494faa3cb6df&language=en_US&ref_=as_li_ss_tlDiscover our free courses in tech and cybersecurity, Start learning today:https://linktr.ee/cybercode_academy
What this episode covers
This Book provides a comprehensive guide to Apache Airflow, a powerful open-source workflow management platform commonly used in data-intensive environments. It covers the fundamentals of Airflow, including defining data pipelines as directed acyclic graphs (DAGs), scheduling and executing these pipelines, monitoring their performance, and handling failures. The book also explores advanced topics such as templating tasks, building custom components, integrating with external systems, and designing tests for your pipelines. The authors then demonstrate how to deploy and operate Airflow in production environments, including securing the system, managing resources efficiently, and collecting metrics for monitoring. Finally, the book includes detailed guidance on deploying Airflow in various cloud environments, including AWS, Azure, and GCP.You can listen and download our episodes for free on more than 10 different platforms:https://linktr.ee/cyber_security_summaryGet the Book now from Amazon:https://www.amazon.com/Data-Pipelines-Apache-Airflow-Harenslak/dp/1617296902?&linkCode=ll1&tag=cvthunderx-20&linkId=39a43518fff3b8fca733494faa3cb6df&language=en_US&ref_=as_li_ss_tlDiscover our free courses in tech and cybersecurity, Start learning today:https://linktr.ee/cybercode_academy
NOW PLAYING
Data Pipelines with Apache Airflow
No transcript for this episode yet
Similar Episodes
Jun 13, 2025 ·17m
May 7, 2025 ·14m
Mar 26, 2025 ·23m
Feb 22, 2025 ·13m
Jan 7, 2025 ·10m
Dec 31, 2024 ·19m