Using Airflow To Power Machine Learning Pipelines at Optimove with Vasyl Vasyuta episode artwork

EPISODE · Dec 12, 2024 · 24 MIN

Using Airflow To Power Machine Learning Pipelines at Optimove with Vasyl Vasyuta

from The Data Flowcast: Mastering Apache Airflow ® for Data Engineering and AI · host Astronomer

Data orchestration and machine learning are shaping how organizations handle massive datasets and drive customer-focused strategies. Tools like Apache Airflow are central to this transformation. In this episode, Vasyl Vasyuta, R&D Team Leader at Optimove, joins us to discuss how his team leverages Airflow to optimize data processing, orchestrate machine learning models and create personalized customer experiences.Key Takeaways:(01:59) Optimove tailors marketing notifications with personalized customer journeys.(04:25) Airflow orchestrates Snowflake procedures for massive datasets.(05:11) DAGs manage workflows with branching and replay plugins.(05:41) The "Joystick" plugin enables seamless data replays.(09:33) Airflow supports MLOps for customer data grouping.(11:15) Machine learning predicts customer behavior for better campaigns.(13:20) Thousands of DAGs run every five minutes for data processing.(15:36) Custom versioning allows rollbacks and gradual rollouts.(18:00) Airflow logs enhance operational observability.(23:00) DAG versioning in Airflow 3.0 could boost efficiency.Resources Mentioned:Vasyl Vasyuta -https://www.linkedin.com/in/vasyl-vasyuta-3270b54a/Optimove -https://www.linkedin.com/company/optimove/Apache Airflow -https://airflow.apache.org/Snowflake -https://www.snowflake.com/Datadog -https://www.datadoghq.com/Apache Airflow Survey -https://astronomer.typeform.com/airflowsurvey24Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering & AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.#AI #Automation #Airflow #MachineLearning

Data orchestration and machine learning are shaping how organizations handle massive datasets and drive customer-focused strategies. Tools like Apache Airflow are central to this transformation. In this episode, Vasyl Vasyuta, R&D Team Leader at Optimove, joins us to discuss how his team leverages Airflow to optimize data processing, orchestrate machine learning models and create personalized customer experiences.Key Takeaways:(01:59) Optimove tailors marketing notifications with personalized customer journeys.(04:25) Airflow orchestrates Snowflake procedures for massive datasets.(05:11) DAGs manage workflows with branching and replay plugins.(05:41) The "Joystick" plugin enables seamless data replays.(09:33) Airflow supports MLOps for customer data grouping.(11:15) Machine learning predicts customer behavior for better campaigns.(13:20) Thousands of DAGs run every five minutes for data processing.(15:36) Custom versioning allows rollbacks and gradual rollouts.(18:00) Airflow logs enhance operational observability.(23:00) DAG versioning in Airflow 3.0 could boost efficiency.Resources Mentioned:Vasyl Vasyuta -https://www.linkedin.com/in/vasyl-vasyuta-3270b54a/Optimove -https://www.linkedin.com/company/optimove/Apache Airflow -https://airflow.apache.org/Snowflake -https://www.snowflake.com/Datadog -https://www.datadoghq.com/Apache Airflow Survey -https://astronomer.typeform.com/airflowsurvey24Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering & AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.#AI #Automation #Airflow #MachineLearning

NOW PLAYING

Using Airflow To Power Machine Learning Pipelines at Optimove with Vasyl Vasyuta

0:00 24:11

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

Frequently Asked Questions

How long is this episode of The Data Flowcast: Mastering Apache Airflow ® for Data Engineering and AI?

This episode is 24 minutes long.

When was this The Data Flowcast: Mastering Apache Airflow ® for Data Engineering and AI episode published?

This episode was published on December 12, 2024.

What is this episode about?

Data orchestration and machine learning are shaping how organizations handle massive datasets and drive customer-focused strategies. Tools like Apache Airflow are central to this transformation. In this episode, Vasyl Vasyuta, R&D Team Leader at...

Can I download this The Data Flowcast: Mastering Apache Airflow ® for Data Engineering and AI episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!