Artwork

Content provided by The Data Flowcast. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by The Data Flowcast or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://ro.player.fm/legal.
Player FM - Aplicație Podcast
Treceți offline cu aplicația Player FM !

Scaling Airflow to 11,000 DAGs Across Three Regions at Intercom with András Gombosi and Paul Vickers

34:24
 
Distribuie
 

Manage episode 522564525 series 2053958
Content provided by The Data Flowcast. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by The Data Flowcast or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://ro.player.fm/legal.

The evolution of Intercom’s data infrastructure reveals how a well-built orchestration system can scale to serve global needs. With thousands of DAGs powering analytics, AI and customer operations, the team’s approach combines technical depth with organizational insight.

In this episode, András Gombosi, Senior Engineering Manager of Data Infra and Analytics Engineering, and Paul Vickers, Principal Engineer, both at Intercom, share how they built one of the largest Airflow deployments in production and enabled self-serve data platforms across teams.

Key Takeaways:

00:00 Introduction.

04:24 Community input encourages confident adoption of a common platform.

08:50 Self-serve workflows require consistent guardrails and review.

09:25 Internal infrastructure support accelerates scalable deployments.

13:26 Batch LLM processing benefits from a configuration-driven design.

15:20 Standardized development environments enable effective AI-assisted work.

19:58 Applied AI enhances internal analysis and operational enablement.

27:27 Strong test coverage and staged upgrades protect stability.

30:36 Proactive observability and on-call ownership improve outcomes.

Resources Mentioned:

András Gombosi

https://www.linkedin.com/in/andrasgombosi/

Paul Vickers

https://www.linkedin.com/in/paul-vickers-a22b76a3/

Intercom | LinkedIn

https://www.linkedin.com/company/intercom/

Intercom | Website

https://www.intercom.com

Apache Airflow

https://airflow.apache.org/

dbtLabs

https://www.getdbt.com/

Snowflake Cortex AI

https://www.snowflake.com/en/product/features/cortex/

Datadog

https://www.datadoghq.com/

Thanks for listening to “The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.

#AI #Automation #Airflow

  continue reading

81 episoade

Artwork
iconDistribuie
 
Manage episode 522564525 series 2053958
Content provided by The Data Flowcast. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by The Data Flowcast or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://ro.player.fm/legal.

The evolution of Intercom’s data infrastructure reveals how a well-built orchestration system can scale to serve global needs. With thousands of DAGs powering analytics, AI and customer operations, the team’s approach combines technical depth with organizational insight.

In this episode, András Gombosi, Senior Engineering Manager of Data Infra and Analytics Engineering, and Paul Vickers, Principal Engineer, both at Intercom, share how they built one of the largest Airflow deployments in production and enabled self-serve data platforms across teams.

Key Takeaways:

00:00 Introduction.

04:24 Community input encourages confident adoption of a common platform.

08:50 Self-serve workflows require consistent guardrails and review.

09:25 Internal infrastructure support accelerates scalable deployments.

13:26 Batch LLM processing benefits from a configuration-driven design.

15:20 Standardized development environments enable effective AI-assisted work.

19:58 Applied AI enhances internal analysis and operational enablement.

27:27 Strong test coverage and staged upgrades protect stability.

30:36 Proactive observability and on-call ownership improve outcomes.

Resources Mentioned:

András Gombosi

https://www.linkedin.com/in/andrasgombosi/

Paul Vickers

https://www.linkedin.com/in/paul-vickers-a22b76a3/

Intercom | LinkedIn

https://www.linkedin.com/company/intercom/

Intercom | Website

https://www.intercom.com

Apache Airflow

https://airflow.apache.org/

dbtLabs

https://www.getdbt.com/

Snowflake Cortex AI

https://www.snowflake.com/en/product/features/cortex/

Datadog

https://www.datadoghq.com/

Thanks for listening to “The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.

#AI #Automation #Airflow

  continue reading

81 episoade

Все серии

×
 
Loading …

Bun venit la Player FM!

Player FM scanează web-ul pentru podcast-uri de înaltă calitate pentru a vă putea bucura acum. Este cea mai bună aplicație pentru podcast și funcționează pe Android, iPhone și pe web. Înscrieți-vă pentru a sincroniza abonamentele pe toate dispozitivele.

 

Ghid rapid de referință

Listen to this show while you explore
Play