+91-80100-33033 | info@digitalvidya.com
Duration
60 Mins
Day
Jul 11 (Wed), 2018
Time
-

Session Agenda

The Data Team at Qubole collects usage and telemetry data from a million machines a month. We run many complex ETL workflows to process this data and provide reports, insights, and recommendations to customers, analysts and data scientists. We use the open-source distribution of Apache Airflow to orchestrate our ETL and process more than 1 terabyte of data daily.

In this talk, we will be talking about how we have extended airflow to manage the operational inefficiencies that arise when you manage data pipelines in a multi-tenant environment. We will also be talking about how we have made the data pipelines robust by adding data quality checks using CheckOperators.

Who Should Attend?

Students
Students
bulb
Entrepreneurs
advertising
Advertising & Marketing Professionals
CXO's
CXO's
CXO's
Professionals
breafcase
Digital Marketing Professionals
globe
Web Strategists

Discuss With A Career Advisor

Not Sure, What to learn and how it will help you?

Call Us Live Chat Free MasterClass
Scroll to Top