Why Attend?
- Live Online
- 1,00,000+ people attended since 2009
Rs 1999FREE- Certificate of Participation
- An exclusive Surprise
By clicking the above button, you agree to our Privacy Policy.
Why Attend?
By clicking the above button, you agree to our Privacy Policy.
The Data Team at Qubole collects usage and telemetry data from a million machines a month. We run many complex ETL workflows to process this data and provide reports, insights, and recommendations to customers, analysts and data scientists. We use the open-source distribution of Apache Airflow to orchestrate our ETL and process more than 1 terabyte of data daily.
In this talk, we will be talking about how we have extended airflow to manage the operational inefficiencies that arise when you manage data pipelines in a multi-tenant environment. We will also be talking about how we have made the data pipelines robust by adding data quality checks using CheckOperators.
Not Sure, What to learn and how it will help you?