Important!!! How to join: On the morning of the event, you will get a link via Meetup. Make sure that you have opted for emails from Meetup.
|18:40||Data Processing with Python: a Containerized, Scheduled, and Monitored Pipeline on the Google Cloud Platform - Nancy Irisarri Méndez|
|19:15||AWS Glue - the good, the bad, the ugly - Jan Gazda|
Data Processing with Python: a Containerized, Scheduled, and Monitored Pipeline on the Google Cloud Platform
About: Nancy Irisarri Méndez  I studied economics and astronomy and have worked within research groups, government organizations, and health care. My strong suit is data science and data engineering, using Python and cloud providers as favorite tools.
During the talk, you will learn how to containerize a Python pipeline for data processing and execute it as a scheduled job that sends email messages for critical errors.
AWS Glue - the good, the bad, the ugly
Jan is a co-founder of PyAmsterdam, Pythonista and open-source contributor. Coming from Czech Republic he decided to bring the Python community spirit to Amsterdam. Python is his swiss army knife but he is not afraid of Golang as well. (But who needs types, right?)
AWS Glue offers Serverless ETL pipeline and workflows, while it can be rather simple to start using it via console it quickly become non trivial for deployment via CI/CD pipeline. Management of multiple environments, working with the pipeline state and keeping your code DRY is a challenge. During the talk I will walk you through guts of AWS Glue and show my approach to more streamlined deployment.