About the talk
Data is all over the place and not hard to obtain. However, what matters is how we manage that data, make sense out of it, and make meaningful decisions based on it. In this session, we will go through the data engineering pipeline via a demo. We will be using some open source datasets and store and process them on AWS.
This talk will cover
An interactive demo using open source datasets, which will cover:
- Data collection
- Data processing, analysis, and visualization
- A few relevant AWS services
Programming & Development
About the speaker

Suman is a Principal Developer Advocate at Amazon Web Services, primarily focusing on Data Engineering, Data Analysis and Machine Learning. He is passionate about large scale distributed systems and is a vivid fan of Python.