Yuriy Margulis

Yuriy Margulis

Mentor
Rising Codementor
US$25.00
For every 15 mins
free badge
First 15 mins free for your first session
ABOUT ME
Big Data Engineer, Architect, Manager
Big Data Engineer, Architect, Manager

Data specialist with over 15 years of experience in data warehousing, data engineering, big data, and business intelligence. Over the years worked on 5 large data warehouses for prime internet, media, and entertainment companies and multiple Big Data systems. In addition, also acted as a hands-on big data engineer & architect, ETL developer, database administrator, provided operational support and SLA compliance.

Russian, English
Pacific Time (US & Canada) (-07:00)
Joined April 2019
EXPERTISE
3 years experience
Data Engineering, ETL
Data Engineering, ETL
4 years experience
AWS and HDP Spark
AWS and HDP Spark
20 years experience
Main language of relational Data Warehousing
Main language of relational Data Warehousing
5 years experience
One S3-to-S3 Data Warehouse, 3 AWS S3 Data Lakes, one Feature Engineering data system, multiple data marts, BI reporting, DevOps
One S3-to-S3 Data Warehouse, 3 AWS S3 Data Lakes, one Feature Engineering data system, multiple data marts, BI reporting, DevOps
20 years experience
5 large Oracle Data Warehouses for prime media and internret companies, multiple data marts, ODS's and BI systems
5 large Oracle Data Warehouses for prime media and internret companies, multiple data marts, ODS's and BI systems

REVIEWS FROM CLIENTS

Yuriy's profile has been carefully vetted and approved as a Codementor. Connect with Yuriy now, and leave a review for them once you're done!
EMPLOYMENTS
Principal Consultant, Co-founder
Crowd Consulting LLC
2016-04-01-Present
Multiple project in the field of data warehousing and big data enginnering. Business development, team augmentation, mentoring, pre- and ...
Multiple project in the field of data warehousing and big data enginnering. Business development, team augmentation, mentoring, pre- and post-sales solution architecture, engineering and support
Python
SQL
Amazon RDS
View more
Python
SQL
Amazon RDS
Apache Spark
Apache Hadoop
AWS EMR
Snowflake
Hortonworks Data Platform
Apache Hive
Aws rredshift
View more
Big Data Engineer
Boston Consulting Group, GAMMA (via Toptal)
2018-06-01-2019-01-01
2 contracts (via Toptal). Both clients – major pharmaceutical companies. Subcontracted by BCG GAMMA Advance Analytics and Data Scien...
2 contracts (via Toptal). Both clients – major pharmaceutical companies. Subcontracted by BCG GAMMA Advance Analytics and Data Science division to provide engineering support for BCG’s data scientists on DMP and personalization projects. Mostly Feature Engineering and ETL but also devops tasks: Python utilities, Airflow installations and Airflow administration Python scripting, Spark to Excel Python scripts, other devops tasks. Designed and build dynamic S3-to-S3 RDS-driven (metadata in Postgres) ETL system in Spark/Hive. AWS Glue is used for Hive metastore, Athena for querying and Airflow for scheduling. ETL system build based on modern Data Warehousing best practices. Documented the system and provided training. Designed and build Feature Engineering S3 Data Mart and multi-layered S3 Customer-360 Data Lake. Proposed and enforced development standards, provided documentation, data validation procedures, operational support and maintenance guidelines.
Python
Pandas
Apache Spark
View more
Python
Pandas
Apache Spark
Airflow
Hdp
Athena
Apache Hive
Glue
View more
VP Data
Enervee
2017-10-01-2018-05-01
Manage data team and built analytical system. Build AWS S3 Data Lake. Loaded Segment.com data from parsed Segment logs bypassing Seg...
Manage data team and built analytical system. Build AWS S3 Data Lake. Loaded Segment.com data from parsed Segment logs bypassing Segment Warehouse. Validated data against Segment Warehouse in AWS Redshift. Loaded PostgreSQL product catalog data to the Data Lake what allowed to develop site behavior marketing BI reporting and ML predictive analytics. Designed a framework for Enterprise Data Warehouse.
Python
Django
MySQL
View more
Python
Django
MySQL
PostgreSQL
Ansible
Amazon RDS
Segment
Amazon Redshift
AWS EMR
Airflow
View more
PROJECTS
Big Data Platform
Mobivity
2016
12-month contract (via Crowd Consulting). Design and deployment of full Big Data analytical ecosystem including Data Lake, Data Warehouse...
12-month contract (via Crowd Consulting). Design and deployment of full Big Data analytical ecosystem including Data Lake, Data Warehouse, Reporting Data Mart, Analytical Data Mart, BI Reporting System and comprehensive ETL system. Coded one of two subject areas completely, set up development methodology and standards. Personnel training on Big Data. Data validation, cleansing and governance. Technology selection and client’s team augmentation
Python
PostgreSQL
Lambda
View more
Python
PostgreSQL
Lambda
Rds
ETL
Amazon Redshift
Luigi
Emr
Presto
View more