Shubham Sannyasi

Shubham Sannyasi

Mentor
Rising Codementor
US$0.00
For every 15 mins
ABOUT ME

I am an experienced Data Engineer with a strong proficiency in PySpark, Apache Spark, SQL, Python, and Azure services. My expertise lies in designing and optimizing ETL (Extract, Transform, Load) pipelines, creating efficient data storage solutions, and enhancing data processing services. As a certified Azure professional, I have a proven track record of delivering impactful solutions that drive business success.

I am passionate about harnessing the power of data to solve complex problems and enable data-driven decision-making. With a keen eye for detail and a dedication to excellence, I consistently strive to create innovative solutions that improve data quality, accessibility, and usability.

Pacific Time (US & Canada) (-07:00)
Joined July 2023
EXPERTISE
3 years experience
6 years experience
3 years experience
3 years experience
5 years experience
2 years experience
5 years experience

REVIEWS FROM CLIENTS

Shubham's profile has been carefully vetted and approved as a Codementor. Connect with Shubham now, and leave a review for them once you're done!
EMPLOYMENTS
Data Engineer
Globant India Private Limited, Pune
2022-06-01-Present

Developed a dynamic Databricks workflow as a sole contributor, enabling trigger-based execution by backend microservices. This workflo...

Developed a dynamic Databricks workflow as a sole contributor, enabling trigger-based execution by backend microservices. This workflow autonomously generates delta tables and views, crucial for backend processes that create Excel workbooks containing fund information. The data source originates from transactional tables with fund audit financial data. The solution was built from the ground up, offering parallel processing capabilities and extensive configurability via JSON-controlled settings. Engineered agile pipelines with the ability to execute intricate calculations driven by microservices parameters. Optimized SQL queries, enhancing system performance by eliminating redundancies.

SQL
Apache Spark
View more
SQL
Apache Spark
View more
Data Engineer
Cognizant Technology Solutions India Ltd, Kolkata
2021-05-01-2022-06-01

Designed and executed ETL production pipelines using PySpark and HIVE for data extraction, decryption, reconciliation, and transformat...

Designed and executed ETL production pipelines using PySpark and HIVE for data extraction, decryption, reconciliation, and transformation. Developed Databricks framework for downstream predictive analytics, enabling easy access to data as HIVE tables. Enhanced PySpark notebooks to achieve 40% reduction in runtime by parallelizing data loads. Delivered adhoc notebooks for real-time data manipulation needs. Established historical data ingestion workflow and incremental orchestration for daily loads.

Apache Spark
View more
Apache Spark
View more
Data Engineer
Capgemini India Pvt limited, Mumbai
2017-12-01-2021-03-01

Orchestrated Azure Data Factory (ADF) pipelines for seamless on-premises to cloud (ADLS) data migration. Designed ADF data wrangling b...

Orchestrated Azure Data Factory (ADF) pipelines for seamless on-premises to cloud (ADLS) data migration. Designed ADF data wrangling based Incremental pipeline (SCD2), replacing legacy SSIS jobs. Leveraged Azure DevOps for efficient ADF pipeline deployment across higher environments. Developed, enhanced, and maintained ETL packages using SSIS, ensuring client requirements were met. Investigated and resolved data discrepancies within strict SLAs, collaborating with cross-functional teams. Interacted with client stakeholders to gather requirements and craft technical specifications.

SQL
View more
SQL
View more