Himanshu S.
Data Engineer
Himanshu är en Data Engineer med gedigen erfarenhet och hög kompetens inom SQL, Snowflake och AWS. Han har arbetat inom flera olika branscher, däribland hälsovård, detaljhandel, fordonsindustrin och finans.
Under de senaste fem åren har Himanshu finslipat sina färdigheter och etablerat sig som en Full-stack Data Consultant med djupgående expertis inom både maskininlärning och data science.
Under sin tid på KnowledgeFoundry och ZS Associates gjorde Himanshu betydande bidrag till de tekniska teamen på båda företagen. Hans omfattande kompetens och starka engagemang har etablerat honom som en pålitlig och respekterad utvecklare inom data engineering.
Huvudsaklig expertis
- OpenCV 4 år
- Linux 5 år
- LangChain 2 år
Andra kompetenser
- Docker 3 år
- FastAPI 2 år
- ChatGPT API 2 år
Utvald erfarenhet
Anställningar
Data Engineer
InfoGain - 10 månader
- Created a Data Warehouse solution utilizing AWS Redshift and AWS Glue, migrating an OLAP database from MS SQL Server.
- Established a DBT pipeline for ETL processes, transferring data between a MySQL warehouse and an activity database to a Neo4j graph database using native Python programming. The setup was implemented on an AWS Linux box with Neo4j running as a Docker container.
- Developed an ETL pipeline for conducting market basket analysis and other marketing statistics on millions of rows of transactional data. Utilized Redshift as a transactional database and populated it in a serverless fashion using Amazon Lambda functions in real time.
Teknologier:
- Teknologier:
- Python
- ETL
- Data Engineering
- AWS
Data Engineer
ZS Associates - 6 månader
- Developed a pipeline to convert data into a structured format, enabling serving to Prodigy for ML-related tagging. The entire pipeline was constructed in a modular fashion using pure Python and shell scripting.
- Implemented data transformations in Python and stored the processed data in an Amazon S3 bucket for storage and accessibility.
Teknologier:
- Teknologier:
- Python
Data Engineer
KnowledgeFoundry - 5 år 5 månader
- Automated the process of writing Hive queries for ETL of multiple tables (both one-time and incremental) by generating automated scripts.
- Read CSV files from folder locations and created tables, then performed incremental loads sequentially.
- Set up Snowflake as the primary storage solution for structured data and utilized DBT for ETL processes. Crafted SQL-based models to define transformation logic, ensuring flexibility with incremental loading and version control using DBT.
- Prepared transformed data for analysis using business intelligence tools, facilitating effortless insights discovery. Conducted regular checks in Snowflake and DBT to maintain data integrity and pipeline functionality.
- Designed and developed data pipelines to extract, transform, and load data from diverse sources into a centralized data warehouse.
Teknologier:
- Teknologier:
- ETL
- SQL
- Data Engineering
Utbildning
BSc.Information Technology
Dharmsinh Desai University · 2015 - 2019
Hitta din nästa utvecklare inom ett par dagar
Ge oss 25 minuter av din tid, så kommer vi att:
- Sätta oss in i dina utmaningar och behov
- Berätta om våra seniora och beprövade utvecklare
- Förklara hur vi kan matcha dig med precis rätt utvecklare