Mani K.

Data Engineer

Mani is a highly experienced Data Engineer with over nine years of commercial expertise, including seven years dedicated to data pipeline design and ETL development. Proficient in Python, SQL, AWS, and big data technologies, he specializes in building enterprise data warehouses, automating data validation processes, and implementing robust data governance strategies.

His key achievements include developing a Change Data Capture (CDC) solution in AWS Glue for seamless Oracle-to-S3 synchronization, migrating GCP Data Prep workflows to BigQuery using stored procedures, and optimizing costly Alteryx workflows by implementing scalable PySpark solutions. These contributions have significantly improved efficiency and reduced operational costs.

With a strong technical background and a strategic approach to data engineering, Mani consistently delivers high-impact solutions that enhance data reliability, scalability, and cost-effectiveness.

Core skills
  • Apache Airflow 5 years
  • CSV 7 years
  • Redshift 5 years
Other skills
  • BigQuery 8 years
  • QA 3 years
  • Tableau 2 years

United States

Selected experience

Employment

  • Sr Data Engineer

    Samach Innovations LLC - 9 years 6 months

    ● Designed, implemented, and maintained data integration and ETL (Extract, Transform, Load) pipelines to move and transform data between various systems and databases.

    ● Ensured data quality, consistency, and reliability by performing data validation and error handling within integration workflows.

    ● Created and managed workflows using Apache Airflow, scheduling and orchestrating data integration tasks, ensuring timely execution, and monitoring job statuses.

    ● Customized and optimized Airflow DAGs (Directed Acyclic Graphs), Apache Ni to meet specific data pipeline requirements.

    ● Utilized Snowflake data warehousing platform for data storage and processing, including creating and managing databases, schemas, and tables.

    ● Worked on Snowflake stages for data ingestion and developed efficient data loading strategies.

    ● Developed and optimized complex SQL queries to extract, transform, and analyze data from databases.

    ● Implemented performance tuning techniques to improve query execution times and optimize database performance.

    ● Wrote Python scripts and code to automate data extraction, transformation, and loading processes.

    ● Developed custom Python and Scala functions and modules to perform data manipulations and transformations.

    ● Managed data transfers between systems via SFTP and API integrations.

    ● Dealt with various data file formats, including Parquet, CSV, SAV, and GZ, ensuring compatibility and efficient processing.

    ● Conducted data type conversions and transformations to prepare data for analysis and reporting.

    ● Ensured compatibility between data types used in source and target systems.

    Technologies:

    • Apache Airflow
    • BigQuery
    • CSV
    • Redshift
    • AWS Athena
    • AWS Glue
    • Python
    • SQL
    • Snowflake
    • Data Engineering
    • Database testing
    • VSCode

Education

  • MSc. Master of Computer Application

    AKT (UP Technical University), Noida, India · 2007 - 2010
