Oscar C.

Data Engineer

Oscar er en høyt spesialisert Senior Dataingeniør med 13 års forretningsbakgrunn. Han har jobbet i forskjellige bransjer som AdTech, FinTech, HealthTech, og Enterprise Software, og har demonstrert sin ekspertise på tvers av ulike domener.

Oscar har opparbeidet seg verdifull erfaring ved å jobbe både i USA og Nederland. Hans tekniske ferdigheter inkluderer å anvende Golang, Python, BigQuery, Apache Spark on Databricks (AWS), og Scala for å lage robuste programvaresystemer.

En av Oscars stolteste prestasjoner var å utvikle en patentert idé med USPTO, som han med suksess tok til markedet. Dette prosjektet viser hans innovasjon og evne til å bygge bro mellom konsept og kommersialisering.

I tillegg til sin tekniske ekspertise har Oscar vist eksepsjonelle evner til å lede team gjennom hele karrieren, noe som ytterligere fremhever hans evne til å levere resultater av høy kvalitet i komplekse prosjekter.

Hovedekspertise

  • Apache Spark
    Apache Spark 10 år
  • AWS
    AWS 10 år
  • BigQuery
    BigQuery 5 år

Andre kunnskaper

  • MySQL
    MySQL 13 år
  • PostgreSQL
    PostgreSQL 13 år
  • ETL
    ETL 11 år
Oscar

Oscar C.

Guatemala

Kom i gang

Utvalgt opplevelse

Arbeidserfaring

  • Senior MLOps Engineer

    Sago Mini - 2 months

    • Promoting Machine Learning models to Production on GCP

    Teknologier:

    • Teknologier:
    • Python Python
    • Vertex AI Vertex AI
  • Tech Lead / MLOps & Optimization Platform

    Occidental Petroleum (Oxy) - 4 months

    Summary: Led design and delivery of Oxy’s Optimization Pillar MLOps platform in AWS ● Designed and implemented MLOps platform integrating AWS (S3, ECS/Fargate, SageMaker, Lambda) with Oxy’s ODAP Lakehouse ● Packaged and deployed Python optimization models (Gurobi/Pyomo) with CI/CD pipelines in Azure DevOps + MLflow ● Built PySpark ingestion pipelines from Kabal APIs, SQL Server, and PI systems into ODAP, ensuring governance and schema validation ● Collaborated with data scientists and IT to enable predictive maintenance and vessel scheduling optimization use cases ● Mentored engineers and defined role skill matrices across MLOps, DevOps, Backend, and QA

    Teknologier:

    • Teknologier:
    • AWS AWS
    • Python Python
    • Machine Learning Machine Learning
  • Senior Backend Developer

    Reddit - 5 months

    Summary: Backend development in Golang and Python ● Developed new integrations with Notification Platform to send emails to 300k+ users ● Implemented concurrency in email send increasing performance by 98% ● Implemented Spam filters in email send increasing performance further by 46% ● Developed client support for Business Experiences team to tap into Notification Platform, making progress towards goal of deprecating integrations with legacy Mailroom messaging. ● Code reviews and various team activities

    Teknologier:

    • Teknologier:
    • Golang Golang
    • Apache Kafka Apache Kafka
  • Senior Data Engineer

    Curinos - 8 months

    • Led data product development on the Databricks Lakehouse platform, ensuring efficient data handling and analysis;

    • Migrated data from MySQL and PostgreSQL databases using AWS Database Migration Service (DMS) to streamline data management;

    • Developed Data Pipelines using Delta Live Tables (DLT) for real-time and batch processing of data;

    • Created a Code Generation tool to automatically generate Scala code for Databricks, enhancing development speed and accuracy;

    • Proficient in Databricks, Scala, and Python, with a strong focus on scalable data engineering solutions.

    Teknologier:

    • Teknologier:
    • MySQL MySQL
    • PostgreSQL PostgreSQL
    • AWS AWS
    • Databricks Databricks
    • Python Python
    • Scala Scala
    • Data Engineering
  • Senior Data Engineer

    Clevertech - 2 years 11 months

    • Developed a Reporting API for analyzing large-scale advertising campaigns (Golang, BigQuery)
    • Created an Advanced Query Tool in Golang for complex SQL queries, reducing processing time by 50%
    • Implemented Data Modeling for forecasting TV Ads performance to extrapolate impressions, increasing revenue by 20%
    • Debugged and improved complex queries in BigQuery, reducing overall query complexity
    • Enhanced collaboration with the Data Science team by serving as an interface with the Backend team

    Teknologier:

    • Teknologier:
    • Golang Golang
    • SQL SQL
    • Data Engineering
    • BigQuery BigQuery
  • Co-Founder and CTO

    Sciencesheet - 1 year 8 months

    • Developed Codegen for ML pipelines (Spark, Scala, Python), accelerating data science processes by 10x
    • Invented and patented Codegen technology for processing millions of spreadsheet rows in Spark using Excel formulas
    • Launched a startup from idea to market within one year
    • Increased market reach by developing plugins for Google Sheets and Microsoft Excel
    • Successfully developed the AWS Backend using Sagemaker Autopilot, Lambda, EC2, SNS, and SES

    Teknologier:

    • Teknologier:
    • AWS AWS
    • Python Python
    • AWS Lambda AWS Lambda
    • Scala Scala
    • AWS EC2 AWS EC2
    • Hadoop Hadoop
    • Microsoft Excel Microsoft Excel
  • Data Scientist / Engineering Manager

    PayPal (Xoom) - 3 years 9 months

    • Tech Lead for Data Science and Engineering team (Spark, Scala, Python)
    • Managed a team of five data scientists and data engineers
    • Developed a Locations indexer, doubling the speed of finding bank branches in India
    • Increased market coverage for the Sendmoney product by supporting FP&A analyses in Spark instead of Excel
    • Enhanced the effective reach of push notifications by 20% through segmentation analyses in Spark

    Teknologier:

    • Teknologier:
    • Apache Spark Apache Spark
    • Python Python
    • Scala Scala
    • Data Engineering
    • Team Leading
    • Microsoft Excel Microsoft Excel
  • Cloud Engineer

    Mendix - 3 years 5 months

    • Developed the Mendix Cloud platform using a Mendix code generation tool, streamlining the development process;

    • Architected robust security protocols for the Mendix Enterprise Cloud Platform, ensuring data protection and compliance;

    • Automated parallel firewall installation and configuration across thousands of cloud nodes, enhancing security and operational efficiency;

    • Reverse-engineered Mendix code generation to reproduce applications using the open-source WebDSL language, expanding platform versatility and open-source integration.

    Teknologier:

    • Teknologier:
    • AWS AWS
    • Python Python
    • Data Engineering
  • Summer Intern

    Google - 3 months

    • Conducted data mining on a Git repository containing 70 Apache projects, extracting valuable insights for analysis;

    • Presented the project findings at ApacheCon US in Atlanta, showcasing expertise and contributing to the open-source community.

    Teknologier:

    • Teknologier:
    • Apache Spark Apache Spark
    • Data Engineering
    • Git Git

Utdannelse

  • MSc.Computer Science

    Delft University of Technology · 2009 - 2011

  • MSc.Management and Technology

    Delft University of Technology · 2007 - 2009

  • MSc.Management and Technology

    Delft University of Technology · 2007 - 2009

  • BSc.Computer Science

    Universidad Francisco Marroquín · 1997 - 2002

Finn din neste utvikler innen dager, ikke måneder

I løpet av en kort 25-minutters samtale ønsker vi å:

  • Forstå dine utviklingsbehov
  • Forklare prosessen vår der vi matcher deg med kvalifiserte, evaluerte utviklere fra vårt nettverk
  • Dele de neste stegene for å finne riktig match, ofte på mindre enn en uke

La oss ta en prat