Luiz R.

Data Engineering

Luiz is a highly skilled Data Engineer with five years of commercial experience. He specializes in data engineering and machine learning, with a strong focus on code optimization and reproducibility.

Luiz has worked in various industries, including retail, eCommerce, financial operations, sales, and real estate in Brazil.

His expertise includes text processing, sentiment analysis, and infrastructure design. Luiz has optimized ETLs, improved queries, and PySpark code, and implemented table partitioning. He has a proven track record of implementing innovative solutions, such as a StableDiffusion-based application for image enhancement and an XGBoost model for predicting buyer visit-scheduling probability.

Tärkein asiantuntemus
  • Apache Spark
    Apache Spark 4 vuotta
  • Data Engineering 4 vuotta
  • Databricks
    Databricks 1 vuotta
Muut taidot
  • Scala
    Scala 1 vuotta
Luiz
Luiz R.

Brazil

Aloita tästä

Valittu kokemus

Työllisyys

  • Data Engineer | Data Scientist

    Loft - 1 year 1 month

    • Refined the team’s entire code structure, improving performance and code/ML reproducibility.
    • Improved several ETLs by optimizing queries, PySpark code, and table partitioning.
    • Worked with other engineering teams to map and define multiple improvement points for the entire Data Chapter.
    • Researched and implemented a StableDiffusion-based application to "decorate" apartment photos.
    • Created an XGBoost-based model to predict buyers’ visit-scheduling probability over time.
    • Created and maintained multiple interactive data-viz prototypes using Streamlit and ML models for the Business Intelligence area.
    • Defined and deployed multiple model APIs to production through a SageMaker layer alongside the MLOps team.

    Tekniikat:

    • Tekniikat:
    • Apache Spark Apache Spark
    • Databricks Databricks
    • ETL ETL
    • Python Python
    • SQL SQL
    • AWS AWS
    • Git Git
    • Agile Agile
    • FastAPI FastAPI
    • Pandas Pandas
  • Data Engineer | Data Scientist

    Dextra Consulting - 1 year 2 months

    • Created text-processing PySpark jobs for multiple client pipelines on a lakehouse architecture.
    • Designed and implemented a highly available infrastructure for a speech-to-text and text-processing engagement using GCP (Dataproc, R-MIG, Compute Engine, Firebase, Cloud Function, Build, and Run).
    • Supported and developed machine learning models for sentiment analysis.
    • Contributed to the creation of several PySpark jobs for multiple text-processing pipelines for different clients on a lakehouse architecture.

    Tekniikat:

    • Tekniikat:
    • Apache Spark Apache Spark
    • Data Engineering
    • Python Python
    • Google Cloud Google Cloud
    • AWS AWS
    • Git Git
    • Pandas Pandas
  • Data Engineer

    Samsung Electronics - 8 months

    • Automated data extraction procedures using a distributed architecture with Apache Airflow and Docker.
    • Created Spark jobs for data indexing and logging of metadata, including relational database architecture and modeling for event logging.
    • Automated procedures and ETLs using Bash, Python, and Docker.
    • Developed database-interface APIs using Node.js.

    Tekniikat:

    • Tekniikat:
    • Apache Spark Apache Spark
    • Python Python
    • Agile Agile
    • Pandas Pandas

Koulutus

  • MSc.Electrical Engineering

    Federal University Of Rio de Janeiro · 2017 - 2019

  • BSc.Electronics and Computer Engineering

    Federal University Of Rio de Janeiro · 2013 - 2019

Löydä seuraava kehittäjäsi päivien, ei kuukausien sisällä

Kun otat yhteyttä, järjestämme lyhyen 25 minuuttia kestävän tapaamisen, jonka aikana:

  • Kartoitamme yrityksenne kehitystarvetta
  • Kertoa prosessimme, jolla löydämme teille pätevän, ennakkotarkastetun kehittäjän verkostostamme
  • Käymme läpi askeleet, joilla oikea ehdokas pääsee aloittamaan – useimmiten viikon sisällä

Keskustele kanssamme