Luiz R.

Luiz R.

Data Engineering

Brazil
Vertrouwd lid sinds 2023
8 jaar ervaring

Luiz has worked in various industries, including retail, eCommerce, financial operations, sales, and real estate in Brazil.

His expertise includes text processing, sentiment analysis, and infrastructure design. Luiz has optimized ETLs, improved queries, and PySpark code, and implemented table partitioning. He has a proven track record of implementing innovative solutions, such as a StableDiffusion-based application for image enhancement and an XGBoost model for predicting buyer visit-scheduling probability.

Hoofd expertise

Apache SparkApache Spark4 jaar
Data Engineering4 jaar
DatabricksDatabricks1 jaar
ETLETL5 jaar
11+

Ervaring4

Loft

Data Engineer | Data Scientist

Loft
Real Estate
Feb 2022 - Mar 2023 · 1y 1m
  • Refined the team’s entire code structure, improving performance and code/ML reproducibility.
  • Improved several ETLs by optimizing queries, PySpark code, and table partitioning.
  • Worked with other engineering teams to map and define multiple improvement points for the entire Data Chapter.
  • Researched and implemented a StableDiffusion-based application to "decorate" apartment photos.
  • Created an XGBoost-based model to predict buyers’ visit-scheduling probability over time.
  • Created and maintained multiple interactive data-viz prototypes using Streamlit and ML models for the Business Intelligence area.
  • Defined and deployed multiple model APIs to production through a SageMaker layer alongside the MLOps team.
Data Science
PandasPandas
DataDogDataDog
Scikit-learnScikit-learn
Machine LearningMachine Learning
4+
Dextra Consulting

Data Engineer | Data Scientist

Dextra Consulting
Information Technology (IT) and Services
Dec 2020 - Feb 2022 · 1y 2m
  • Created text-processing PySpark jobs for multiple client pipelines on a lakehouse architecture.
  • Designed and implemented a highly available infrastructure for a speech-to-text and text-processing engagement using GCP (Dataproc, R-MIG, Compute Engine, Firebase, Cloud Function, Build, and Run).
  • Supported and developed machine learning models for sentiment analysis.
  • Contributed to the creation of several PySpark jobs for multiple text-processing pipelines for different clients on a lakehouse architecture.
PandasPandas
Scikit-learnScikit-learn
Samsung Electronics

Data Engineer

Samsung Electronics
Manufacturing
Jan 2020 - Sep 2020 · 8m
  • Automated data extraction procedures using a distributed architecture with Apache Airflow and Docker.
  • Created Spark jobs for data indexing and logging of metadata, including relational database architecture and modeling for event logging.
  • Automated procedures and ETLs using Bash, Python, and Docker.
  • Developed database-interface APIs using Node.js.
PandasPandas
Grupo SOMA

Data Scientist

Grupo SOMA
Fashion and Apparel
Jul 2019 - Dec 2019 · 5m
  • Implement a cluster using Kubernetes (GKE) to orchestrate dockerized ETLs in Python.

  • Creation of a scalable development platform used for several engagements using Jupyter Hub hosted in a cluster with Kubernetes.

  • Development of a workshop on data science, machine learning models, and their applications.

  • Implement a supervised model to predict sales of new products based on historical data and visual characteristics, with direct development on the whole machine learning chain, from data processing to serving the model through a REST API using Flask.

  • Establishment of a workflow to analyze and compare machine learning models using MLFlow.

Data Science
PandasPandas
Scikit-learnScikit-learn
Machine LearningMachine Learning

Beoordeling

Uitmuntendheid in techniek

Luiz algemene prestaties in een 90-minuten durende technische beoordeling zijn in de top 5% van de gescreende Data Engineering bij Proxify.

Educatie

FUO
Federal University Of Rio de Janeiro
Electrical Engineering2017 - 2019
FUO
Federal University Of Rio de Janeiro
Electronics and Computer Engineering2013 - 2019

Stop met browsen.
Word sneller gekoppeld.