NYHET
Proxify er åpen om utviklerens prestasjoner — det er bransjeledende, og også enhver CTOs drøm.
Finn ut mer
Caio M.
Data Engineer
Caio er en allsidig dataprofesjonell med over fem års erfaring fra programvare- og datateknikk, datavitenskap og analyse.
I tillegg påtok Caio seg en sentral rolle i Nubank, hvor han ledet et team på 200 fagfolk, og styrte initiativet Data Governance and Cost Accountability. Dette initiativet økte innsynet i prosjekt- og oppgaverelaterte kostnader betydelig, til fordel for alle involverte teammedlemmer.
I løpet av sin periode i Accenture viste Caio eksepsjonelt engasjement da han jobbet flittig med å implementere GCP Data-løsninger i kundenes infrastruktur. Denne innsatsen hadde stor innvirkning, og effektiviserte datainnsamlingsprosessen betydelig fra ulike kilder og reduserte tiden som tidligere ble brukt på manuell datainnsamling.
Hovedekspertise
- Python 6 år

- SQL 6 år

- ETL 5 år

Andre kunnskaper
- OAuth2 4 år

- GraphQL 4 år

- PowerShell 3 år

Utvalgt opplevelse
Arbeidserfaring
Data Engineer
Proxify AB - 5 months
-
Working In Multiple Clients, providing Data Engineering & Analytics consultancy and development
-
Automating Web Page Navigation and Scraping using Playwright and BeautifulSoup
-
Providing Data Analytics and Data Modelling solutions with multiple frameworks
-
Implementing ETL jobs to integrate data from different sources into Data Warehouses or Data Lakes
Teknologier:
- Teknologier:
HTML
Azure Blob storage
TensorFlow
NumPy
OpenCV
XGBoost
Keras
Pandas
Open source
LaTeX
PyCharm
BigQuery
- CSV
OAuth2
- Command-line interface
Unix
VSCode
SciPy
Scikit-learn
- ELT
Matplotlib
- Data Analytics
Azure Synapse
- Recurrent neural network
PL/SQL
XML
- NLP
Machine Learning
BeautifulSoup
SQLAlchemy
Tableau
Plotly
- Dimensional modeling
- Fact Data Modeling
Redshift
dbt
- Prompt Engineering
LangChain
Looker
-
Data Engineer
Nubank - 3 years 5 months
- Reformerte datainnsamlingsarkitekturen for sosiale medier, og reduserte beregningstiden og -kostnadene med over 90 %.
- Ledet et initiativ for datastyring og kostnadsansvar i et team på 200 personer, og forbedret kostnadssynlighet knyttet til prosjekter og oppgaver.
Analytics Engineer
Nubank - 3 years 5 months
-
Spearheaded the reformulation of social media data collection architecture, achieving a remarkable reduction of computing time and costs by over 90%;
-
Integrated a new pipeline with the company’s data lake, enabling universal access to Social Media datasets and fostering collaboration;
-
Led a Data Governance and Cost Accountability initiative within a team of 200 members, enhancing transparency and providing visibility into costs associated with projects and tasks;
-
Delivering meaningful data and insights empowered the team to concentrate on content analytics and performance, facilitating informed decision-making and optimizing workflow efficiency.
Teknologier:
- Teknologier:
HTML
Scala
Azure Blob storage
- Data Science
Azure Data Factory
TensorFlow
NumPy
OpenCV
XGBoost
Keras
Pandas
ClojureScript
R (programming language)
Open source
LaTeX
PyTorch
PyCharm
BigQuery
- CSV
OAuth2
- Command-line interface
Unix
VSCode
SciPy
Scikit-learn
- ELT
Matplotlib
- Data Analytics
Azure Synapse
Random Forest
- PCA
Convolutional neural network
- Recurrent neural network
PL/SQL
XML
- NLP
Machine Learning
Cuda
BeautifulSoup
SQLAlchemy
Tableau
Clojure
Plotly
- Dimensional modeling
- Fact Data Modeling
Redshift
dbt
- Prompt Engineering
LangChain
Looker
Dataflow
-
Data Engineer
ClearSale - 1 year 2 months
- Assisterte med å skalere opp et nytt Biometri-produkt ved å definere ytelsesmålinger og overvåke applikasjonsatferd.
- Identifiserte datainnsamlingshull, noe som muliggjør sanntids problembevissthet i teamet.
- Så en tidoblet økning i klientadopsjon, med månedlige innkommende forespørsler som steg fra noen få tusen til millioner.
Product Intelligence
ClearSale - 1 year 2 months
-
Played a key role in scaling up a new Biometry product by defining metrics for performance evaluation and monitoring the application’s behavior;
-
Identified gaps in data collection processes, enabling the team to address issues in near real-time and enhance overall data quality;
-
Collaborated with Product and Sales teams to reformulate the sales pitch, emphasizing improvements driven by data insights;
-
Successfully contributed to a tenfold increase in client adoption and a significant surge in monthly incoming requests, from a few thousands to millions, showcasing the product's enhanced value proposition.
Teknologier:
- Teknologier:
HTML
Oracle
Scala
Azure Blob storage
- Data Science
Azure Data Factory
TensorFlow
NumPy
OpenCV
XGBoost
Keras
Pandas
R (programming language)
Open source
LaTeX
PyCharm
BigQuery
- CSV
OAuth2
- Command-line interface
Unix
VSCode
SciPy
Scikit-learn
- ELT
Matplotlib
- Data Analytics
Azure Synapse
Random Forest
- PCA
Convolutional neural network
- Recurrent neural network
PL/SQL
XML
- NLP
Machine Learning
- Computer Vision
Cuda
BeautifulSoup
SQLAlchemy
Tableau
Plotly
- Dimensional modeling
- Fact Data Modeling
Redshift
Looker
Dataflow
-
Data Engineer
Accenture Brazil - 5 months
- Jobbet med å implementere GCP dataløsninger på kundenes infrastruktur;
- Utviklet en løsning for å samle data fra 10+ datakilder for å lage et datavarehus, noe som sparer tid brukt på manuell innsamling av data fra dem;
- Bidro til å redusere kostnadene fra tredjepartskilder med 50 % ved å bruke rådataene til å utvikle vår egen Data Viz, og kansellerte redundante Analytics-kontrakter.
Data & AI
Accenture Brazil - 5 months
-
Contributed to the implementation of GCP Data solutions on clients’ infrastructure, enhancing data management capabilities and optimizing workflow efficiency;
-
Designed and implemented a solution to aggregate data from over 10 sources to establish a centralized Data Warehouse, significantly reducing manual data collection efforts and streamlining data processing workflows;
-
Played a key role in cost reduction initiatives by leveraging raw data to develop in-house Data Visualization tools, resulting in a 50% reduction in costs associated with third-party sources and the cancellation of redundant Analytics contracts.
Teknologier:
- Teknologier:
Oracle
Azure Blob storage
- Data Science
Azure Data Factory
TensorFlow
NumPy
OpenCV
Keras
Pandas
Open source
LaTeX
PyTorch
PyCharm
BigQuery
- CSV
OAuth2
- Command-line interface
Unix
VSCode
SciPy
Scikit-learn
- ELT
Matplotlib
- Data Analytics
Azure Synapse
Random Forest
- PCA
Convolutional neural network
PL/SQL
- NLP
Machine Learning
- Computer Vision
Cuda
BeautifulSoup
SQLAlchemy
Tableau
Plotly
- Dimensional modeling
- Fact Data Modeling
Redshift
Talend
Looker
Dataflow
-
Data Scientist
Netshoes Brazil - 11 months
-
Collaborated closely with the Marketing department to optimize the targeting of advertisements and mail campaigns to customers, enhancing their effectiveness;
-
Conceptualized and implemented a source of truth dataset for Customers’ data, leading to an increase in the frequency of model training and improving overall data quality;
-
Leveraged more up-to-date analysis to drive a daily increase of R$50k in gross income by refining the targeting of mailings and advertisements, thereby maximizing revenue generation efforts.
Teknologier:
- Teknologier:
Oracle
Azure Blob storage
- Data Science
Azure Data Factory
TensorFlow
NumPy
OpenCV
Keras
Pandas
R (programming language)
Open source
LaTeX
PyTorch
PyCharm
BigQuery
- CSV
OAuth2
- Command-line interface
Unix
SciPy
Scikit-learn
- ELT
Matplotlib
- Data Analytics
Azure Synapse
Random Forest
- PCA
Convolutional neural network
- Recurrent neural network
PL/SQL
Machine Learning
- Computer Vision
SQLAlchemy
Plotly
- Dimensional modeling
- Fact Data Modeling
Talend
-
Presales Architect
T-Systems Brazil - 2 months
-
Assisted in managing the team by providing insights to understand the team's performance, facilitating informed decision-making and strategic planning;
-
Developed visualizations to analyze and identify clients requiring more attention, enabling proactive engagement and relationship management;
-
Utilized gathered insights to optimize resource allocation and prioritize efforts towards proposals with higher success probabilities, resulting in improved efficiency and effectiveness in client interactions.
Teknologier:
- Teknologier:
NumPy
Pandas
R (programming language)
Open source
LaTeX
PyCharm
BigQuery
- CSV
Unix
SciPy
Scikit-learn
Matplotlib
- Data Analytics
PL/SQL
-
Utdannelse
BSc.Informasjonssystemer
USP - Universitetet i São Paulo · 2017 - 2020
Finn din neste utvikler innen dager, ikke måneder
I løpet av en kort 25-minutters samtale ønsker vi å:
- Forstå dine utviklingsbehov
- Forklare prosessen vår der vi matcher deg med kvalifiserte, evaluerte utviklere fra vårt nettverk
- Dele de neste stegene for å finne riktig match, ofte på mindre enn en uke
