Caio M.

Caio M.

Data Engineer

Brazil
Betrodd medlem siden 2023
6 år erfaring

I tillegg påtok Caio seg en sentral rolle i Nubank, hvor han ledet et team på 200 fagfolk, og styrte initiativet Data Governance and Cost Accountability. Dette initiativet økte innsynet i prosjekt- og oppgaverelaterte kostnader betydelig, til fordel for alle involverte teammedlemmer.

I løpet av sin periode i Accenture viste Caio eksepsjonelt engasjement da han jobbet flittig med å implementere GCP Data-løsninger i kundenes infrastruktur. Denne innsatsen hadde stor innvirkning, og effektiviserte datainnsamlingsprosessen betydelig fra ulike kilder og reduserte tiden som tidligere ble brukt på manuell datainnsamling.

Hovedekspertise

PythonPython6 år
SQLSQL6 år
ETLETL5 år
Apache SparkApache Spark4 år
79+

Erfaring9

Data Engineer

Proxify AB
Data Analytics
Aug 2023 - Jan 2024 · 5m
  • Working In Multiple Clients, providing Data Engineering & Analytics consultancy and development

  • Automating Web Page Navigation and Scraping using Playwright and BeautifulSoup

  • Providing Data Analytics and Data Modelling solutions with multiple frameworks

  • Implementing ETL jobs to integrate data from different sources into Data Warehouses or Data Lakes

HTMLHTML
Azure Blob storageAzure Blob storage
TensorFlowTensorFlow
NumPyNumPy
OpenCVOpenCV
34+
Nubank

Data Engineer

Nubank
Jun 2022 · 3y 9m
  • Reformerte datainnsamlingsarkitekturen for sosiale medier, og reduserte beregningstiden og -kostnadene med over 90 %.
  • Ledet et initiativ for datastyring og kostnadsansvar i et team på 200 personer, og forbedret kostnadssynlighet knyttet til prosjekter og oppgaver.
Nubank

Analytics Engineer

Nubank
Data Analytics
Jun 2022 · 3y 9m
  • Spearheaded the reformulation of social media data collection architecture, achieving a remarkable reduction of computing time and costs by over 90%;

  • Integrated a new pipeline with the company’s data lake, enabling universal access to Social Media datasets and fostering collaboration;

  • Led a Data Governance and Cost Accountability initiative within a team of 200 members, enhancing transparency and providing visibility into costs associated with projects and tasks;

  • Delivering meaningful data and insights empowered the team to concentrate on content analytics and performance, facilitating informed decision-making and optimizing workflow efficiency.

HTMLHTML
ScalaScala
Azure Blob storageAzure Blob storage
Data Science
Azure Data FactoryAzure Data Factory
46+
ClearSale

Data Engineer

ClearSale
Mar 2021 - May 2022 · 1y 2m
  • Assisterte med å skalere opp et nytt Biometri-produkt ved å definere ytelsesmålinger og overvåke applikasjonsatferd.
  • Identifiserte datainnsamlingshull, noe som muliggjør sanntids problembevissthet i teamet.
  • Så en tidoblet økning i klientadopsjon, med månedlige innkommende forespørsler som steg fra noen få tusen til millioner.
ClearSale

Product Intelligence

ClearSale
Data Analytics
Mar 2021 - May 2022 · 1y 2m
  • Played a key role in scaling up a new Biometry product by defining metrics for performance evaluation and monitoring the application’s behavior;

  • Identified gaps in data collection processes, enabling the team to address issues in near real-time and enhance overall data quality;

  • Collaborated with Product and Sales teams to reformulate the sales pitch, emphasizing improvements driven by data insights;

  • Successfully contributed to a tenfold increase in client adoption and a significant surge in monthly incoming requests, from a few thousands to millions, showcasing the product's enhanced value proposition.

HTMLHTML
OracleOracle
ScalaScala
Azure Blob storageAzure Blob storage
Data Science
42+

Data Engineer

Accenture Brazil
Sep 2020 - Feb 2021 · 5m
  • Jobbet med å implementere GCP dataløsninger på kundenes infrastruktur;
  • Utviklet en løsning for å samle data fra 10+ datakilder for å lage et datavarehus, noe som sparer tid brukt på manuell innsamling av data fra dem;
  • Bidro til å redusere kostnadene fra tredjepartskilder med 50 % ved å bruke rådataene til å utvikle vår egen Data Viz, og kansellerte redundante Analytics-kontrakter.

Data & AI

Accenture Brazil
Artificial Intelligence (AI)
Sep 2020 - Feb 2021 · 5m
  • Contributed to the implementation of GCP Data solutions on clients’ infrastructure, enhancing data management capabilities and optimizing workflow efficiency;

  • Designed and implemented a solution to aggregate data from over 10 sources to establish a centralized Data Warehouse, significantly reducing manual data collection efforts and streamlining data processing workflows;

  • Played a key role in cost reduction initiatives by leveraging raw data to develop in-house Data Visualization tools, resulting in a 50% reduction in costs associated with third-party sources and the cancellation of redundant Analytics contracts.

OracleOracle
Azure Blob storageAzure Blob storage
Data Science
Azure Data FactoryAzure Data Factory
TensorFlowTensorFlow
38+

Data Scientist

Netshoes Brazil
Data Analytics
Oct 2018 - Sep 2019 · 11m
  • Collaborated closely with the Marketing department to optimize the targeting of advertisements and mail campaigns to customers, enhancing their effectiveness;

  • Conceptualized and implemented a source of truth dataset for Customers’ data, leading to an increase in the frequency of model training and improving overall data quality;

  • Leveraged more up-to-date analysis to drive a daily increase of R$50k in gross income by refining the targeting of mailings and advertisements, thereby maximizing revenue generation efforts.

OracleOracle
Azure Blob storageAzure Blob storage
Data Science
Azure Data FactoryAzure Data Factory
TensorFlowTensorFlow
32+

Presales Architect

T-Systems Brazil
Data Analytics
Jul 2018 - Sep 2018 · 2m
  • Assisted in managing the team by providing insights to understand the team's performance, facilitating informed decision-making and strategic planning;

  • Developed visualizations to analyze and identify clients requiring more attention, enabling proactive engagement and relationship management;

  • Utilized gathered insights to optimize resource allocation and prioritize efforts towards proposals with higher success probabilities, resulting in improved efficiency and effectiveness in client interactions.

NumPyNumPy
PandasPandas
R (programming language)R (programming language)
Open sourceOpen source
LaTeXLaTeX
9+

Utdannelse

U-U
USP - Universitetet i São Paulo
Informasjonssystemer2017 - 2020

Slutt å bla.
Bli matchet raskere.