NEW
Proxify is bringing transparency to tech team performance based on research conducted at Stanford. An industry first, built for engineering leaders.
Learn more
Felipe A.
Data Scientist
Felipe is a highly skilled Data Scientist with over seven years of experience across fintech, proptech, edtech, and consultancy. He combines strong technical expertise in machine learning with the ability to effectively communicate complex concepts to stakeholders.
His technical proficiency includes working with advanced Data Science and ML tools such as Snowflake, dbt, Airflow, and MLflow. A career highlight was his role at Cambridge University, where he developed and taught an advanced online data science course, showcasing both his subject-matter expertise and ability to simplify complex topics. Additionally, at Outra, he played a key role in securing a multi-million-dollar contract with Zoopla.
Felipe’s unique blend of deep technical knowledge and strong communication skills positions him as a standout professional in the field of Data Science.
Main expertise
- Pytest 2 years

- AWS 3 years
- Bash 4 years

Other skills
- Agile 4 years

- PyTorch 2 years
- Asana 1 years

Selected experience
Employment
Lead Data Scientist
Rylee - 1 year
- Worked at Rylee, an e-commerce platform that provided customers with product insights and market analysis to improve strategies on Bol.com and Amazon.
- Earned Databricks certification in Generative AI, demonstrating expertise with RAG and Agent models.
- Developed a large-scale sales forecasting model using Rylee’s internal data and scraped data from Bol.com to predict product sales and identify bestsellers.
- Built an API with Flask and AWS Lambda to handle product queries, delivering insights and sales forecasts.
- Designed and implemented ETL pipelines using dbt, Airflow, and Spark to automate feature engineering, including an asynchronous solution for efficient data retrieval from the Bol.com API while respecting rate limits and parallelizing at scale; also automated data reconciliation across multiple sellers.
- Applied PyTorch and Spark to optimize high-performance machine learning models.
Technologies:
- Technologies:
MySQL
AWS
Databricks
Apache Spark
Django
Flask
Python
SQL
- Data Science
Digital Ocean
TensorFlow
NumPy
XGBoost
Pandas
- Data Engineering
Scrum
Git
Scikit-learn
- ELT
- Data Analytics
Random Forest
- Clustering
- SVM
- PCA
Convolutional neural network
- Transformer Network
- Data Modeling
ETL
- NLP
Machine Learning
FastAPI
OpenAI API
- Prompt Engineering
Large Language Models (LLM)
Hugging Face Transformers
- MLflow
- Pipeline optimization
Lead Data Scientist
Homemove - 3 months
- Worked at Homemove, a comprehensive platform offering moving-related services, including surveys, removals, and mortgages, integrated within a single app.
- Developed an LLM-powered negotiation tool that allowed users to obtain quotes and negotiate prices via an AI chatbot, automatically added customers to the CRM, and alerted the sales team upon successful negotiation, using OpenAI Assistant and GPT models.
- Led a scalable data transformation initiative, leveraging Snowflake for cloud data warehousing and Sigma for BI and visualization.
- Designed and implemented ETL pipelines from scratch using Snowflake, Python, SQL, dbt, and Airflow to automate data ingestion and transformation.
- Built a predictive modeling solution to reduce marketing costs and improve targeting by identifying high-potential home movers.
- Applied PyTorch and Snowpark for advanced machine learning to optimize high-performance models.
- Delivered a predictive model that was planned for use in attracting investment during Homemove's Series A funding.
Technologies:
- Technologies:
MySQL
AWS
ElasticSearch
Databricks
Apache Spark
Django
Flask
Python
SQL
AWS Lambda
AWS S3
- Data Science
- Regression testing
Digital Ocean
TensorFlow
NumPy
XGBoost
Keras
Pandas
- Data Engineering
PyTorch
PyCharm
Scrum
Git
SciPy
Scikit-learn
- ELT
Matplotlib
- Data Analytics
Random Forest
- Clustering
- SVM
- PCA
Convolutional neural network
- Recurrent neural network
- Transformer Network
Snowflake
- Data Modeling
ETL
- NLP
Machine Learning
FastAPI
SQLAlchemy
Streamlit
Pytest
Plotly
OpenAI API
- Prompt Engineering
Large Language Models (LLM)
Vertex AI
Pinecone
Hugging Face Transformers
- MLflow
- Pipeline optimization
Data Science Instructor and Course Developer
Cambridge University & FourthRev - 8 months
- Worked as a Data Science specialist at FourthRev, creating and teaching an advanced online data science course for Cambridge University students.
- Developed and delivered a comprehensive curriculum covering Neural Networks, NLP for AI, Unsupervised Learning, and advanced Decision Tree algorithms including XGBoost.
- Applied hands-on machine learning implementation from scratch and employed innovative teaching methods to enhance student engagement and learning outcomes.
- Demonstrated deep technical expertise in data science and machine learning, earning recognition from academic peers for effective teaching and curriculum design.
Technologies:
- Technologies:
- Data Science
NumPy
XGBoost
Pandas
Scikit-learn
Matplotlib
Machine Learning
Pytest
Plotly
- Neural Network
Senior Data Scientist
Outra - 2 years
- Worked at Outra, a data-driven property insight company, specializing in delivering household-level data to optimize client services.
- Migrated the platform from Dataiku to a custom in-house Intelligence Fabric using MLflow, Airflow, Snowflake, GitHub Actions, AWS, and DBT for data engineering.
- Developed two major predictive models forecasting household listing and sale/rent timelines, enabling a multi-million-pound partnership with Zoopla.
- Applied LLMs for code documentation, coding assistance, and interactive chatbots for dashboards and customer-facing data.
- Built ETL/ELT pipelines to transform raw data and prepare it for modeling.
- Created visualizations and maps using KeplerGI, Seaborn, and Dataiku to help non-technical users interpret complex data.
Technologies:
- Technologies:
AWS
Apache Spark
Django
Flask
Python
SQL
AWS Lambda
AWS S3
- Data Science
TensorFlow
NumPy
XGBoost
Keras
Pandas
- Data Engineering
PyTorch
Scrum
Git
Scikit-learn
- ELT
Apache Airflow
Matplotlib
- Data Analytics
Random Forest
- Clustering
- SVM
- PCA
Convolutional neural network
- Recurrent neural network
- Transformer Network
Snowflake
- Data Modeling
ETL
- NLP
Machine Learning
AWS EC2
FastAPI
SQLAlchemy
Streamlit
Pytest
Plotly
dbt
OpenAI API
- Prompt Engineering
- Neural Network
Large Language Models (LLM)
Vertex AI
Dataiku
Hugging Face Transformers
- MLflow
- Pipeline optimization
Senior Data Scientist
Belmont Green - 2 years 6 months
- Worked at Belmont Green, a specialist mortgage lending company turned bank, providing financial and mortgage solutions to financially affected customers.
- Created a conversion model using survival analysis techniques and managed the project from inception to production.
- Built machine learning algorithms and statistical models for time series data, focusing on retention, lifetime value, and expected loss models.
- Owned projects end-to-end, ensuring proofs of concept were implemented and deployed successfully in production.
- Implemented machine learning algorithms for cashflow, early redemption, default, pre-payment, and conversion models using Python and R.
- Applied clustering and segmentation techniques to analyze product usage and customer behavior for marketing and strategic purposes.
Technologies:
- Technologies:
- Data Science
NumPy
XGBoost
Keras
Pandas
PyTorch
Scikit-learn
Matplotlib
Machine Learning
SQLAlchemy
Plotly
- Neural Network
Data Scientist
Boster.AI - 2 years
-
Boster.Ai is a company dedicated to create No-code bots for data retrieval, monitoring and automation. Originally, they started as an IT consultancy company, creating personalized solutions to small and mid-size companies to harvest the power of Machine Learning
-
Felipe worked performing data exploration, analysis, and building Machine Learning algorithms and statistical models for several start-ups/mid-size companies in the UK and USA.
-
He built and diagnostic Neural Networks models for forecasting key performance indicators using Python, TensorFlow, and Keras.
-
Worked in e-commerce businesses solving customer behaviour problems such as lifetime value, clustering of customers, etc.
-
Performed ethical web scraping using Python with scraPy, RoboBrowser, and BeautifulSoup to obtain data for various analyses.
Technologies:
- Technologies:
- Data Science
TensorFlow
NumPy
Keras
Pandas
Scikit-learn
Matplotlib
Machine Learning
Plotly
-
Education
Standalone courseMachine Learning Specialization
Stanford University · 2023 - 2023
Standalone courseMachine Learning
Massachusetts Institute of Technology · 2021 - 2022
BSc.Business Management with maths
Kingston University · 2013 - 2016
BSc.Civil Engineering
Adolfo Ibanez University · 2011 - 2013
Find your next developer within days, not months
In a short 25-minute call, we would like to:
- Understand your development needs
- Explain our process to match you with qualified, vetted developers from our network
- You are presented the right candidates 2 days in average after we talk
