Felipe A.

Data Scientist

Felipe is a highly skilled Data Scientist with over seven years of experience across fintech, proptech, edtech, and consultancy. He combines strong technical expertise in machine learning with the ability to effectively communicate complex concepts to stakeholders.

His technical proficiency includes working with advanced Data Science and ML tools such as Snowflake, dbt, Airflow, and MLflow. A career highlight was his role at Cambridge University, where he developed and taught an advanced online data science course, showcasing both his subject-matter expertise and ability to simplify complex topics. Additionally, at Outra, he played a key role in securing a multi-million-dollar contract with Zoopla.

Felipe’s unique blend of deep technical knowledge and strong communication skills positions him as a standout professional in the field of Data Science.

Main expertise

  • Pytest
    Pytest 2 years
  • AWS
    AWS 3 years
  • Bash
    Bash 4 years

Other skills

  • Agile
    Agile 4 years
  • PyTorch
    PyTorch 2 years
  • Asana
    Asana 1 years
Felipe

Felipe A.

United Kingdom

Get started

Selected experience

Employment

  • Lead Data Scientist

    Rylee - 1 year

    • Worked at Rylee, an e-commerce platform that provided customers with product insights and market analysis to improve strategies on Bol.com and Amazon.
    • Earned Databricks certification in Generative AI, demonstrating expertise with RAG and Agent models.
    • Developed a large-scale sales forecasting model using Rylee’s internal data and scraped data from Bol.com to predict product sales and identify bestsellers.
    • Built an API with Flask and AWS Lambda to handle product queries, delivering insights and sales forecasts.
    • Designed and implemented ETL pipelines using dbt, Airflow, and Spark to automate feature engineering, including an asynchronous solution for efficient data retrieval from the Bol.com API while respecting rate limits and parallelizing at scale; also automated data reconciliation across multiple sellers.
    • Applied PyTorch and Spark to optimize high-performance machine learning models.

    Technologies:

    • Technologies:
    • MySQL MySQL
    • AWS AWS
    • Databricks Databricks
    • Apache Spark Apache Spark
    • Django Django
    • Flask Flask
    • Python Python
    • SQL SQL
    • Data Science
    • Digital Ocean Digital Ocean
    • TensorFlow TensorFlow
    • NumPy NumPy
    • XGBoost XGBoost
    • Pandas Pandas
    • Data Engineering
    • Scrum Scrum
    • Git Git
    • Scikit-learn Scikit-learn
    • ELT
    • Data Analytics
    • Random Forest Random Forest
    • Clustering
    • SVM
    • PCA
    • Convolutional neural network Convolutional neural network
    • Transformer Network
    • Data Modeling
    • ETL ETL
    • NLP
    • Machine Learning Machine Learning
    • FastAPI FastAPI
    • OpenAI API OpenAI API
    • Prompt Engineering
    • Large Language Models (LLM) Large Language Models (LLM)
    • Hugging Face Transformers Hugging Face Transformers
    • MLflow
    • Pipeline optimization
  • Lead Data Scientist

    Homemove - 3 months

    • Worked at Homemove, a comprehensive platform offering moving-related services, including surveys, removals, and mortgages, integrated within a single app.
    • Developed an LLM-powered negotiation tool that allowed users to obtain quotes and negotiate prices via an AI chatbot, automatically added customers to the CRM, and alerted the sales team upon successful negotiation, using OpenAI Assistant and GPT models.
    • Led a scalable data transformation initiative, leveraging Snowflake for cloud data warehousing and Sigma for BI and visualization.
    • Designed and implemented ETL pipelines from scratch using Snowflake, Python, SQL, dbt, and Airflow to automate data ingestion and transformation.
    • Built a predictive modeling solution to reduce marketing costs and improve targeting by identifying high-potential home movers.
    • Applied PyTorch and Snowpark for advanced machine learning to optimize high-performance models.
    • Delivered a predictive model that was planned for use in attracting investment during Homemove's Series A funding.

    Technologies:

    • Technologies:
    • MySQL MySQL
    • AWS AWS
    • ElasticSearch ElasticSearch
    • Databricks Databricks
    • Apache Spark Apache Spark
    • Django Django
    • Flask Flask
    • Python Python
    • SQL SQL
    • AWS Lambda AWS Lambda
    • AWS S3 AWS S3
    • Data Science
    • Regression testing
    • Digital Ocean Digital Ocean
    • TensorFlow TensorFlow
    • NumPy NumPy
    • XGBoost XGBoost
    • Keras Keras
    • Pandas Pandas
    • Data Engineering
    • PyTorch PyTorch
    • PyCharm PyCharm
    • Scrum Scrum
    • Git Git
    • SciPy SciPy
    • Scikit-learn Scikit-learn
    • ELT
    • Matplotlib Matplotlib
    • Data Analytics
    • Random Forest Random Forest
    • Clustering
    • SVM
    • PCA
    • Convolutional neural network Convolutional neural network
    • Recurrent neural network
    • Transformer Network
    • Snowflake Snowflake
    • Data Modeling
    • ETL ETL
    • NLP
    • Machine Learning Machine Learning
    • FastAPI FastAPI
    • SQLAlchemy SQLAlchemy
    • Streamlit Streamlit
    • Pytest Pytest
    • Plotly Plotly
    • OpenAI API OpenAI API
    • Prompt Engineering
    • Large Language Models (LLM) Large Language Models (LLM)
    • Vertex AI Vertex AI
    • Pinecone Pinecone
    • Hugging Face Transformers Hugging Face Transformers
    • MLflow
    • Pipeline optimization
  • Data Science Instructor and Course Developer

    Cambridge University & FourthRev - 8 months

    • Worked as a Data Science specialist at FourthRev, creating and teaching an advanced online data science course for Cambridge University students.
    • Developed and delivered a comprehensive curriculum covering Neural Networks, NLP for AI, Unsupervised Learning, and advanced Decision Tree algorithms including XGBoost.
    • Applied hands-on machine learning implementation from scratch and employed innovative teaching methods to enhance student engagement and learning outcomes.
    • Demonstrated deep technical expertise in data science and machine learning, earning recognition from academic peers for effective teaching and curriculum design.

    Technologies:

    • Technologies:
    • Data Science
    • NumPy NumPy
    • XGBoost XGBoost
    • Pandas Pandas
    • Scikit-learn Scikit-learn
    • Matplotlib Matplotlib
    • Machine Learning Machine Learning
    • Pytest Pytest
    • Plotly Plotly
    • Neural Network
  • Senior Data Scientist

    Outra - 2 years

    • Worked at Outra, a data-driven property insight company, specializing in delivering household-level data to optimize client services.
    • Migrated the platform from Dataiku to a custom in-house Intelligence Fabric using MLflow, Airflow, Snowflake, GitHub Actions, AWS, and DBT for data engineering.
    • Developed two major predictive models forecasting household listing and sale/rent timelines, enabling a multi-million-pound partnership with Zoopla.
    • Applied LLMs for code documentation, coding assistance, and interactive chatbots for dashboards and customer-facing data.
    • Built ETL/ELT pipelines to transform raw data and prepare it for modeling.
    • Created visualizations and maps using KeplerGI, Seaborn, and Dataiku to help non-technical users interpret complex data.

    Technologies:

    • Technologies:
    • AWS AWS
    • Apache Spark Apache Spark
    • Django Django
    • Flask Flask
    • Python Python
    • SQL SQL
    • AWS Lambda AWS Lambda
    • AWS S3 AWS S3
    • Data Science
    • TensorFlow TensorFlow
    • NumPy NumPy
    • XGBoost XGBoost
    • Keras Keras
    • Pandas Pandas
    • Data Engineering
    • PyTorch PyTorch
    • Scrum Scrum
    • Git Git
    • Scikit-learn Scikit-learn
    • ELT
    • Apache Airflow Apache Airflow
    • Matplotlib Matplotlib
    • Data Analytics
    • Random Forest Random Forest
    • Clustering
    • SVM
    • PCA
    • Convolutional neural network Convolutional neural network
    • Recurrent neural network
    • Transformer Network
    • Snowflake Snowflake
    • Data Modeling
    • ETL ETL
    • NLP
    • Machine Learning Machine Learning
    • AWS EC2 AWS EC2
    • FastAPI FastAPI
    • SQLAlchemy SQLAlchemy
    • Streamlit Streamlit
    • Pytest Pytest
    • Plotly Plotly
    • dbt dbt
    • OpenAI API OpenAI API
    • Prompt Engineering
    • Neural Network
    • Large Language Models (LLM) Large Language Models (LLM)
    • Vertex AI Vertex AI
    • Dataiku Dataiku
    • Hugging Face Transformers Hugging Face Transformers
    • MLflow
    • Pipeline optimization
  • Senior Data Scientist

    Belmont Green - 2 years 6 months

    • Worked at Belmont Green, a specialist mortgage lending company turned bank, providing financial and mortgage solutions to financially affected customers.
    • Created a conversion model using survival analysis techniques and managed the project from inception to production.
    • Built machine learning algorithms and statistical models for time series data, focusing on retention, lifetime value, and expected loss models.
    • Owned projects end-to-end, ensuring proofs of concept were implemented and deployed successfully in production.
    • Implemented machine learning algorithms for cashflow, early redemption, default, pre-payment, and conversion models using Python and R.
    • Applied clustering and segmentation techniques to analyze product usage and customer behavior for marketing and strategic purposes.

    Technologies:

    • Technologies:
    • Data Science
    • NumPy NumPy
    • XGBoost XGBoost
    • Keras Keras
    • Pandas Pandas
    • PyTorch PyTorch
    • Scikit-learn Scikit-learn
    • Matplotlib Matplotlib
    • Machine Learning Machine Learning
    • SQLAlchemy SQLAlchemy
    • Plotly Plotly
    • Neural Network
  • Data Scientist

    Boster.AI - 2 years

    • Boster.Ai is a company dedicated to create No-code bots for data retrieval, monitoring and automation. Originally, they started as an IT consultancy company, creating personalized solutions to small and mid-size companies to harvest the power of Machine Learning

    • Felipe worked performing data exploration, analysis, and building Machine Learning algorithms and statistical models for several start-ups/mid-size companies in the UK and USA.

    • He built and diagnostic Neural Networks models for forecasting key performance indicators using Python, TensorFlow, and Keras.

    • Worked in e-commerce businesses solving customer behaviour problems such as lifetime value, clustering of customers, etc.

    • Performed ethical web scraping using Python with scraPy, RoboBrowser, and BeautifulSoup to obtain data for various analyses.

    Technologies:

    • Technologies:
    • Data Science
    • TensorFlow TensorFlow
    • NumPy NumPy
    • Keras Keras
    • Pandas Pandas
    • Scikit-learn Scikit-learn
    • Matplotlib Matplotlib
    • Machine Learning Machine Learning
    • Plotly Plotly

Education

  • Standalone courseMachine Learning Specialization

    Stanford University · 2023 - 2023

  • Standalone courseMachine Learning

    Massachusetts Institute of Technology · 2021 - 2022

  • BSc.Business Management with maths

    Kingston University · 2013 - 2016

  • BSc.Civil Engineering

    Adolfo Ibanez University · 2011 - 2013

Find your next developer within days, not months

In a short 25-minute call, we would like to:

  • Understand your development needs
  • Explain our process to match you with qualified, vetted developers from our network
  • You are presented the right candidates 2 days in average after we talk

Not sure where to start? Let’s have a chat