
Data Engineer
Alongside her technical abilities, Rihab has broad experience in leadership and project management. One of her key achievements is building a data curation service while also serving as Scrum Master, where she managed the team and implemented the new data service in Scala.
Rihab’s mix of strong technical skills and leadership experience makes her a great fit for projects in regulated industries.


Design & Implementation of a Forecasting Platform – Engie (French Global Energy Company)


Building and Supporting a Promotion Planning Demo Solution
Developed generic data pipelines to transform raw client data into a format compatible with the data model of the promotion planning demo system;
Wrote scripts to generate meaningful business data, ensuring alignment with the needs of the application;
Collaborated with the science team to understand business requirements and determine the necessary data transformations to enhance data utility;
Designed and implemented a generic PySpark codebase that efficiently transforms data to fit the required data model;
Utilized tools such as PySpark, JupyterHub, Kubernetes, and Azure Data Lake to execute and support the project.

Implementing and Migrating Data Pipelines, and Supporting Legacy Systems – SumUp (German Fintech Company)
Designed and implemented data pipelines for both batch and stream processing, optimizing data flow and efficiency;
Explored and implemented data pipelines using AWS Glue and PySpark, ensuring scalability and robustness;
Integrated Delta Lake into the pipelines to enable incremental (delta) processing, enhancing data management capabilities;
Developed job templating using Jinja to streamline the creation and management of data processing jobs;
Built and automated data validation pipelines, ensuring the accuracy and reliability of processed data;
Deployed and configured Trino to facilitate efficient data access and querying across various sources;
Prepared comprehensive documentation for each component and tool explored, ensuring knowledge transfer and easy maintenance;
Utilized tools such as Python, PySpark, Glue (Jobs, Crawlers, Catalogs), Athena, AWS, MWAA (Airflow), Kubernetes, Trino, and Jinja to achieve project goals.

Building a Data Curation Platform
Implemented a platform designed to make building data pipelines generic, easy, scalable, and quick to assemble for any new client;
Prepared detailed design documents, architectural blueprints, and specifications for the platform;
Gathered and documented requirements, creating specific epics and tasks, and efficiently distributed work among team members;
Developed command-line and pipeline functionalities that enable chaining transformations, facilitating the creation of generic data pipelines;
Supported the management of metadata for various entities defined within the platform;
Conducted runtime analysis and optimized the performance of different platform functionalities;
Studied scalability requirements and designed performance improvement strategies to enhance the platform's robustness;
Built a PySpark interface to facilitate seamless integration with data science workflows.



Project 1: Building a Speech Recognition Solution
Developed a speech recognition solution aimed at transforming retailers' questions and commands into actionable tasks executed against a user interface (UI);
Utilized TensorFlow, Python, AWS, and Node.js to design and implement the solution, ensuring seamless interaction between the speech recognition engine and the UI.
Project 2: Design and Implementation of a Short Life Cycle Forecasting System
Prepared comprehensive design documents and conducted studies on existing AI solutions, with a focus on voice and speech recognition capabilities;
Collaborated with the team to prepare and collect relevant data for the project;
Executed the processes of data augmentation, validation, and transformation to extract essential information for forecasting purposes;
Contributed to building a user interface and integrated backend functionalities using tools such as TensorFlow, Python, AWS, JavaScript, Node.js, Scala, and Spark.

Designed and structured the architecture for various components of a retail forecasting project;
Implemented and deployed key components, ensuring seamless functionality within the overall system;
Integrated all components, automating the processes and establishing an end-to-end batch process for streamlined operations;
Optimized the runtime and performance of each component, enhancing the system's overall efficiency;
Developed forecast comparison templates to facilitate the evaluation of forecast quality, aiding in accurate performance assessments;
Utilized LogicBlox, Python, and Tableau to achieve project goals, ensuring high-quality results.
Engineering excellence
Rihab’s overall performance in a 90-minute live technical assessment ranks in the top 25% of vetted Data Engineers at Proxify.
Issued Feb 2025 - Expires Feb 2027
Credential ID 133741658