
Data Engineer
In addition to strong technical skills, Rihab has extensive experience in leadership and project management. One of Rihab's key achievements is building a data curation service while serving as Scrum Master, successfully leading the team and delivering a new data service in Scala.
This combination of technical skill and leadership experience makes Rihab an excellent fit for projects in regulated industries.


Design and Implementation of a Forecasting Platform - Engie (French global energy company)


Building and Supporting a Promotion Planning Demo Solution
Developed generic data pipelines to transform raw client data into a format compatible with the data model of the promotion planning demo system;
Wrote scripts to generate meaningful business data, ensuring alignment with the needs of the application;
Collaborated with the science team to understand business requirements and determine the necessary data transformations to enhance data utility;
Designed and implemented a generic PySpark codebase that efficiently transforms data to fit the required data model;
Utilized tools such as PySpark, JupyterHub, Kubernetes, and Azure Data Lake to execute and support the project.

Implementing and Migrating Data Pipelines, and Supporting Legacy Systems - SumUp (German fintech company)
Designed and implemented data pipelines for both batch and stream processing, optimizing data flow and efficiency;
Explored and implemented data pipelines using AWS Glue and PySpark, ensuring scalability and robustness;
Integrated Delta Lake into the pipelines to enable delta processing, enhancing data management capabilities;
Developed job templating using Jinja to streamline the creation and management of data processing jobs;
Built and automated data validation pipelines, ensuring the accuracy and reliability of processed data;
Deployed and configured Trino to facilitate efficient data access and querying across various sources;
Prepared comprehensive documentation for each component and tool explored, ensuring knowledge transfer and easy maintenance;
Utilized tools such as Python, PySpark, Glue (Jobs, Crawlers, Catalogs), Athena, AWS, MWAA (Airflow), Kubernetes, Trino, and Jinja to achieve project goals.

Building a Data Curation Platform
Implemented a platform designed to make building data pipelines generic, easy, scalable, and quick to assemble for any new client;
Prepared detailed design documents, architectural blueprints, and specifications for the platform;
Gathered and documented requirements, creating specific epics and tasks, and efficiently distributed work among team members;
Developed command-line and pipeline functionalities that enable chaining transformations, facilitating the creation of generic data pipelines;
Supported the management of metadata for various entities defined within the platform;
Conducted runtime analysis and optimized the performance of different platform functionalities;
Studied scalability requirements and designed performance improvement strategies to enhance the platform's robustness;
Built a PySpark interface to facilitate seamless integration with data science workflows.



Project 1: Building a Speech Recognition Solution
Developed a speech recognition solution aimed at transforming retailers' questions and commands into actionable tasks executed against a user interface (UI);
Utilized TensorFlow, Python, AWS, and Node.js to design and implement the solution, ensuring seamless interaction between the speech recognition engine and the UI.
Project 2: Design and Implementation of a Short Life Cycle Forecasting System
Prepared comprehensive design documents and conducted studies on existing AI solutions, with a focus on voice and speech recognition capabilities;
Collaborated with the team to prepare and collect relevant data for the project;
Executed the processes of data augmentation, validation, and transformation to extract essential information for forecasting purposes;
Contributed to building a user interface and integrated backend functionalities using tools such as TensorFlow, Python, AWS, JavaScript, Node.js, Scala, and Spark.

Designed and structured the architecture for various components of a retail forecasting project;
Implemented and deployed key components, ensuring seamless functionality within the overall system;
Integrated all components, automating the processes and establishing an end-to-end batch process for streamlined operations;
Optimized the runtime and performance of each component, enhancing the system's overall efficiency;
Developed forecast comparison templates to facilitate the evaluation of forecast quality, aiding in accurate performance assessments;
Utilized Logicblox, Python, and Tableau Software to achieve project goals, ensuring high-quality results.
Technical excellence
Rihab's overall performance in a 90-minute live technical assessment is in the top 25% of Proxify's vetted Data Engineers.
Issued Feb 2025 - Expires Feb 2027
Credential ID 133741658