PhD Researcher in AI, Data Science, and Distributed Digital Twins
I am a data scientist with a strong passion for machine learning, AI, and decentralized systems. My experience spans several industries, from developing data extraction tools and advancing voice synthesis to contributing to environmental sustainability through intelligent systems.
I am preparing for a PhD at the University of Amsterdam, where I will research event-based communication algorithms for decentralized digital twins in intelligent wastewater treatment systems, under the supervision of Dr. Victoria Degeler. This research is part of the NWO Merian Fund's "DDTclean" project, which aims to enhance sustainability through AI, sensor data, and machine learning.
With expertise in Python, R, SQL, and machine learning frameworks (such as TensorFlow, PyTorch, and Scikit-learn), I am excited to contribute to the intersection of data science, AI, and sustainable technologies.
Phenoma Platform of Agriculture, Benguerir, Morocco
Mar 2024 – Sep 2024
Led the development of an English-language knowledge graph for plant biology, using Named Entity Recognition (NER) and Relation Extraction (RE) to improve data extraction, management, and accessibility in agricultural research.
Applied advanced NLP techniques, including a fine-tuned BERT model, to capture relationships within agricultural data. This work significantly improved the extraction of actionable insights from large, unstructured datasets, supporting more informed decision-making by agricultural researchers.
Refined research methodologies to address key challenges in agricultural knowledge management, resulting in a substantial improvement in the accuracy and utility of data extraction from diverse sources.
The World Bank, Rabat, Morocco
Sep 2023 – Oct 2023
Engineered a data extraction tool using advanced language models and regular expressions, which increased the efficiency of data pipelines by 40% and reduced processing time by 25%. This tool was designed to handle large-scale data extraction and processing tasks, improving overall operational efficiency.
Aggregated unstructured data from over 10 different sources using advanced web scraping techniques, extracting valuable insights that directly contributed to high-level decision-making and policy recommendations at The World Bank.
VOSYN Inc., Remote, Canada
Jul 2023 – Sep 2023
Leveraged Tacotron 2, VALL-E, and Tortoise-TTS alongside large language models (LLMs) to enhance the quality and naturalness of voice synthesis, significantly improving user experience and engagement with the platform.
Played a key role in developing an innovative voice synthesis platform, improving multilingual capabilities and dynamic accuracy. This development broadened accessibility for users globally, offering more natural and accurate voice interactions across various languages.
Optimized linguistic models with PyTorch, enhancing comprehension and response generation, which led to a 30% decrease in user query resolution time, boosting the platform's overall efficiency and user satisfaction.
CERN, Geneva, Switzerland
Jul 2022 – Aug 2022
Implemented robust RESTful APIs to facilitate efficient data management and integration for the High Granularity Timing Detector (HGTD) within the ATLAS experiment, significantly boosting research productivity through improved data accessibility.
Designed and developed a sophisticated visualization application for the HGTD, enhancing data interpretation for researchers involved in particle physics studies.
Collaborated with interdisciplinary teams to design data schemas tailored for long-term data storage, ensuring compatibility and scalability to meet diverse research needs across CERN.
OCP Group, Morocco
May 2022 – Jul 2022
Structured and analyzed data to improve energy efficiency at OCP Group, with a focus on optimizing energy consumption across the organization's processes.
Created machine learning models to make accurate predictions and inform decision-making processes, particularly in energy management.
Provided solutions to optimize energy consumption at the Wastewater Treatment Plant (WWTP) of the city of Benguerir, significantly improving environmental sustainability while reducing operational costs.
PhD in Intelligent Wastewater Treatment - Distributed Digital Twins
University of Amsterdam, Netherlands
Starting: November 2024
Master of Science in Data Engineering
Mohammed VI Polytechnic University, Morocco
Graduated: 2024
Bachelor of Science in Data Science
Mohammed VI Polytechnic University, Morocco
Graduated: 2022