This image, generated with DALL-E, depicts a wide Moroccan landscape where ancient ruins and modern AI structures blend, symbolizing the harmony between the past and the future.
- π± Hello, I'm Saad, a 23-year-old based in France, with a deep passion for creating projects in the realms of Data and Artificial Intelligence.
- π I hold a Data Engineering degree from INPT.
- πΌ Currently working as a Machine Learning Engineering Apprentice at AXA - Direct Assurance.
- π I'm also preparing for a Master's degree in Machine Learning and Data Science at Paris CitΓ© University.
Contributed to repackaging and updating the GIT Clustering algorithm π based on insights from an arXiv paper, with implementation available in the GitHub repository π and distribution through the TestPyPI Package π¦.
- Machine Learning Engineer / Data Scientist Apprenticeship at
AXA - Direct Assurance
, Paris, France (Ongoing) More details - Data Engineer / Data Scientist Internship at
Chefclub
, Paris, France (6 months) More details - Data Engineer Intern at
Capgemini Engineering
, Casablanca, Morocco (2 months) - Data Scientist Intern at
AIOX Labs
, Rabat, Morocco (2 months) - Web/Backend Developer Intern at
DXC Technologies
, Rabat, Morocco (2 months)
- Repository: LLM RAG
Description: A Streamlit application leveraging a Retrieval-Augmented Generation (RAG) Language Model (LLM) π€ with FAISS indexing ποΈ to provide answers from uploaded markdown files. Users can upload documents π, input queries, and receive contextually relevant answers using Similarity Search π, showcasing a practical application of NLP technologies π€. The project is also equipped with a CI/CD pipeline π ensuring code quality & tests and simple deployment, and it supports containerization with Docker π³ for easy distribution and deployment.
- Technologies/Tools: Streamlit, OpenAI API Models (LLMs, Embedding Models), FAISS, Python, Docker, CI/CD (Github Actions), Makefile, venv.
Description: A showcase of MLOps best practices using Kedro π οΈ, this repository shows the journey of Machine Learning Models from development to deployment π, utilizing Docker π³. Featuring straightforward training, evaluation, and deployment of models such as XGBoost Regressor, LightGBM π‘ and Random Forest Regeressor π³, it integrates built-in visualization π and logging π for effective monitoring. Dive into the world of modular and scalable data pipelines with Kedro π Kedro Documentation. The integration of an automated CI pipeline π with Github Actions ensures code quality β and reliability π.
- Technologies/Tools: Docker, Kedro, MLOps, CI/CD (Github Actions), Machine Learning (XGBoost, Random Forest, LightGBM), Jupyter Notebook, Makefile, venv, Python.
- Repository: GIT Clustering
Description: An upgraded version of the GIT Clustering algorithm π, informed by insights from an arXiv paper π, with easy deployment via TestPyPI π¦. Includes benchmarking notebooks π comparing it to state-of-the-art clustering algorithms π.
- Technologies/Tools: Benchmarking, Poetry Packaging, PyPI Distributing, Machine Learning (K-means, DBSCAN, AgglomerativeClustering, Gaussian Mixture..), Jupyter Notebook, Makefile, venv, Python.
- Repository: Monthly & Daily Energy Forecasting Docker API
Description: This repository π¦ houses an Energy Forecasting API β‘ that uses Machine Learning to predict daily π and monthly π energy consumption from historical data π. It's designed as a practical demonstration of a ML Engeineering/Data Science workflow, from initial analysis to a deployable API packaged with Docker π³.
- Technologies/Tools: MLOps, Docker, API design, Machine Learning (XGBoost, Random Forest), Jupyter Notebook, Makefile, venv, Python.
Let's make something innovative together! Feel free to reach out for collaborations or discussions in Data & Artificial Intelligence!
- README last updated on 17/04/2024. Regularly updated to reflect current work and interests.