Skip to content
View Arifuzzamanjoy's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report Arifuzzamanjoy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Arifuzzamanjoy/README.md
Typing SVG

Email LinkedIn Profile Views


🧠 About Me

I'm an AI & Machine Learning Engineer specializing in end-to-end model development and deployment. With 7+ years of Python experience, I excel in fine-tuning models, building automated AI systems, and deploying cutting-edge ML/AI solutions. As a 5-star reviewed freelancer on Upwork and Fiverr, I focus on multi-modal data tasks and have published multiple papers in high-impact SCI journals.

  • πŸ€– LoRA Fine-tuning Expert - Creating specialized image and video generation models
  • πŸ”„ ML/DL Researcher - Published in SCI journals (IF 0.7-7.1, Q1)
  • πŸ’» Python Automation Specialist - Expert in workflow automation & AI agents
  • 🧩 Problem Solver - Converting complex requirements into efficient solutions
Top Languages

πŸ’Ό Experience

  • AI & Machine Learning Engineer (Freelance) - Upwork, Fiverr, & Direct Clients (2023 - Present)

    • Develop and deploy cutting-edge ML and AI models, specializing in multi-modal data tasks.
  • Research Assistant - Rajshahi University Solar Lab / AI Lab (Mar 2022 - May 2023)

    • Conducted research on renewable energy and speech processing, designing high-efficiency solar cells.
    • Applied ML/DL techniques to analyze results and improve performance.
  • ERP System Setup and Data Analyst - KBEC, Dhaka (3-month contract)

    • Implemented Odoo ERP system for business process automation.
    • Scraped contact data for niche marketing.

πŸ’ͺ Technical Expertise

skills = {
    "Programming": ["Python (7+ yrs)", "SQL (5+ yrs)", "JavaScript", "HTML/CSS"],
    "AI/ML": ["Data Science (6+ yrs)", "Machine Learning (5+ yrs)", "Deep Learning (5+ yrs)", "Agentic AI (2+ yrs)", "NLP", "Hugging Face (3+ yrs)"],
    "Libraries & Frameworks": ["PyTorch", "TensorFlow", "Langchain", "Selenium", "OpenCV", "Diffusers", "LiveKit", "Librosa", "Transformers"],
    "DevOps & MLOps": ["Docker (3+ yrs)", "Kubernetes (3+ yrs)", "CI/CD (3+ yrs)", "Git", "GitHub"],
    "Cloud & SaaS": ["Azure & AWS (3+ yrs)", "SaaS Development (1+ yrs)", "MongoDB", "Firebase", "MySQL"],
    "Spoken Languages": ["English (Fluent)", "Bangla (Native)"]
}

πŸ”¬ Research & Publications

I've published in high-impact scientific journals focusing on ML in renewable energy and medical imaging:

  • Machine learning assisted revelation of the best performing single hetero-junction thermophotovoltaic cell - Sustainable Energy Technologies and Assessments (IF 7.1, Q1)
  • Machine Learning-Enabled Performance Exploration of AuCuSe4 in Thermophotovoltaic Cell - Solar Energy (IF 6, Q1)
  • Unleashing the Power of Open-Source Transformers in Medical Imaging - International Journal of Advanced Computer Science & Applications (IF 0.7)
  • Numerical studies on a ternary AgInTe2 chalcopyrite thin film solar cell - Heliyon (IF 4, Q1)

πŸ› οΈ Featured Projects

Advanced Image Generation System with LoRA

Self-hosted platform for hyper-realistic image generation and editing using Flux LoRA, Gradio, and Hugging Face Diffusers for product visualization and AI influencer models.

Tech: Python, Flux, LoRA, Gradio, Hugging Face Diffusers, Qwen-Image-Edit, PyTorch, CLIP, T5, CUDA

Self-Hosted Multi-GPU Video Generation with Wan 2.2

Multi-GPU pipeline for high-fidelity image-to-video, text-to-video, and speech-to-video generation using Wan 2.2 model.

Tech: Python, PyTorch, Multi-GPU (torch.distributed), FSDP, Docker, Bash Scripting, S2V, DiT, T5

Voice-Pro: AI-Powered Speech Processing Platform

Web application for speech recognition, translation, and voice cloning across 100+ languages, supporting YouTube processing and real-time translation.

Tech: Python, Whisper, WhisperX, F5-TTS, E2-TTS, Edge-TTS, Deep-Translator

Humanoid Calling Agent Platform

Full-stack platform for natural multi-modal conversations with real-time SIP/WebRTC telephony and emotionally expressive AI voice interactions.

Tech: Python, LiveKit, LLMs (OpenAI, Gemini), SIP, WebRTC, Avatar & Voice Synthesis APIs

πŸ“Š GitHub Analytics

GitHub Stats GitHub Streak

πŸŽ“ Education & Certifications

  • B.Sc. in Electrical & Electronic Engineering - University of Rajshahi, Bangladesh (2017-2020, CGPA: 3.13)
  • Higher Secondary Certificate (H.Sc.), Science - Dhaka Education Board (2015-2016, GPA: 5.00)
  • Deep Learning with TensorFlow - IBM
  • Prompt Engineering for ChatGPT - Vanderbilt University
  • SQL (Advanced) Certificate - HackerRank
  • Introduction to Programming with MATLAB - Vanderbilt University
  • Data, Signal, and Image Analysis with MATLAB - Coursera

Random Dev Quote

Last updated: 2025-10-08 09:33:03 UTC

Popular repositories Loading

  1. Bone-Conducted-Speech-Enhancement-With-Neural-Network Bone-Conducted-Speech-Enhancement-With-Neural-Network Public

    Jupyter Notebook 4

  2. Forecast-Electricity-Load-Beyond-Available-Data Forecast-Electricity-Load-Beyond-Available-Data Public

    Jupyter Notebook 3

  3. Watson Watson Public

    2

  4. Image-Classification Image-Classification Public

    Jupyter Notebook 2

  5. RU-Audio RU-Audio Public

    Here I've played with audio

    Jupyter Notebook 2

  6. TSTNN TSTNN Public

    Forked from key2miao/TSTNN

    transformer based neural network for speech enhancement in time domain

    Python 2