# quantization

Here are 10 public repositories matching this topic...

A self-contained AI project that runs a quantized Large Language Model (Qwen2.5-0.5B) entirely on your local machine. Built with FastAPI and llama-cpp-python, this agent intelligently switches between standard chat and "Search Mode" to fetch real-time data from the internet. The project features a responsive HTML/CSS/JS frontend and is fully Docker

  • Updated Jan 8, 2026
  • HTML

The Automated Waste Classification System is a web-based application designed to identify and categorize waste materials automatically using machine learning. It helps users efficiently sort waste into categories like glass, plastic, cardboard, etc., promoting recycling and proper waste management.

  • Updated Dec 7, 2025
  • HTML

AI-Powered Vocabulary Quiz Generator: A full-stack RAG application leveraging a fine-tuned Flan-T5 model and Pinecone Vector DB to generate dynamic English quizzes. Features ONNX quantization for high-speed CPU inference, a 7,100+ word curated dataset, and a secure Docker deployment on AWS EC2 with Nginx/SSL.

  • Updated Jan 4, 2026
  • HTML
