Run local LLMs like Gemma, Qwen, and LLaMA on Android for offline, private, real-time chat and question answering with LiteRT and ONNX Runtime.
Run AI without internet, on the web and desktop.
A cloud-to-edge MLOps pipeline for offline industrial diagnostics. Fine-tunes Phi-3-mini (3.8B) on cloud GPUs via QLoRA, quantizes to INT4, and deploys as a CPU-optimized ONNX microservice for industry-standard sensor logs.
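As a rough illustration of why the INT4 step above matters for edge deployment, here is a back-of-envelope sketch of the weight-memory arithmetic. The 3.8B parameter count comes from the description; the calculation ignores quantization scales, zero-points, and activation memory, so the real footprint is somewhat larger.

```python
# Back-of-envelope weight-memory math for quantizing a 3.8B-parameter model.
PARAMS = 3.8e9  # Phi-3-mini parameter count (from the description above)

def weight_bytes(params: float, bits_per_weight: int) -> float:
    """Approximate weight storage in bytes, ignoring scales/zero-points."""
    return params * bits_per_weight / 8

fp16_gb = weight_bytes(PARAMS, 16) / 1e9  # half-precision baseline
int4_gb = weight_bytes(PARAMS, 4) / 1e9   # after INT4 quantization
print(f"FP16: {fp16_gb:.1f} GB, INT4: {int4_gb:.1f} GB "
      f"({fp16_gb / int4_gb:.0f}x smaller)")
```

The 4x reduction (roughly 7.6 GB down to 1.9 GB of weights) is what makes CPU-only serving of a 3.8B model plausible on modest industrial hardware.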
Edge-AI powered Data-to-Text system that analyzes global fishery trends (Capture, Aquaculture, Stocks) and generates automated status reports offline using LSTM & TensorFlow Lite.
A comprehensive toolkit for streamlining and simplifying the offline inference process for LLMs across various models and libraries.
A multimodal offline counterfeit-detection system (text + image + table).
GPT-OSS 20B local execution: a lightweight local environment for running the model with Python 3.12 and CUDA acceleration. Run GPT-OSS 20B entirely offline, optimize text generation with the GPU, and enable fast, secure inference on consumer hardware.
Inclusive hand gesture recognition system for assistive human–computer interaction, based on classical machine learning and MediaPipe Hands.
Offline CrowdAware system for Raspberry Pi 4B and Heltec LoRa V3 using Raspberry Pi Camera Module 3 and MLX90640 Thermal Camera.
Real-time semantic audio codec achieving 300 bps bandwidth via generative-AI reconstruction.
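For context on how a semantic codec could fit speech into a 300 bps budget, here is a hypothetical bitrate sketch. The codebook size and token rate below are illustrative assumptions, not figures taken from the repository; the point is only that a small stream of discrete semantic tokens, reconstructed into audio by a generative model, needs orders of magnitude less bandwidth than waveform coding.

```python
import math

# Illustrative assumptions (not from the repository):
VOCAB_SIZE = 4096       # assumed semantic-token codebook size
TOKENS_PER_SECOND = 25  # assumed token rate of the encoder

bits_per_token = math.log2(VOCAB_SIZE)            # 12 bits per token
bitrate_bps = bits_per_token * TOKENS_PER_SECOND  # 12 * 25 = 300 bps
print(f"{bits_per_token:.0f} bits/token x {TOKENS_PER_SECOND} tokens/s "
      f"= {bitrate_bps:.0f} bps")
```

Under these assumptions the token stream alone accounts for the full 300 bps; all perceptual detail is then hallucinated back by the generative decoder rather than transmitted.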