smolvlm
Here are 16 public repositories matching this topic...
Real-time webcam demo using SmolVLM with vLLM backend
-
Updated
May 15, 2025 - HTML
This repository contains the implementation of AlignVLM paper, which proposes a novel method for vision language alignment
-
Updated
May 23, 2025 - Python
🎭 Real-time voice-controlled 3D avatar with multimodal AI - speak naturally and watch your AI companion respond with perfect lip-sync
-
Updated
Jul 5, 2025 - TypeScript
A small VLM that sees everything
-
Updated
Sep 15, 2025 - HTML
Scripts for combining SmolVLM and LLM
-
Updated
May 15, 2025 - Python
⭐ Comparing VLMs with CNNs for garbage classification
-
Updated
Sep 29, 2025 - Jupyter Notebook
A simple web application for real-time AI vision analysis using SmolVLM-500M-Instruct with live camera feed processing and text-to-speech.
-
Updated
Jun 30, 2025 - JavaScript
Real-time vision demo using SmolVLM with llama.cpp backend
-
Updated
Aug 29, 2025 - HTML
This blog post introduces SmolVLM, a 2B VLM, SOTA for its memory footprint. SmolVLM is small, fast, memory-efficient, and fully open-source. All model checkpoints, VLM datasets, training recipes and tools are released under the Apache 2.0 license.
-
Updated
May 17, 2025 - HTML
A Flask-based web app for managing multimodal datasets text and images with CRUD operations via SQLite, and seamless export as a structured Parquet dataset to Hugging Face Hub.
-
Updated
Jul 23, 2025 - HTML
A some what optimized implementation of some light weight and popular models
-
Updated
Sep 12, 2025 - Python
Real-time webcam demo with SmolVLM and llama.cpp server
-
Updated
Sep 30, 2025 - JavaScript
Improve this page
Add a description, image, and links to the smolvlm topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the smolvlm topic, visit your repo's landing page and select "manage topics."