[CVPR 2025 Oral] OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels
Updated Dec 25, 2025 - Python
The PyVisionAI Official Repo
This is an official repository for "Harnessing Vision Models for Time Series Analysis: A Survey".
Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta
A2Mamba: Attention-augmented State Space Models for Visual Recognition
A simple-to-use package for calling various model providers, such as OpenAI, Anthropic, and others, with utmost reliability, security, and performance.
An implementation of gated MLPs in tinygrad, as an alternative to transformers.
DART (Diffusion-Autoregressive Recursive Transformer) is a novel hybrid architecture that combines diffusion-based and autoregressive approaches for text generation.
Dataset for PerfCam: Digital Twinning for Production Lines Using 3D Gaussian Splatting and Vision Models
A framework to compute threshold sensitivity of deep networks to visual stimuli.
🤖 Ollama Consumer - A Python-based interactive chat interface for Ollama models with advanced model management, comprehensive benchmarking, vision support, and automatic error recovery. Features dynamic model switching, GPU optimization, and intelligent service monitoring for seamless AI model interactions.
Vision-based Swarms in the Presence of Occlusions
(paused) building AVA from ex-machina; a lightweight multi-modal system from scratch, just for learning & experimentation
Decom-Renorm-Merge: Merging deep learning models through shared representation space.