Recognize Any Regions
Updated Dec 18, 2024 - Python
FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation Learning (CVPR25)
Official repository of "CoMP: Continual Multimodal Pre-training for Vision Foundation Models"
MonoDINO-DETR: Depth-Enhanced Monocular 3D Object Detection Using a Vision Foundation Model
A Vision Foundation Model for Cine Cardiac Magnetic Resonance Imaging
This repo collects some of the latest research work on Generative AI. It provides simple implementations to help understand the ideas, along with follow-up discussions to inspire future work.
"Boosting Gaze Object Prediction via Pixel-level Supervision from Vision Foundation Model"
Codebase for probing VFMs and Feature Upsamplers using Interactive Segmentation.
Simple Gradio application integrated with Hugging Face multimodal models to support a visual question answering chatbot and more features