-
t2v_metrics Public
Evaluating text-to-image/video/3D models with VQAScore
-
linzhiqiu.github.io Public
Forked from RayeRen/acad-homepage.github.io
Zhiqiu Lin's site
-
cross_modal_adaptation Public
Cross-modal few-shot adaptation with CLIP
-
lmms-eval Public
Forked from EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
Python Other Updated Feb 14, 2025 -
streamlit-video-captioning Public
Forked from streamlit/llm-examples
Streamlit LLM app
Python Apache License 2.0 Updated Jan 30, 2025 -
pytorchvideo Public
Forked from facebookresearch/pytorchvideo
A deep learning library for video understanding research.
Python Apache License 2.0 Updated Jan 25, 2025 -
streamlit-feedback-video Public
Forked from trubrics/streamlit-feedback
Collect user feedback from within your Streamlit app
JavaScript MIT License Updated Jan 13, 2025 -
CLIP-FlanT5 Public
Training code for CLIP-FlanT5
-
llm-can-optimize-vlm.github.io Public
Forked from llm-can-optimize-vlm/llm-can-optimize-vlm.github.io
JavaScript Updated May 5, 2024 -
LLaVA Public
Forked from haotian-liu/LLaVA
[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.
Python Apache License 2.0 Updated Dec 14, 2023 -
PerceptualSimilarity Public
Forked from richzhang/PerceptualSimilarity
LPIPS metric. pip install lpips
Python BSD 2-Clause "Simplified" License Updated Oct 27, 2023 -
visual_gpt_score Public
VisualGPTScore for visio-linguistic reasoning
-
avalanche Public
Forked from ContinualAI/avalanche
Avalanche: an End-to-End Library for Continual Learning.
Python MIT License Updated Sep 16, 2023 -
vision-language-models-are-bows Public
Forked from mertyg/vision-language-models-are-bows
Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023
Python MIT License Updated Apr 9, 2023 -
debiased-pseudo-labeling Public
Forked from frank-xwang/debiased-pseudo-labeling
[CVPR 2022] Debiased Learning from Naturally Imbalanced Pseudo-Labels
Jupyter Notebook MIT License Updated Feb 19, 2023 -
why-winoground-hard Public
Forked from ajd12342/why-winoground-hard
Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022
Python MIT License Updated Feb 4, 2023 -
mmselfsup Public
Forked from open-mmlab/mmselfsup
OpenMMLab Self-Supervised Learning Toolbox and Benchmark
Python Apache License 2.0 Updated Aug 11, 2022 -
HRNet-Semantic-Segmentation Public
Forked from HRNet/HRNet-Semantic-Segmentation
The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919
Python Other Updated Aug 1, 2022 -
clear-benchmark.github.io Public
Forked from clear-benchmark/clear-benchmark.github.io
HTML Updated Jul 6, 2022 -
dino Public
Forked from facebookresearch/dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Python Apache License 2.0 Updated Jun 9, 2022 -
HTML4Vision Public
Forked from mtli/HTML4Vision
A simple HTML visualization tool for computer vision research 🛠️
Python MIT License Updated Nov 9, 2020