Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,402 462 Updated Jan 28, 2025

benedettaliberatori / T3AL

Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024

Python 49 1 Updated Sep 11, 2024

THUDM / CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,329 426 Updated May 29, 2024

joslefaure / HERMES

[ECCVW'24] Long-form Video Understanding by Bridging Episodic Memory and Semantic Knowledge

Python 23 3 Updated Sep 27, 2024

apple / ml-aim

This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.

Python 1,162 55 Updated Nov 22, 2024

cvlab-stonybrook / LearningToCountEverything

Python 386 77 Updated Nov 30, 2023

IDEA-Research / ChatRex

Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding

Python 139 5 Updated Jan 24, 2025

niki-amini-naieni / CountGD

Includes the code for training and testing the CountGD model from the paper CountGD: Multi-Modal Open-World Counting.

Python 140 14 Updated Jan 6, 2025

lancedb / lance

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…

Rust 4,162 254 Updated Feb 7, 2025

agno-agi / agno

Agno is a lightweight framework for building multi-modal Agents

Python 18,683 2,535 Updated Feb 8, 2025

lancedb / lancedb

Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!

Python 5,466 377 Updated Feb 7, 2025

thswodnjs3 / CSTA

The official code of "CSTA: CNN-based Spatiotemporal Attention for Video Summarization"

Python 47 8 Updated Dec 30, 2024

mit-han-lab / hart

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Python 414 19 Updated Oct 16, 2024

vikhyat / moondream

tiny vision language model

Python 7,268 568 Updated Feb 7, 2025

sanworks / Bpod_Gen2

Repository for 2nd generation Bpod platform (formerly beta branch of Bpod repository)

MATLAB 33 39 Updated Jan 15, 2025

andrewyng / aisuite

Simple, unified interface to multiple Generative AI providers

Python 9,951 904 Updated Feb 6, 2025

IDEA-Research / Grounded-SAM-2

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 1,611 159 Updated Dec 21, 2024

huggingface / datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Python 19,569 2,747 Updated Feb 5, 2025

McGill-NLP / llm2vec

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,410 111 Updated Jan 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

healthonrails healthonrails

Achievements

Achievements

Block or report healthonrails

Stars

cupy / cupy

zarr-developers / zarr-python

LINCellularNeuroscience / VAME

DAMO-NLP-SG / VideoLLaMA3

EthoML / VAME

deepseek-ai / Janus

usefulsensors / moonshine

thewh1teagle / kokoro-onnx

deepseek-ai / DeepSeek-R1

OpenBMB / MiniCPM-o

facebookresearch / SlowFast

NVIDIA / Cosmos