Skip to content
View healthonrails's full-sized avatar

Block or report healthonrails

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

NumPy & SciPy for GPU

Python 9,864 881 Updated Feb 7, 2025

An implementation of chunked, compressed, N-dimensional arrays for Python.

Python 1,595 305 Updated Feb 7, 2025

Variational Animal Motion Embedding - A tool for time series embedding and clustering

Python 179 62 Updated Oct 3, 2024

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 380 22 Updated Feb 7, 2025

Variational Animal Motion Embedding - A tool for time series embedding and clustering

Python 21 3 Updated Jan 28, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 15,278 1,990 Updated Feb 1, 2025

Fast and accurate automatic speech recognition (ASR) for edge devices

Python 2,541 130 Updated Feb 4, 2025

TTS with kokoro and onnx runtime

Python 1,481 131 Updated Feb 6, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 18,213 1,302 Updated Feb 8, 2025

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python 6,777 1,229 Updated Nov 26, 2024

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,402 462 Updated Jan 28, 2025

Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024

Python 49 1 Updated Sep 11, 2024

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,329 426 Updated May 29, 2024

[ECCVW'24] Long-form Video Understanding by Bridging Episodic Memory and Semantic Knowledge

Python 23 3 Updated Sep 27, 2024

This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.

Python 1,162 55 Updated Nov 22, 2024

Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding

Python 139 5 Updated Jan 24, 2025

Includes the code for training and testing the CountGD model from the paper CountGD: Multi-Modal Open-World Counting.

Python 140 14 Updated Jan 6, 2025

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…

Rust 4,162 254 Updated Feb 7, 2025

Agno is a lightweight framework for building multi-modal Agents

Python 18,683 2,535 Updated Feb 8, 2025

Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!

Python 5,466 377 Updated Feb 7, 2025

The official code of "CSTA: CNN-based Spatiotemporal Attention for Video Summarization"

Python 47 8 Updated Dec 30, 2024

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Python 414 19 Updated Oct 16, 2024

tiny vision language model

Python 7,268 568 Updated Feb 7, 2025

Repository for 2nd generation Bpod platform (formerly beta branch of Bpod repository)

MATLAB 33 39 Updated Jan 15, 2025

Simple, unified interface to multiple Generative AI providers

Python 9,951 904 Updated Feb 6, 2025

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 1,611 159 Updated Dec 21, 2024

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Python 19,569 2,747 Updated Feb 5, 2025

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,410 111 Updated Jan 24, 2025
Next