Skip to content
View max-yue's full-sized avatar
🎯
Love what you do, do what you love.
🎯
Love what you do, do what you love.

Block or report max-yue

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Python tool for converting files and office documents to Markdown.

HTML 36,575 1,643 Updated Feb 11, 2025

Fast Semantic Text Deduplication

Python 507 22 Updated Jan 28, 2025

[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

340 9 Updated Jan 17, 2025

GenEval: An object-focused framework for evaluating text-to-image alignment

HTML 169 8 Updated Jul 24, 2024

Compute FID scores with PyTorch.

Python 3,534 526 Updated Jul 3, 2024

Versatile Evaluation of Speech and Audio

Python 156 13 Updated Feb 8, 2025

Scalable data pre processing and curation toolkit for LLMs

Jupyter Notebook 781 108 Updated Feb 11, 2025

OpenMMLab Detection Toolbox and Benchmark

Python 30,203 9,541 Updated Aug 21, 2024

官方推荐的 ChatTTS 资源汇总项目,整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project

1,438 87 Updated Jul 3, 2024

A generative speech model for daily dialogue.

Python 34,256 3,705 Updated Jan 25, 2025

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,792 601 Updated May 31, 2024

Baichuan-Omni: Towards Capable Open-source Omni-modal LLM 🌊

260 7 Updated Jan 27, 2025

DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.

Python 1,310 201 Updated Jan 13, 2025

Comfortably monitor your Internet traffic 🕵️‍♂️

Rust 21,434 629 Updated Feb 7, 2025

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 909 66 Updated Jan 31, 2025
Python 2,107 145 Updated Feb 10, 2025

An easy-to-use, fast, and easily integrable tool for evaluating audio LLM

Python 28 Updated Jan 24, 2025

Data and Code for Program of Thoughts (TMLR 2023)

Python 258 22 Updated May 15, 2024

contrastive decoding

Python 191 12 Updated Nov 14, 2022

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Python 306 7 Updated Nov 17, 2024

MINT-1T: A one trillion token multimodal interleaved dataset.

796 20 Updated Jul 31, 2024

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 32,329 3,023 Updated Feb 11, 2025

A fast, secure, and portable multichain light client for Ethereum

Rust 1,944 335 Updated Feb 10, 2025
Python 168 10 Updated Feb 6, 2025

A feature-rich command-line audio/video downloader

Python 99,793 7,812 Updated Feb 11, 2025

A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)

155 4 Updated Jan 23, 2025

Towards Large Multimodal Models as Visual Foundation Agents

Python 176 6 Updated Feb 5, 2025

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python 2,362 175 Updated Jan 30, 2025
Next