Stars
Faster Whisper transcription with CTranslate2
An awesome & curated list of best LLMOps tools for developers
A Telegram bot to recommend arXiv papers
Header-only TOML config file parser and serializer for C++17.
[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
Simple, unified interface to multiple Generative AI providers
[arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Official PyTorch implementation of FlatQuant: Flatness Matters for LLM Quantization
Cross-platform C++ library providing a simple API to read and write INI-style configuration files
A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.
涵盖C++ Primer 5th、 effective C++ 、 STL api和demos C++ 基础知识与理论、 智能指针、C++11、 Git教程 Linux命令 Unix操作系统(进程、线程、内存管理、信号)计算机网络、 数据结构(排序、查找)、数据库、、C++对象模型、 设计模式、算法(《剑指offer》、leetcode、lintcode、hihocoder、《王道程序员求职…
An acceleration library that supports arbitrary bit-width combinatorial quantization operations
Unsupervised text tokenizer for Neural Network-based text generation.
[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
中文大模型能力评测榜单:目前已囊括153个大模型,覆盖chatgpt、gpt-4o、谷歌gemini、Claude3.5、百度文心一言、千问、百川、讯飞星火、商汤senseChat、minimax等商用模型, 以及deepseek-v3、qwen2.5、llama3.3、phi-4、glm4、书生internLM2.5等开源大模型。不仅提供能力评分排行榜,也提供所有模型的原始输出结果!
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.