Skip to content
View tzhang2014's full-sized avatar
  • guangzhou

Block or report tzhang2014

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Faster Whisper transcription with CTranslate2

Python 13,598 1,146 Updated Jan 1, 2025

An awesome & curated list of best LLMOps tools for developers

Shell 4,274 418 Updated Jan 21, 2025

A Telegram bot to recommend arXiv papers

Python 225 17 Updated Jan 8, 2025

Header-only TOML config file parser and serializer for C++17.

C++ 1,635 161 Updated Nov 14, 2024

[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

Python 387 45 Updated Jan 17, 2025

Python bindings for llama.cpp

C++ 199 28 Updated Apr 22, 2023

Simple, unified interface to multiple Generative AI providers

Python 9,812 888 Updated Jan 20, 2025

[arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization

Python 119 Updated Jun 12, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 38,546 4,737 Updated Jan 21, 2025

Official PyTorch implementation of FlatQuant: Flatness Matters for LLM Quantization

Python 90 8 Updated Nov 12, 2024
Python 37 4 Updated Oct 31, 2024

Run generative AI models in sophgo BM1684X

Python 154 24 Updated Jan 21, 2025

llm deploy project based onnx.

C++ 30 7 Updated Oct 9, 2024

Cross-platform C++ library providing a simple API to read and write INI-style configuration files

C++ 1,153 322 Updated Dec 9, 2024

A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.

61,301 7,882 Updated Jan 19, 2025

涵盖C++ Primer 5th、 effective C++ 、 STL api和demos C++ 基础知识与理论、 智能指针、C++11、 Git教程 Linux命令 Unix操作系统(进程、线程、内存管理、信号)计算机网络、 数据结构(排序、查找)、数据库、、C++对象模型、 设计模式、算法(《剑指offer》、leetcode、lintcode、hihocoder、《王道程序员求职…

HTML 2,533 606 Updated Jan 16, 2022

An acceleration library that supports arbitrary bit-width combinatorial quantization operations

C++ 212 21 Updated Sep 30, 2024

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,494 1,187 Updated Dec 1, 2024

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Python 755 59 Updated Oct 8, 2024

中文大模型能力评测榜单:目前已囊括153个大模型,覆盖chatgpt、gpt-4o、谷歌gemini、Claude3.5、百度文心一言、千问、百川、讯飞星火、商汤senseChat、minimax等商用模型, 以及deepseek-v3、qwen2.5、llama3.3、phi-4、glm4、书生internLM2.5等开源大模型。不仅提供能力评分排行榜,也提供所有模型的原始输出结果!

3,320 150 Updated Jan 21, 2025

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,559 112 Updated Jul 5, 2024

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,417 94 Updated Aug 13, 2024

Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Python 5,380 452 Updated Jan 11, 2025

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 11,890 709 Updated Jan 11, 2025

Kolors Team

Python 4,121 305 Updated Nov 13, 2024
Next