Jun-Howie

Pinned

  1. vllm-project/vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python, 63.8k stars, 11.5k forks

  2. xorbitsai/inference Public

    Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-re…

    Python, 8.8k stars, 766 forks

  3. Vahe1994/AQLM Public

    Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf and PV-Tuning: Beyond Straight-Through Estimation for Ext…

    Python, 1.3k stars, 191 forks

  4. QwenLM/Qwen3 Public

    Qwen3 is the large language model series developed by the Qwen team, Alibaba Cloud.

    Python, 25.5k stars, 1.8k forks

  5. QwenLM/Qwen3-VL Public

    Qwen3-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.

    Jupyter Notebook, 16.6k stars, 1.4k forks

  6. LLMxMapReduce Public

    Forked from thunlp/LLMxMapReduce

    Python 3