Skip to content
View hlin99's full-sized avatar

Block or report hlin99

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. vllm-fork vllm-fork Public

    Forked from HabanaAI/vllm-fork

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python

  2. vllm-hpu-extension vllm-hpu-extension Public

    Forked from HabanaAI/vllm-hpu-extension

    Python

  3. Mooncake Mooncake Public

    Forked from kvcache-ai/Mooncake

    Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

    C++

  4. hccl_demo hccl_demo Public

    Forked from HabanaAI/hccl_demo

    C++

  5. gateway-api-inference-extension gateway-api-inference-extension Public

    Forked from kubernetes-sigs/gateway-api-inference-extension

    Gateway API Inference Extension

    Jupyter Notebook

  6. neural-compressor neural-compressor Public

    Forked from intel/neural-compressor

    SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

    Python