Skip to content
View ShiXianzheng's full-sized avatar

Block or report ShiXianzheng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

🐙Octopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming.

Python 280 19 Updated May 20, 2024

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Python 1,485 153 Updated Dec 2, 2024

Codebase for Automated Creation of Digital Cousins for Robust Policy Learning

Python 181 17 Updated Dec 3, 2024

Official implementation of "Self-Improving Video Generation"

Python 58 2 Updated Dec 26, 2024

Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner.

Python 447 27 Updated Dec 6, 2024

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Jupyter Notebook 6,139 616 Updated Sep 26, 2024
Python 7,215 567 Updated Jan 24, 2025

[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,135 48 Updated Jan 23, 2025

Inference code for Llama models

Python 57,312 9,672 Updated Aug 18, 2024

Rotary Transformer

Python 866 52 Updated Mar 21, 2022

AI算法岗求职攻略(涵盖准备攻略、刷题指南、内推和AI公司清单等资料)

5,364 645 Updated Apr 24, 2024

大数据入门指南 ⭐

Java 16,127 4,272 Updated Jan 5, 2024

[ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization"

Python 86 6 Updated Aug 23, 2024

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 11,091 2,153 Updated Dec 13, 2024

Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.

Python 422 82 Updated May 15, 2023

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 34,675 5,301 Updated Jan 24, 2025

[TMLR 2024] Efficient Large Language Models: A Survey

1,077 89 Updated Jan 14, 2025

Code repo for the paper "LLM-QAT Data-Free Quantization Aware Training for Large Language Models"

Python 265 25 Updated Sep 3, 2024

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Python 756 59 Updated Oct 8, 2024

LightSeq: A High Performance Library for Sequence Processing and Generation

C++ 3,242 332 Updated May 16, 2023

The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)

Jupyter Notebook 127 21 Updated Nov 19, 2020
Python 15 6 Updated Oct 26, 2022

中国大模型

5,814 491 Updated Nov 30, 2024

Grok open release

Python 49,860 8,345 Updated Aug 30, 2024

Official inference library for Mistral models

Jupyter Notebook 9,872 879 Updated Nov 12, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,341 406 Updated Aug 7, 2024

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,646 502 Updated Jan 21, 2025

The official implementation of the ICML 2023 paper OFQ-ViT

Python 30 1 Updated Oct 3, 2023

Running BERT without Padding

C++ 468 54 Updated Mar 18, 2022

a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.

C++ 1,502 199 Updated Jun 12, 2023
Next