Starred repositories
🐙 Octopus, an embodied vision-language model trained with RLEF that excels at embodied visual planning and programming.
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Codebase for Automated Creation of Digital Cousins for Robust Policy Learning
Official implementation of "Self-Improving Video Generation"
Heterogeneous Pre-trained Transformer (HPT) as a Scalable Policy Learner.
Accepted as a [NeurIPS 2024] Spotlight Presentation Paper.
[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
[ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization"
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
Quantization library for PyTorch. Supports low-precision and mixed-precision quantization, with hardware implementation through TVM.
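As background for the quantization repos starred here, this is a minimal sketch of symmetric per-tensor INT8 quantization, the simplest low-precision scheme such libraries build on. The function names are illustrative and not taken from any of these repos:

```python
import numpy as np

def quantize_int8(x):
    """Symmetric per-tensor INT8 quantization: derive a scale from the
    absolute maximum, round to the nearest integer, clamp to [-127, 127]."""
    scale = np.abs(x).max() / 127.0
    q = np.clip(np.round(x / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize_int8(q, scale):
    """Map the integer codes back to floating point."""
    return q.astype(np.float32) * scale
```

Real libraries add per-channel scales, calibration over activation statistics, and fused hardware kernels, but the round-and-clamp core is the same.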
A high-throughput and memory-efficient inference and serving engine for LLMs
[TMLR 2024] Efficient Large Language Models: A Survey
Code repo for the paper "LLM-QAT: Data-Free Quantization Aware Training for Large Language Models"
[ICLR 2024 Spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
LightSeq: A High Performance Library for Sequence Processing and Generation
An unofficial PyTorch implementation of Learned Step Size Quantization (LSQ), ICLR 2020.
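The core LSQ operation is a fake-quantizer with a learned step size s: clamp(round(v/s), Qn, Qp) * s. The following is a sketch of that forward pass only (the learned-gradient machinery is omitted, and the function name is illustrative, not from this repo):

```python
import numpy as np

def lsq_forward(v, s, n_bits=8, signed=True):
    """LSQ fake-quantization forward pass: scale by the learned step
    size s, round, clamp to the integer range, and rescale."""
    if signed:
        qn, qp = -(2 ** (n_bits - 1)), 2 ** (n_bits - 1) - 1
    else:
        qn, qp = 0, 2 ** n_bits - 1
    v_bar = np.clip(np.round(v / s), qn, qp)  # integer code in [qn, qp]
    return v_bar * s  # fake-quantized value back in the real domain
```

During training, LSQ additionally treats s as a parameter and scales its gradient by 1/sqrt(N * Qp), which is what the paper's straight-through estimator adds on top of this forward pass.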
Official inference library for Mistral models
The official repo of Qwen-VL (通义千问-VL), the chat and pretrained large vision-language model proposed by Alibaba Cloud.
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
The official implementation of the ICML 2023 paper OFQ-ViT
A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT-2, decoders, etc.) on CPU and GPU.