-
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedAug 5, 2024 -
-
glow Public
Forked from pytorch/glowCompiler for Neural Network hardware accelerators
C++ Apache License 2.0 UpdatedApr 14, 2022 -
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
C++ Other UpdatedJul 14, 2021 -