-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedDec 21, 2024 -
neuraloperator Public
Forked from neuraloperator/neuraloperatorLearning in infinite dimension with neural operators.
Python MIT License UpdatedDec 20, 2024 -
kineto Public
Forked from pytorch/kinetoA CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
HTML Other UpdatedNov 8, 2024 -
RouteLLM Public
Forked from lm-sys/RouteLLMA framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
Python Apache License 2.0 UpdatedJul 9, 2024