Popular repositories Loading
-
DeepView.Profile
DeepView.Profile Public🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.
-
DeepView.Explore
DeepView.Explore Public🛠 VSCode plugin that provides visual interface for CentML Tools
-
DeepView.Predict
DeepView.Predict Public🔮 Execution time predictions for deep neural network training iterations across different GPUs.
-
-
flexible-inference-bench
flexible-inference-bench PublicA modular, extensible LLM inference benchmarking framework that supports multiple benchmarking frameworks and paradigms.
Python 8
-
gpu-usage-estimator
gpu-usage-estimator PublicPython script to estimate GPU utilization using NVIDIA Nsight Systems
Python 4
Repositories
- vllm Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
CentML/vllm’s past year of commit activity - spiffe-jwt Public
CentML/spiffe-jwt’s past year of commit activity - flash-attention Public Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
CentML/flash-attention’s past year of commit activity - platform_api_python_client Public
CentML/platform_api_python_client’s past year of commit activity - flexible-inference-bench Public
A modular, extensible LLM inference benchmarking framework that supports multiple benchmarking frameworks and paradigms.
CentML/flexible-inference-bench’s past year of commit activity - codex Public
A comprehensive collection of integration examples for CentML. This repository serves as a resource hub for developers looking to seamlessly incorporate CentML's capabilities into their applications. Explore a variety of use cases and implementations to accelerate your integration process.
CentML/codex’s past year of commit activity - centml_platform_docs Public
CentML/centml_platform_docs’s past year of commit activity - centml-python-client Public
CentML/centml-python-client’s past year of commit activity