TraceStorm is a tool for generating and replaying traces of requests to OpenAI API endpoints. It allows users to load-test a service by generating requests according to specified patterns and configurations.
- Generate synthetic traces using various patterns (e.g., uniform, Poisson)
- Load traces from public datasets (e.g., Azure LLM Inference dataset)
- Replay requests to any OpenAI-compatible LLM service
- Analyze results and visualize performance metrics
Install TraceStorm with pip:

```bash
pip install tracestorm
```
Before running the load test, ensure you have an OpenAI-compatible server running. If you haven't already installed `vllm`, you can do so with the following commands:
```bash
# Install vllm if you haven't already
pip install vllm

# Start the server with the desired model
vllm serve Qwen/Qwen2.5-1.5B-Instruct
```
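Before launching a full load test, it can be useful to confirm the server responds. Here is a minimal sanity check using the official `openai` Python client, assuming the default vLLM address (`http://localhost:8000/v1`) and the model served above:

```python
from openai import OpenAI

# vLLM's OpenAI-compatible server does not require a real key unless one
# was configured, so a placeholder value works here.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="none")

response = client.chat.completions.create(
    model="Qwen/Qwen2.5-1.5B-Instruct",
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
)
print(response.choices[0].message.content)
```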
Once the server is running, you can execute the load test using the `tracestorm` command. Here's how to run it with different options:
```bash
tracestorm --model "Qwen/Qwen2.5-1.5B-Instruct" --rps 5 --pattern uniform --duration 10
```
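At 5 requests per second for 10 seconds, this run issues 50 requests spaced 0.2 s apart. As a rough sketch of how the patterns differ (an illustration of the idea, not TraceStorm's implementation): a uniform pattern spaces send times evenly, while a Poisson pattern draws exponential inter-arrival gaps with the same mean rate:

```python
import random

rps, duration = 5, 10  # matches the command above

# Uniform: evenly spaced send times -> 0.0, 0.2, 0.4, ..., 9.8 (50 requests)
uniform_times = [i / rps for i in range(rps * duration)]

# Poisson process: exponential gaps with mean 1/rps, same average rate
poisson_times, t = [], 0.0
while t < duration:
    t += random.expovariate(rps)
    if t < duration:
        poisson_times.append(t)

print(len(uniform_times), len(poisson_times))  # 50 and roughly 50
```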
To load a trace from the Azure dataset, use:
```bash
tracestorm --model "Qwen/Qwen2.5-1.5B-Instruct" --pattern azure_code
```
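Conceptually, replaying a trace means sending each request at its recorded offset rather than at a fixed rate. Below is a simplified sketch of that idea; the trace contents and field layout are illustrative, not the Azure dataset's actual schema:

```python
import time

# Illustrative trace: (offset in seconds from start, prompt). Real datasets
# record timestamps and token counts; the exact schema may differ.
trace = [(0.0, "First prompt"), (0.4, "Second prompt"), (1.1, "Third prompt")]

start = time.monotonic()
for offset, prompt in trace:
    # Sleep until this request's scheduled send time, then fire it.
    delay = offset - (time.monotonic() - start)
    if delay > 0:
        time.sleep(delay)
    print(f"sending at +{time.monotonic() - start:.2f}s: {prompt}")
    # Here you would call the OpenAI-compatible endpoint, e.g. with the
    # client sketched earlier.
```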
Available options:

- `--model`: Required. The name of the model to use.
- `--rps`: Optional. Requests per second (default is 1; only used for synthetic patterns).
- `--pattern`: Optional. Pattern for generating the trace. Valid patterns include:
  - `uniform`: distributes requests evenly across the duration.
  - `azure_code`: loads the Azure inference dataset for code.
  - `azure_conv`: loads the Azure inference dataset for conversation.
- `--duration`: Optional. Duration in seconds (default is 10; only used for synthetic patterns).
- `--subprocesses`: Optional. Number of subprocesses to use (default is 1).
- `--base-url`: Optional. OpenAI base URL (default is `http://localhost:8000/v1`).
- `--api-key`: Optional. OpenAI API key (default is `none`).
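As a usage sketch, these options can be combined freely in a single invocation; for example, scripting a longer run from Python (the parameter values here are arbitrary):

```python
import subprocess

# Scripted run: 2 requests/second for 30 seconds across 4 subprocesses,
# against the default local endpoint. All flags correspond to the options
# documented above.
subprocess.run(
    [
        "tracestorm",
        "--model", "Qwen/Qwen2.5-1.5B-Instruct",
        "--rps", "2",
        "--pattern", "uniform",
        "--duration", "30",
        "--subprocesses", "4",
        "--base-url", "http://localhost:8000/v1",
        "--api-key", "none",
    ],
    check=True,
)
```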
Make sure to adjust the parameters according to your testing needs!