Skip to content

Conversation

happyandslow
Copy link
Collaborator

Pull Request Description

This branch lets workload generator to generate workload based on prefix shared synthetic data

Related Issues

Resolves: #813

@happyandslow happyandslow changed the title Generate workload based on prefix sharing synthetic data [WIP] Generate workload based on prefix sharing synthetic data Mar 10, 2025
gangmuk and others added 6 commits March 12, 2025 14:17
* Script that generates workload for prefix aware routing. Included some prepared prefix workload

Signed-off-by: “Gangmuk <gangmuk@gmail.com>

* generate_realistic_prefix_share_workload.py in benchmakrs/generator

Signed-off-by: “Gangmuk <gangmuk@gmail.com>

---------

Signed-off-by: “Gangmuk <gangmuk@gmail.com>
Signed-off-by: Le Xu <le.xu@bytedance.com>
Signed-off-by: Le Xu <le.xu@bytedance.com>
Signed-off-by: Le Xu <le.xu@bytedance.com>
Signed-off-by: Le Xu <le.xu@bytedance.com>
Signed-off-by: Le Xu <le.xu@bytedance.com>
Signed-off-by: Le Xu <le.xu@bytedance.com>
@happyandslow happyandslow force-pushed the lexu/cache-aware-workload branch from 4989a69 to 33e5499 Compare March 12, 2025 21:17
@happyandslow happyandslow changed the title [WIP] Generate workload based on prefix sharing synthetic data Generate workload based on prefix sharing synthetic data Mar 12, 2025
@happyandslow happyandslow requested a review from Jeffwan March 12, 2025 21:19
Le Xu added 5 commits March 15, 2025 18:39
Signed-off-by: Le Xu <le.xu@bytedance.com>
…on workload)

Signed-off-by: Le Xu <le.xu@bytedance.com>
Signed-off-by: Le Xu <le.xu@bytedance.com>
Signed-off-by: Le Xu <le.xu@bytedance.com>
Signed-off-by: Le Xu <le.xu@bytedance.com>
@happyandslow happyandslow merged commit 4496d9a into vllm-project:main Mar 16, 2025
3 checks passed
happyandslow added a commit that referenced this pull request Mar 16, 2025
add missing image

Signed-off-by: Le Xu <le.xu@bytedance.com>
Co-authored-by: Le Xu <le.xu@bytedance.com>
gangmuk added a commit to gangmuk/aibrix-gangmuk that referenced this pull request Jun 21, 2025
…t#840)

* Workload generation scripts for prefix aware routing (vllm-project#820)

* Script that generates workload for prefix aware routing. Included some prepared prefix workload

Signed-off-by: “Gangmuk <gangmuk@gmail.com>

* generate_realistic_prefix_share_workload.py in benchmakrs/generator

Signed-off-by: “Gangmuk <gangmuk@gmail.com>

---------

Signed-off-by: “Gangmuk <gangmuk@gmail.com>
Signed-off-by: Le Xu <le.xu@bytedance.com>

* Generate workload based on prefix sharing synthetic data

Signed-off-by: Le Xu <le.xu@bytedance.com>

* update prefix sharing from distribution

Signed-off-by: Le Xu <le.xu@bytedance.com>

* remove adapter name

Signed-off-by: Le Xu <le.xu@bytedance.com>

* update user argument

Signed-off-by: Le Xu <le.xu@bytedance.com>

* update README

Signed-off-by: Le Xu <le.xu@bytedance.com>

* fix model argument

Signed-off-by: Le Xu <le.xu@bytedance.com>

* adding default model to client (making compatible with older generation workload)

Signed-off-by: Le Xu <le.xu@bytedance.com>

* fixing None statitiscs in output file

Signed-off-by: Le Xu <le.xu@bytedance.com>

* update readme for references

Signed-off-by: Le Xu <le.xu@bytedance.com>

* clean up

Signed-off-by: Le Xu <le.xu@bytedance.com>

---------

Signed-off-by: “Gangmuk <gangmuk@gmail.com>
Signed-off-by: Le Xu <le.xu@bytedance.com>
Co-authored-by: Gangmuk Lim <gangmuk@gmail.com>
Co-authored-by: Le Xu <le.xu@bytedance.com>
gangmuk pushed a commit to gangmuk/aibrix-gangmuk that referenced this pull request Jun 21, 2025
add missing image

Signed-off-by: Le Xu <le.xu@bytedance.com>
Co-authored-by: Le Xu <le.xu@bytedance.com>
Yaegaki1Erika pushed a commit to Yaegaki1Erika/aibrix that referenced this pull request Jul 23, 2025
…t#840)

* Workload generation scripts for prefix aware routing (vllm-project#820)

* Script that generates workload for prefix aware routing. Included some prepared prefix workload

Signed-off-by: “Gangmuk <gangmuk@gmail.com>

* generate_realistic_prefix_share_workload.py in benchmakrs/generator

Signed-off-by: “Gangmuk <gangmuk@gmail.com>

---------

Signed-off-by: “Gangmuk <gangmuk@gmail.com>
Signed-off-by: Le Xu <le.xu@bytedance.com>

* Generate workload based on prefix sharing synthetic data

Signed-off-by: Le Xu <le.xu@bytedance.com>

* update prefix sharing from distribution

Signed-off-by: Le Xu <le.xu@bytedance.com>

* remove adapter name

Signed-off-by: Le Xu <le.xu@bytedance.com>

* update user argument

Signed-off-by: Le Xu <le.xu@bytedance.com>

* update README

Signed-off-by: Le Xu <le.xu@bytedance.com>

* fix model argument

Signed-off-by: Le Xu <le.xu@bytedance.com>

* adding default model to client (making compatible with older generation workload)

Signed-off-by: Le Xu <le.xu@bytedance.com>

* fixing None statitiscs in output file

Signed-off-by: Le Xu <le.xu@bytedance.com>

* update readme for references

Signed-off-by: Le Xu <le.xu@bytedance.com>

* clean up

Signed-off-by: Le Xu <le.xu@bytedance.com>

---------

Signed-off-by: “Gangmuk <gangmuk@gmail.com>
Signed-off-by: Le Xu <le.xu@bytedance.com>
Co-authored-by: Gangmuk Lim <gangmuk@gmail.com>
Co-authored-by: Le Xu <le.xu@bytedance.com>
Yaegaki1Erika pushed a commit to Yaegaki1Erika/aibrix that referenced this pull request Jul 23, 2025
add missing image

Signed-off-by: Le Xu <le.xu@bytedance.com>
Co-authored-by: Le Xu <le.xu@bytedance.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Prefix sharing workload generation
3 participants