
[Serve] Flagscale serve supports automatic composition of multiple models #732


Open · wants to merge 31 commits into base: main

Conversation

cyber-pioneer
Collaborator

Description

Flagscale serve supports automatic composition of multiple models

python run.py --config-path ./examples/qwen2_5/conf --config-name serve_multiple_models action=run

@cyber-pioneer cyber-pioneer requested a review from a team as a code owner August 13, 2025 06:39
@Hchnr
Collaborator

Hchnr commented Aug 13, 2025

I'm not sure; maybe it would be a better choice to implement this at a higher level (adding a new compose-mode with no changes in serve-mode)?

deploy:
port: 6701
use_fs_serve: true
enable_composition: true
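As a toy sketch of how such a flag could switch between composed and independent serving (hypothetical helper, not FlagScale's actual dispatch code):

```python
# Toy sketch (hypothetical, not FlagScale's actual code): dispatch on the
# proposed deploy.enable_composition flag.
deploy_config = {"port": 6701, "use_fs_serve": True, "enable_composition": True}

def build_app(config: dict) -> str:
    # With composition on, the models would be wired into a single serving
    # graph; otherwise each model is deployed as an independent service.
    if config.get("enable_composition", False):
        return "composed multi-model app"
    return "independent single-model apps"

print(build_app(deploy_config))  # composed multi-model app
```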
Contributor

What does this parameter mean?

Collaborator Author

enable_composition enables the composition of multiple models into one serving graph.


from dag_utils import check_and_get_port
from fastapi import FastAPI, HTTPException, Request
from pydantic import create_model
from ray import workflow
from ray import serve
from ray.serve.handle import DeploymentHandle
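The imports above hint at a Ray Serve composition graph. As a toy stdlib-only illustration of the underlying idea (hypothetical model stages, not the PR's actual Ray Serve code), composition pipes one model's output into the next:

```python
# Toy illustration of multi-model composition (hypothetical stages,
# not the PR's Ray Serve implementation): each "model" is a callable,
# and composition runs them in sequence.
def model_a(prompt: str) -> str:
    # stand-in for a first model, e.g. a draft generator
    return prompt + " [draft]"

def model_b(text: str) -> str:
    # stand-in for a second model, e.g. a refiner
    return text + " [refined]"

def compose(*stages):
    """Return a callable that feeds each stage's output into the next."""
    def pipeline(x):
        for stage in stages:
            x = stage(x)
        return x
    return pipeline

serve_app = compose(model_a, model_b)
print(serve_app("hello"))  # hello [draft] [refined]
```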
Contributor

Do we need to add a Ray version restriction?

Collaborator Author

Nice idea; more Ray versions will be tested later.
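One way to enforce such a restriction, sketched here with a hypothetical minimum version (the PR does not pin one), is a small guard that compares the installed Ray version:

```python
import re

def parse_version(v: str) -> tuple:
    # Extract leading numeric dot-separated components, e.g. "2.9.3rc1" -> (2, 9, 3)
    m = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?", v)
    if not m:
        return (0,)
    return tuple(int(g) for g in m.groups() if g is not None)

def check_ray_version(installed: str, minimum: str = "2.9.0") -> None:
    # "2.9.0" is an illustrative minimum, not one stated in the PR.
    if parse_version(installed) < parse_version(minimum):
        raise RuntimeError(
            f"ray>={minimum} is required for serve composition, found {installed}"
        )
```

In real code `installed` would come from `ray.__version__`; tuple comparison handles multi-digit components (e.g. 2.10.0 > 2.9.0) that naive string comparison gets wrong.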


from flagscale.logger import logger

RequestData = create_model(
Contributor

How is multimodal input handled?

Collaborator Author

Set the config as follows:

    deploy:
      request:
        args:
          - prompt
          - num
        types:
          - str
          - int

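Under such a config, the request model can be built dynamically with pydantic's `create_model`; a minimal sketch (field and type names taken from the example config above, the mapping helper is hypothetical):

```python
from pydantic import create_model

# Mirror of the example deploy.request config above.
ARG_NAMES = ["prompt", "num"]
TYPE_NAMES = ["str", "int"]
# Hypothetical mapping from config type names to Python types.
TYPE_MAP = {"str": str, "int": int, "float": float, "bool": bool}

# create_model expects name=(type, default); Ellipsis marks the field required.
fields = {
    name: (TYPE_MAP[tname], ...)
    for name, tname in zip(ARG_NAMES, TYPE_NAMES)
}
RequestData = create_model("RequestData", **fields)

req = RequestData(prompt="hello", num=2)
print(req.prompt, req.num)  # hello 2
```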
@cyber-pioneer
Collaborator Author

I'm not sure; maybe it would be a better choice to implement this at a higher level (adding a new compose-mode with no changes in serve-mode)?

As discussed, what you mentioned refers to combining different independent services, while this scenario focuses on combining multiple different models. Your idea will be taken into consideration.

Collaborator

uses: ./.github/workflows/functional-tests.yml should be changed to uses: ./.github/workflows/functional-tests-nvidia.yml

Collaborator Author

done

@cyber-pioneer cyber-pioneer changed the title [WIP] Flagscale serve supports automatic composition of multiple models [Serve] Flagscale serve supports automatic composition of multiple models Aug 15, 2025
4 participants