feat: add mm embed worker and service. #518
Conversation
```diff
- // if raw_output.outputs.size() value is 0,
- // this means all sequences are in prefill stage status.
- const int64_t num_seqs = raw_output.outputs.size();
+ int64_t num_seqs;
```
We still need to add an MMBatch subclass; the existing Batch logic should not be changed.
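To make the suggestion concrete, here is a minimal sketch of subclassing rather than modifying the existing batch type. All names (`Batch`, `MMBatch`, `mm_inputs`) are illustrative; the repository's actual interfaces will differ:

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Simplified stand-in for the existing batch type (hypothetical interface).
class Batch {
 public:
  virtual ~Batch() = default;
  virtual int64_t num_seqs() const { return seqs_; }

 protected:
  int64_t seqs_ = 0;
};

// The reviewer's suggestion: keep Batch untouched and put the
// multimodal-specific state and behavior in a subclass.
class MMBatch : public Batch {
 public:
  // Multimodal payloads carried alongside the sequences; the member
  // name and type are placeholders only.
  std::vector<std::vector<uint8_t>> mm_inputs;
};
```

The point of the design is that any code path that only understands `Batch` keeps working unchanged, while multimodal code paths downcast or construct `MMBatch` directly.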
Force-pushed from d642170 to a103d80.
fix: convert bytes data to base64 encoding.
fix: fix base64 encoding.
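These fixup commits concern encoding raw multimodal bytes as base64 for transport. As a reference for what the encoding involves (this is a standalone sketch, not the PR's actual implementation), every 3 input bytes map to 4 output characters, with `=` padding the final group:

```cpp
#include <cassert>
#include <cstdint>
#include <string>
#include <vector>

// Minimal RFC 4648 base64 encoder: 3 bytes -> 4 characters,
// '=' pads the last group when the input length is not a multiple of 3.
std::string base64_encode(const std::vector<uint8_t>& data) {
  static const char tbl[] =
      "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/";
  std::string out;
  size_t i = 0;
  for (; i + 2 < data.size(); i += 3) {
    // Pack 3 bytes into 24 bits, then emit four 6-bit indices.
    uint32_t n = (data[i] << 16) | (data[i + 1] << 8) | data[i + 2];
    out += tbl[(n >> 18) & 63];
    out += tbl[(n >> 12) & 63];
    out += tbl[(n >> 6) & 63];
    out += tbl[n & 63];
  }
  if (i < data.size()) {
    // 1 or 2 trailing bytes: pad the 24-bit group with zeros.
    uint32_t n = data[i] << 16;
    if (i + 1 < data.size()) n |= data[i + 1] << 8;
    out += tbl[(n >> 18) & 63];
    out += tbl[(n >> 12) & 63];
    out += (i + 1 < data.size()) ? tbl[(n >> 6) & 63] : '=';
    out += '=';
  }
  return out;
}
```

For example, the bytes `"Man"` encode to `"TWFu"`, and shorter inputs pick up padding (`"Ma"` → `"TWE="`, `"M"` → `"TQ=="`).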
```cpp
MMBatchData(const std::vector<MMData>& datas);
MMBatchData(uint32_t ty, const MMDict& items);

bool has(uint32_t type) const { return type & ty_ != 0; }
```
Note that `!=` binds tighter than `&`, so this parses as `type & (ty_ != 0)`; `(type & ty_) != 0` may be what's intended.
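The precedence issue is easy to demonstrate in isolation. In the sketch below (hypothetical free functions, not the PR's member function), the "buggy" form reduces to `type & 1` whenever the mask is nonzero, so it misses set bits above bit 0:

```cpp
#include <cassert>
#include <cstdint>

// Buggy form from the diff: `!=` binds tighter than `&`,
// so this is actually `type & (ty != 0)`, i.e. `type & 1`.
bool has_buggy(uint32_t type, uint32_t ty) {
  return type & ty != 0;
}

// Intended membership test: mask first, then compare against zero.
bool has_fixed(uint32_t type, uint32_t ty) {
  return (type & ty) != 0;
}
```

With `type = 0b0100` and `ty = 0b0110`, the fixed version correctly reports membership, while the buggy version returns false because `0b0100 & 1 == 0`.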
```diff
- if (model_backend == "llm") {
-   worker_type =
-       (options.task_type() == "generate") ? WorkerType::LLM : WorkerType::ELM;
+ if (options.task_type() == "gnerate") {
```
Should this be `generate`? Looks like a typo.
…6 and bf16 types.
add mm embedding service.
add mm embedding worker.
add tensor serialization and deserialization in shared memory manager.
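The last commit adds tensor serialization and deserialization in the shared memory manager. As a sketch of what such a scheme can look like (one plausible flat layout — header, shape, then raw element bytes — not the PR's actual format; all names are placeholders):

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical flat layout for a tensor in a shared-memory segment:
// [int32 dtype][int32 ndim][int64 shape[ndim]][raw element bytes]
struct TensorView {
  int32_t dtype;               // e.g. 0 = fp32, 1 = fp16, 2 = bf16
  std::vector<int64_t> shape;
  std::vector<uint8_t> bytes;  // raw element data
};

std::vector<uint8_t> serialize(const TensorView& t) {
  std::vector<uint8_t> buf;
  auto put = [&buf](const void* p, size_t n) {
    const uint8_t* b = static_cast<const uint8_t*>(p);
    buf.insert(buf.end(), b, b + n);
  };
  const int32_t ndim = static_cast<int32_t>(t.shape.size());
  put(&t.dtype, sizeof(t.dtype));
  put(&ndim, sizeof(ndim));
  put(t.shape.data(), sizeof(int64_t) * ndim);
  put(t.bytes.data(), t.bytes.size());
  return buf;
}

TensorView deserialize(const std::vector<uint8_t>& buf) {
  TensorView t;
  size_t off = 0;
  auto get = [&buf, &off](void* p, size_t n) {
    std::memcpy(p, buf.data() + off, n);
    off += n;
  };
  int32_t ndim = 0;
  get(&t.dtype, sizeof(t.dtype));
  get(&ndim, sizeof(ndim));
  t.shape.resize(ndim);
  get(t.shape.data(), sizeof(int64_t) * ndim);
  t.bytes.assign(buf.begin() + off, buf.end());
  return t;
}
```

In a real shared-memory path the serialized buffer would be written directly into the mapped segment rather than a `std::vector`, but the round-trip logic is the same: a fixed header describing dtype and shape, followed by the raw element bytes.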