[python] Update lmi_dist warmup logic #1367
Conversation
-            self.properties.get("max_rolling_batch_prefill_tokens", -1))
-        self.model.warmup(batch_size, max_batch_prefill_tokens)
+        max_prefill_tokens = int(
+            self.properties.get("max_rolling_batch_prefill_tokens", 4096))
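In isolation, the change swaps a -1 "unset" sentinel for a concrete 4096-token default. A minimal sketch of that lookup, where a plain dict stands in for `self.properties` and the helper name is hypothetical:

```python
# Sketch of the default-value change above. `properties` stands in for the
# handler's configuration dict; resolve_max_prefill_tokens is a hypothetical
# helper, not actual lmi-dist code.

def resolve_max_prefill_tokens(properties: dict) -> int:
    # Old behavior: default -1 acted as an "unset" sentinel.
    # New behavior: fall back to a concrete 4096-token warmup budget.
    return int(properties.get("max_rolling_batch_prefill_tokens", 4096))

print(resolve_max_prefill_tokens({}))  # 4096 when the property is unset
print(resolve_max_prefill_tokens({"max_rolling_batch_prefill_tokens": "8192"}))  # explicit value wins
```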
The purpose of max_rolling_batch_prefill_tokens has changed. Do we still want to keep this concept? And why do we now need to introduce max_prefill_tokens? Can we just set some small value like 512 or 1024?
Also, why are we starting to expose this externally? Can this be done inside lmi-dist?
This was external before.
Why do we set it to such a small value as 4096? Does that impact the total number of tokens we can support? And conversely, if a customer sets this value very high, like 32000, will that impact model loading?
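To make that trade-off concrete: warmup runs a prefill pass sized by this value, so its peak activation footprint scales roughly linearly with it. A back-of-the-envelope sketch with made-up numbers (fp16, hidden size 4096; none of this is measured from lmi-dist):

```python
# Illustrative arithmetic only: peak warmup activation memory grows roughly
# linearly with max_prefill_tokens. The per-token byte count is an assumed
# figure (fp16 activations, hidden size 4096, ~2 live buffers), not a
# measurement of lmi-dist.

BYTES_PER_TOKEN = 2 * 4096 * 2  # dtype bytes * hidden size * assumed live buffers

def warmup_activation_mb(max_prefill_tokens: int) -> float:
    return max_prefill_tokens * BYTES_PER_TOKEN / 1e6

print(round(warmup_activation_mb(4096), 1))   # modest footprint for the 4096 default
print(round(warmup_activation_mb(32000), 1))  # ~8x larger: can slow loading or OOM at load time
```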
72a0134 to 70f7204
tests/integration/llm/prepare.py (Outdated)
@@ -773,6 +773,7 @@ def build_lmi_dist_model(model):
     )
     options = lmi_dist_model_list[model]
     options["engine"] = "MPI"
+    options["option.mpi_mode"] = "true"
Is this needed, given that the engine is MPI?
Because this change requires this property: https://github.com/deepjavalibrary/djl-serving/blob/master/engines/python/setup/djl_python/rolling_batch/lmi_dist_rolling_batch.py#L42
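The linked line performs a startup check on that property. A hypothetical sketch of that kind of guard (function name and error message are illustrative, not the actual djl-serving source):

```python
# Hypothetical guard resembling the linked check: the lmi-dist rolling batch
# refuses to initialize unless mpi_mode is enabled. Not actual djl-serving code.

def ensure_mpi_mode(properties: dict) -> None:
    if str(properties.get("mpi_mode", "false")).lower() != "true":
        raise ValueError(
            "lmi-dist rolling batch requires option.mpi_mode=true "
            "(set engine=MPI and option.mpi_mode=true)")

ensure_mpi_mode({"mpi_mode": "true"})  # passes silently when the property is set
```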
I got an error without adding mpi_mode.