New QWEN 2 VLM + o1 fixes for VHELM #3247
Conversation
Looks all good to me!
@@ -191,6 +187,18 @@ def _make_chat_request(self, request: Request) -> RequestResult:
        if raw_request["stop"] is None:
            raw_request.pop("stop")

        if request.model_engine == "o1-2024-12-17":
            # Avoid error:
@yifanmai note the extra logic for o1-2024-12-17 that's different from the other o1 models.
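The body of that guard is not visible in the hunk above, so the following is only a rough sketch of the kind of request rewriting such a special case typically needs. The parameter names max_tokens and max_completion_tokens come from the OpenAI chat API; whether this PR touches exactly these fields is an assumption, and the helper name is hypothetical.

```python
from typing import Any, Dict


def adapt_raw_request_for_o1(raw_request: Dict[str, Any]) -> Dict[str, Any]:
    """Hypothetical helper illustrating what the o1-2024-12-17 guard might do;
    the real body of the PR's branch is not shown in this hunk."""
    adapted = dict(raw_request)
    if "max_tokens" in adapted:
        # OpenAI reasoning models reject the legacy max_tokens parameter
        # in favor of max_completion_tokens.
        adapted["max_completion_tokens"] = adapted.pop("max_tokens")
    return adapted
```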
@@ -138,6 +138,13 @@ def alter_run_spec(run_spec: RunSpec) -> RunSpec:
    ):
        run_spec = singleton(IncreaseMaxTokensRunExpander(value=1).expand(run_spec))

    if model.name == "openai/o1-2024-12-17":
@yifanmai o1-2024-12-17 specifically is unusable in HELM without this change.
In practice, a few thousand reasoning tokens are used, at least for the VHELM scenarios.
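Given that note, the elided branch body plausibly reuses the IncreaseMaxTokensRunExpander already visible in the hunk to budget extra output tokens for reasoning. This is a sketch under that assumption: the expander, singleton, model, and run_spec names come from the surrounding HELM code shown above, but the concrete value below is an illustrative guess, not the number in the PR.

```python
# Hypothetical sketch of the elided branch inside alter_run_spec.
if model.name == "openai/o1-2024-12-17":
    # Reasoning tokens count against the completion budget, so leave a few
    # thousand tokens of headroom beyond what the scenario itself requests.
    # The value 4000 is assumed for illustration only.
    run_spec = singleton(IncreaseMaxTokensRunExpander(value=4000).expand(run_spec))
```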
No description provided.