add support for llama2 and claude via bedrock #54
Haven't been able to get the Bedrock bits working locally, but I did my Mistral experiment based on this, and it seems to be working. So... LGTM!
Python usage note... I don't think it's for the same PR, and there are serious downsides too (like where do you set default values), but some use of `*args` and `**kwargs` could potentially DRY up `ai_label_student_work`, and make more visible the cases where args/kwargs are changed or overridden before being passed into the model, e.g.:
```python
def ai_label_student_work(self, *args, **kwargs):
    # assuming the configured model name is available as self.llm_model
    if self.llm_model.startswith("gpt"):
        return self.openai_label_student_work(*args, **kwargs)
```
Obviously this has some downsides too, since you suddenly can't see what `ai_label_student_work` takes params-wise, but I've seen this pattern help a lot in scientific and data-sci code where there are lots of layers and lots of params, and adding or removing a param means piping it around everywhere. May or may not be useful, YMMV.
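
Expanding on that, here is a hypothetical sketch of how the `*args`/`**kwargs` dispatch could still keep default values in one place. Every name in it (class, methods, `DEFAULTS`, parameters) is illustrative, not taken from this codebase:

```python
class AiRubricLabeler:
    """Illustrative sketch only: dispatch by model-name prefix."""

    # Shared defaults live in one spot instead of in each backend signature.
    DEFAULTS = {"temperature": 0.0, "num_responses": 1}

    def __init__(self, llm_model):
        self.llm_model = llm_model

    def ai_label_student_work(self, *args, **kwargs):
        # Merge defaults once, so overrides stay visible at this
        # single dispatch point before reaching any model backend.
        merged = {**self.DEFAULTS, **kwargs}
        if self.llm_model.startswith("gpt"):
            return self.openai_label_student_work(*args, **merged)
        return self.bedrock_label_student_work(*args, **merged)

    def openai_label_student_work(self, code, temperature, num_responses):
        return f"openai: {code!r} t={temperature} n={num_responses}"

    def bedrock_label_student_work(self, code, temperature, num_responses):
        return f"bedrock: {code!r} t={temperature} n={num_responses}"


labeler = AiRubricLabeler("anthropic.claude-v2")
# temperature is overridden here; num_responses falls back to the default.
print(labeler.ai_label_student_work("student code here", temperature=0.2))
```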
can you please try running
Exciting! How are the Mistral results looking so far?
I agree that this is a problem, and ideas for pythonic solutions are always welcome as I am still very new to Python. Thank you for flagging.
did an accuracy regression test run before merging to confirm no regression on
Follows #53
Adds support for the following models via Bedrock (a minimal invocation sketch follows the list):
- anthropic.claude-v2
- meta.llama2-13b-chat-v1
- meta.llama2-70b-chat-v1
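
For context, a minimal sketch of calling one of these models through the Bedrock runtime API with boto3. This is not code from this PR: the region, prompt, and parameter values are illustrative, and only the Claude v2 request shape is shown (the Llama 2 models take a different body, e.g. `max_gen_len` instead of `max_tokens_to_sample`):

```python
import json

import boto3

# Inference goes through the "bedrock-runtime" client, not "bedrock".
client = boto3.client("bedrock-runtime", region_name="us-east-1")

# Claude v2 expects the Human/Assistant prompt format.
body = json.dumps({
    "prompt": "\n\nHuman: Summarize this student program.\n\nAssistant:",
    "max_tokens_to_sample": 300,
    "temperature": 0.0,
})

response = client.invoke_model(
    modelId="anthropic.claude-v2",
    body=body,
    contentType="application/json",
    accept="application/json",
)

# The response body is a stream of JSON; Claude returns "completion".
result = json.loads(response["body"].read())
print(result["completion"])
```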
This enables rubric tester to evaluate the following experiments in `s3://cdo-ai/teaching_assistant/experiments/`:
- ai-rubrics-json-llama2
- ai-rubrics-json-reason-llama2
- ai-rubrics-json-reason-claude
Cost warning: we pay cash (not AWS credits) for the use of these models. A complete test run with Claude costs about $4. See the README updates in #53.