@hinthornw
Collaborator

This task is meant to test a couple of things:

  1. Classification: common labels where models are expected to perform well (e.g., sentiment and toxicity; the toxicity label is currently always 0)
  2. Structured JSON output: the schema is nested, which confused some of the smaller 7B models I tested, but it works fine for llama 32b code instruct (and OAI/Anthropic)

The schema also includes a couple of common constructs, such as enums.
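
To illustrate the kind of nested schema with enums described above, here is a minimal sketch using only the Python standard library. It is not the schema from this PR: the class names, field names (`summary`, `classification`, `sentiment`, `toxicity`), and the `parse_extraction` helper are all hypothetical, chosen only to show how a nested object plus an enum constrains what a model must emit.

```python
import json
from dataclasses import dataclass
from enum import Enum


class Sentiment(str, Enum):
    """Hypothetical enum of allowed sentiment labels."""
    POSITIVE = "positive"
    NEUTRAL = "neutral"
    NEGATIVE = "negative"


@dataclass
class Classification:
    sentiment: Sentiment
    toxicity: int  # hypothetical integer score; 0 means non-toxic


@dataclass
class Extraction:
    summary: str
    classification: Classification  # nested object, the part smaller models may get wrong


def parse_extraction(raw: str) -> Extraction:
    """Parse a model's JSON string into the nested structure.

    Raises KeyError if a field is missing and ValueError if the
    sentiment is not one of the enum values.
    """
    data = json.loads(raw)
    inner = data["classification"]
    return Extraction(
        summary=data["summary"],
        classification=Classification(
            sentiment=Sentiment(inner["sentiment"]),
            toxicity=int(inner["toxicity"]),
        ),
    )


# Hypothetical model output used only for illustration.
example = (
    '{"summary": "Thanks for the quick fix.", '
    '"classification": {"sentiment": "positive", "toxicity": 0}}'
)
result = parse_extraction(example)
```

A model that flattens the nested object (for example, emitting `sentiment` at the top level) fails with a `KeyError` here, and a label outside the enum fails with a `ValueError`, which makes both failure modes easy to count in an evaluation.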

TODO: Clean up notebooks and add more analysis

@hinthornw hinthornw merged commit 5ffdbb5 into main Dec 1, 2023
@hinthornw hinthornw deleted the wfh/clc-extr branch December 1, 2023 18:17