Add eval logic based on https://txt.cohere.com/evaluating-llm-outputs/#llm-generated-evaluation #26

Open

Add an evaluation section to the config file that checks for common functions and, possibly, prompts for LLM-generated evaluation.
Track the eval metrics for the train, eval, and test sets in W&B.
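
As a starting point for discussion, here is a minimal sketch of what this could look like. Everything in it is an assumption rather than existing project code: the `EVAL_CONFIG` layout, the key names (`metrics`, `splits`, `llm_judge_prompt`), the two example metrics, and the helper names are all hypothetical. It reads "common functions" as common metric functions, which may not be what was intended. Logging goes through a generic `log_fn` callable so the sketch runs without W&B installed.

```python
from collections import Counter
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical config section; key names are assumptions, not the project's schema.
EVAL_CONFIG = {
    "eval": {
        "metrics": ["exact_match", "token_f1"],
        "splits": ["train", "eval", "test"],
        # Hypothetical prompt template for LLM-generated evaluation.
        "llm_judge_prompt": (
            "Rate the response from 1 to 5.\n"
            "Reference: {reference}\nResponse: {prediction}"
        ),
    }
}


def exact_match(prediction: str, reference: str) -> float:
    """1.0 if the stripped strings are identical, else 0.0."""
    return float(prediction.strip() == reference.strip())


def token_f1(prediction: str, reference: str) -> float:
    """Whitespace-token F1 between prediction and reference."""
    pred_tokens, ref_tokens = prediction.split(), reference.split()
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


# Maps the metric names used in the config to their implementations.
METRIC_REGISTRY: Dict[str, Callable[[str, str], float]] = {
    "exact_match": exact_match,
    "token_f1": token_f1,
}


def render_judge_prompt(config: dict, prediction: str, reference: str) -> str:
    """Fill the (hypothetical) LLM-judge prompt template from the config."""
    return config["eval"]["llm_judge_prompt"].format(
        prediction=prediction, reference=reference
    )


def evaluate_splits(
    config: dict,
    outputs: Dict[str, List[Tuple[str, str]]],
    log_fn: Optional[Callable[[Dict[str, float]], None]] = None,
) -> Dict[str, float]:
    """Average each configured metric per split; keys look like 'train/exact_match'.

    `outputs` maps a split name to (prediction, reference) pairs.
    Splits with no outputs are skipped.
    """
    results: Dict[str, float] = {}
    for split in config["eval"]["splits"]:
        pairs = outputs.get(split, [])
        if not pairs:
            continue
        for name in config["eval"]["metrics"]:
            metric = METRIC_REGISTRY[name]
            scores = [metric(pred, ref) for pred, ref in pairs]
            results[f"{split}/{name}"] = sum(scores) / len(scores)
    if log_fn is not None:
        log_fn(results)  # e.g. pass wandb.log here
    return results
```

To send the metrics to W&B, one option is to call `wandb.init(...)` once and then pass `wandb.log` as `log_fn`, since `wandb.log` accepts a dict of metric names to values. The `split/metric` key format is only a suggestion for grouping charts by split.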
