Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

LLMaJ for evaluation of InstructLab fine-tuned LLM with and without RAG #177

Open
1 task
ktam3 opened this issue Nov 14, 2024 · 0 comments
Open
1 task
Assignees
Labels
enhancement New feature or request

Comments

@ktam3
Copy link

ktam3 commented Nov 14, 2024

Feature Overview (mandatory - Complete while in New status)

LLM-as-Judge (LLMaJ) pipeline for the side-by-side evaluation of POCs using InstructLab fine-tuned LLMs with and without RAG.

Goals (mandatory - Complete while in New status)
Provide an LLMaJ pipeline users can execute for evaluation comparing one or more combinations:

  • Starter model
  • Starter model w/RAG
  • InstructLab fine-tuned model
  • InstructLab fine-tuned model w/RAG
  • External LLM provider (e.g. OpenAI)

The LLMaJ model should be served by an OpenAI-compatible endpoint such that the user can use GPT4 or a custom or on-premise LLMaJ model (e.g. Mixtral)

Requirements (mandatory -_ Complete while in Refinement status):

  • The LLMaJ pipeline for evaluating models with and without RAG should be available for downstream users.
  • It should store the evaluations in a format that can be consumed by data scientists interested in further analysis (e.g., parquet table)

Done - Acceptance Criteria (mandatory - Complete while in Refinement status):

  • The pipeline is available for RHEL AI users
  • The docs should include information on how to use/run this new pipeline

Out of Scope {}(Initial completion while in Refinement status):{}

  • This is not about an exhaustive evaluation of RAG methodologies.
  • This is not about creating a LoRA layer or fine-tuned model for LLMaJ

Tasks Needed:

@ktam3 ktam3 added the enhancement New feature or request label Nov 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants