GitHub - JuryMindAI/jurymind-ai: Framework for agentic evaluation of LLMs, Prompt Optimization, Data Generation and Labeling.

Collective AI Judgment for Smarter Models

Overview

JuryMind AI is designed to harness the power of large language models (LLMs) as intelligent judges. Our platform enables automated LLM evaluation, prompt optimization, dataset generation and auto-labeling with agentic AI judges working collaboratively — like a jury of experts.

JuryMind AI empowers ML teams, AI researchers, and startups to measure, refine, and improve their language models and prompt engineering workflows with minimal manual effort.

Features

LLM Evaluation: Score model outputs based on customizable criteria using expert LLM judges.
Prompt Optimization: Iteratively improve prompts to achieve specified goals like clarity, relevance, or conciseness.
Dataset Generation Generate high-quality datasets over your data.
Auto Labeling: Generate high-quality labels automatically using AI-driven judgments.
Agentic Judges: Leverage multiple AI agents working in parallel or consensus for robust evaluations.
Modular Architecture: Easily extend the platform with modules like JudgeLab(TBD), PromptLab(TBD), and LabelLab(TBD).

Getting Started

Prerequisites

Python 3.8+
Docker & Docker Compose
OpenAI API key (set in .env)

Installation

Clone the repo:

git clone https://github.com/yourusername/jurymind-ai.git
cd jurymind-ai

Name		Name	Last commit message	Last commit date
Latest commit History 58 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
docs		docs
jurymind		jurymind
mlartifacts		mlartifacts
src		src
.gitignore		.gitignore
.python-version		.python-version
.readthedocs.yaml		.readthedocs.yaml
LICENSE		LICENSE
README.md		README.md
README.rst		README.rst
logo.png		logo.png
main.py		main.py
mkdocs.yml		mkdocs.yml
mlflow_logs.txt		mlflow_logs.txt
pydantic_client.py		pydantic_client.py
pyproject.toml		pyproject.toml
small_data.json		small_data.json
spoiler_dataset.json		spoiler_dataset.json
uv.lock		uv.lock
weather.py		weather.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Overview

Features

Getting Started

Prerequisites

Installation

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

JuryMindAI/jurymind-ai

Folders and files

Latest commit

History

Repository files navigation

Overview

Features

Getting Started

Prerequisites

Installation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages