This project provides a framework for generating and evaluating datasets for natural language processing (NLP) mapping tasks using large language models (LLMs). It supports tasks such as string case conversion and RNA-to-protein translation, and is designed for extensibility and reproducibility.
- `src/`: Main source code directory
  - `generate_data.py`: Generates datasets for various mapping tasks and saves them in JSONL format.
  - `run_eval.py`: Evaluates LLMs on the generated datasets and prints results.
  - `utils/`: Utility modules for mapping and string comparison.
    - `llm.py`: Wrapper for interacting with Google GenAI models.
  - `evaluation.py`: Defines mapping classes and evaluation logic.
  - `data/`: Prompt templates and generated datasets.
- `requirements.txt`: Minimal set of Python packages required to run the project.
- `environment.yaml`: Conda environment specification (optional, for full-featured development).
- `run_subtask.py`: Unified CLI for generating data and running evaluation.
Install dependencies with pip:

```bash
pip install -r requirements.txt
```
Or create the full environment with conda:

```bash
conda env create -f environment.yaml
conda activate nlp-final
```
Generate datasets for all tasks (default):

```bash
python src/generate_data.py
```
Or specify tasks and a dataset size:

```bash
python src/generate_data.py --tasks lowercase rna --size 50 --output ./src/data/examples.jsonl
```
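Generated datasets use the JSONL convention: one JSON object per line. As a hedged sketch of how such a file can be written and read back, the snippet below uses hypothetical field names (`task`, `input`, `target`); the actual schema emitted by `generate_data.py` may differ.

```python
import json

# Hypothetical record layout -- the field names "task", "input", and
# "target" are assumptions, not the confirmed schema of generate_data.py.
records = [
    {"task": "lowercase", "input": "Hello World", "target": "hello world"},
    {"task": "rna", "input": "AUGGCU", "target": "MA"},
]

def write_jsonl(path, rows):
    """Write one JSON object per line (the JSONL convention)."""
    with open(path, "w", encoding="utf-8") as f:
        for row in rows:
            f.write(json.dumps(row) + "\n")

def read_jsonl(path):
    """Read a JSONL file back into a list of dicts."""
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]
```

Round-tripping through these helpers preserves the records exactly, which makes JSONL convenient for appending and for streaming large datasets line by line.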
Run evaluation on a generated dataset:

```bash
python src/run_eval.py
```
Or use the unified CLI:

```bash
python run_subtask.py --model pro --size 100 --tasks lowercase rna --eval
```
- 🔑 You must provide a valid Google GenAI API key in `src/key.secret`.
- 📄 Generated datasets are saved in JSONL format at the specified output path.
- 📝 Evaluation prints results with timestamps and confidence scores.
- ⚡ Model-specific behavior:
  - For `flash` (Gemini 2.5 Flash), thinking is disabled (`thinking_budget=0`).
  - For `pro` (Gemini 2.5 Pro), minimum thinking is enabled (`thinking_budget=128`).
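The alias-to-model resolution above can be sketched as a small lookup table. This is a minimal illustration, not the actual code in `utils/llm.py` (whose names and structure may differ); only the model names and budget values are taken from this README.

```python
# Sketch of how CLI model aliases might map to full model names and
# thinking budgets. The real table lives in utils/llm.py and may differ.
MODEL_ALIASES = {
    "flash": {"model": "gemini-2.5-flash", "thinking_budget": 0},
    "pro": {"model": "gemini-2.5-pro", "thinking_budget": 128},
}

def resolve_model(alias):
    """Return (full model name, thinking budget) for a CLI alias."""
    try:
        cfg = MODEL_ALIASES[alias]
    except KeyError:
        raise ValueError(f"Unknown model alias: {alias!r}")
    return cfg["model"], cfg["thinking_budget"]
```

Keeping the mapping in one dictionary means a new alias is a one-line addition rather than a change to the call sites.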
- ➕ Add new mapping tasks by implementing new classes in `evaluation.py` and updating `generate_data.py`.
- 🛠️ Add new model aliases in `utils/llm.py` as needed.
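As a hedged sketch of what a new mapping task could look like, the class below implements a string-reversal task. The class name and method names (`sample_input`, `apply`, `is_correct`) are assumptions for illustration; the actual base class and interface in `evaluation.py` may differ.

```python
import string

class ReverseMapping:
    """Hypothetical example task: map a string to its reversal.

    The interface here (sample_input / apply / is_correct) is an
    assumption; match whatever base class evaluation.py defines.
    """

    name = "reverse"

    def sample_input(self, rng, length=8):
        # Draw a random lowercase string to use as the task input.
        return "".join(rng.choice(string.ascii_lowercase) for _ in range(length))

    def apply(self, text):
        # The ground-truth mapping the LLM's answer is scored against.
        return text[::-1]

    def is_correct(self, text, prediction):
        # Exact match after trimming whitespace; the project's string
        # comparison utilities may apply a more lenient check.
        return prediction.strip() == self.apply(text)
```

Once such a class exists, `generate_data.py` would need to know about it (e.g. by registering it under its `name`) so that `--tasks reverse` can select it.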
For further details, see comments in the source files.