Large Language Models Can Solve Real-World Planning Rigorously with Formal Verification Tools

Codes and Dataset for the Paper "Large Language Models Can Solve Real-World Planning Rigorously with Formal Verification Tools".

Framework

Setup Environment

Create a conda environment and install dependency:

conda create -n fmtravelplanner python=3.9
conda activate fmtravelplanner
pip install -r requirements.txt

The UnsatChristmas dataset is provided in database_small folder. You can run interactive plan repair experiment with this database.
To run satisfiable plan generation experiment, refer to paper "TravelPlanner: A Benchmark for Real-World Planning with Language Agents" and their github repo to download their database and train/validation/test set.

Running

Satisfiable Plan Solving

The file for satisfiable plan generation experiment is test_travelplanner.py. An example command is python test_travelplanner.py --set_type train --model_name gpt Note: You might want to use the training set to adjust the prompts for different LLMs. You can add customized checker for steps and codes to further improve the performance.

Unsatisfiable Plan Repair

Run test_travelplanner_interactive.py for unsatisfiable interactive plan repair experiment for TravelPlanner. Follow the instructions in file to first collect initial codes and then do the plan repair.
Run test_unsat.py for unsatisfiable interactive plan repair experiment for UnsatChristmas. Follow the instructions in file to first collect initial codes and then do the plan repair.

Prompts

The prompts we used are included in the prompts folder

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
database_small		database_small
imgs		imgs
prompts		prompts
tools		tools
tools_small		tools_small
utils		utils
README.md		README.md
collect_plans.py		collect_plans.py
convert_json.py		convert_json.py
openai_func.py		openai_func.py
requirements.txt		requirements.txt
test_travelplanner.py		test_travelplanner.py
test_travelplanner_interactive.py		test_travelplanner_interactive.py
test_unsat.py		test_unsat.py
travelplanner_unsat.csv		travelplanner_unsat.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Large Language Models Can Solve Real-World Planning Rigorously with Formal Verification Tools

Framework

Setup Environment

Running

Satisfiable Plan Solving

Unsatisfiable Plan Repair

Prompts

About

Releases

Packages

Languages

yih301/LLM_Formal_Travel_Planner

Folders and files

Latest commit

History

Repository files navigation

Large Language Models Can Solve Real-World Planning Rigorously with Formal Verification Tools

Framework

Setup Environment

Running

Satisfiable Plan Solving

Unsatisfiable Plan Repair

Prompts

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages