Hyper Tree Proof Search (HTPS) is an AI model designed for mathematical theorem proving. It combines advanced machine learning techniques with sophisticated search algorithms to verify and prove mathematical theorems.
At the heart of HTPS is a tree search algorithm, which efficiently explores large decision spaces. This algorithm balances the exploration of new, potentially beneficial paths with the exploitation of paths already known to be effective. In the context of theorem proving, this allows HTPS to navigate through complex mathematical proofs systematically.
HTPS utilizes reinforcement learning, a type of machine learning where the system learns optimal strategies through trial and error. This approach allows HTPS to improve its theorem-proving capabilities over time by adapting its strategies based on the outcomes of its decisions.
State Representation: In HTPS, the 'state' refers to the current stage of a theorem being verified. This representation allows the system to track progress and make decisions based on the current proof status.
Policy Model: HTPS employs a policy model, potentially driven by a Large Language Model (LLM), to suggest the best tactic or the minimum unit of proof in theorem verification. This model can be thought of as performing text generation tasks, proposing the next logical step in the proof process.
Critic Model: The critic model in HTPS assesses the verifiability of the theorem under consideration. It acts as a text classification model, determining whether the policy model can provide a valid proof for the current state of the theorem.
Both the policy model and critic model in HTPS use the language model from ReProver as the target model for training.
- Move to the docker directory
cd docker
- Build the docker image and run the container
./build.sh
- First, you need to install the LeanDojo in the container
cd ..
pip install -e src/app/LeanDojo/
python src/app/gym/evaluate_minif2f.py
This project has been made possible with the support and contributions of several individuals. Special thanks to:
-
Developers:
- Kei Tsukamoto
- Fumiya Uchiyama
-
Advisors:
- Sho Sonoda
- Akiyoshi Sannai
- Wataru Kumagai
We deeply appreciate your invaluable support and advice.
Yang, K., Swope, A., Gu, A., Chalamala, R., Song, P., Yu, S., Godil, S., Prenger, R., & Anandkumar, A. (2023). LeanDojo: Theorem Proving with Retrieval-Augmented Language Models. In Proceedings of the Neural Information Processing Systems Conference (NeurIPS 2023), Datasets and Benchmarks Track, Oral presentation.