InvBench is the official repository for the paper "Can LLMs Accelerate Program Verification with Invariant Synthesis?".
InvBench/
├─ Dataset/
│ ├─ Evaluation/
│ │ ├─ Easy/ # 113 C programs (Easy Split)
│ │ ├─ Hard/ # 113 C programs (Hard Split)
│ │ └─ timing.json # Evaluation timing data
│ └─ Training/
│ ├─ Programs/ # 3589 C programs for training
│ ├─ invariants.json # Ground-truth invariants for training programs
│ └─ timing.json # Training timing data
- Easy Split: 113 C programs under
Dataset/Evaluation/Easy/. - Hard Split: 113 C programs under
Dataset/Evaluation/Hard/. - Training Programs: 3589 C programs under
Dataset/Training/Programs/. - Ground-truth Invariants:
Dataset/Training/invariants.json. - Timing Data:
Dataset/Evaluation/timing.jsonandDataset/Training/timing.json. Eachtimevalue records the time that UAutomizer takes to solve the corresponding program.
If our research inspires you, please cite our paper:
@misc{wei2025invbenchllmsaccelerateprogram,
title={InvBench: Can LLMs Accelerate Program Verification with Invariant Synthesis?},
author={Anjiang Wei and Tarun Suresh and Tianran Sun and Haoze Wu and Ke Wang and Alex Aiken},
year={2025},
eprint={2509.21629},
archivePrefix={arXiv},
primaryClass={cs.PL},
url={https://arxiv.org/abs/2509.21629},
}This project is licensed under the Apache License 2.0.