Commit

update
wellecks committed Oct 31, 2023
1 parent 747e4fc commit 29046a5
Showing 2 changed files with 13 additions and 6 deletions.
18 changes: 12 additions & 6 deletions README.md
@@ -1,5 +1,6 @@
# `llmstep`: [L]LM proofstep suggestions in Lean
*News*
- [11.2023] Experimental support for [*Llemma*](https://arxiv.org/abs/2310.10631) suggestions that leverage the document's content
- [10.2023] New paper describing version 1.0.0 of `llmstep`: [[paper](https://arxiv.org/abs/2310.18457)]
- [10.2023] `llmstep` adds direct support for reprover (`leandojo-lean4-tacgen-byt5-small`)
- [9.2023] `llmstep` adds support for free GPU servers via Google Colab
@@ -58,18 +59,23 @@ Please see the recommended servers below.
The `llmstep` tactic communicates with a server that you can run in your own environment (e.g., CPU, GPU, Google Colab). \
The table below shows the recommended language model and server commands:

| Environment | Command | Model | Speed | miniF2F-test |
| -------- | ------- | ------- | ------- | ------- |
| CPU | `python python/server_encdec.py` | `leandojo-lean4-tacgen-byt5-small` | 3.16s | 22.1\% |
| Colab GPU | See [Colab setup](#google-colab) | `llmstep-pythia-2.8b` | 1.68s | 27.9\% |
| CUDA GPU | `python python/server_vllm.py` | `llmstep-pythia-2.8b` | **0.25s** | **27.9\%** |
| Environment | Command | Model | Context | Speed | miniF2F-test |
| -------- | ------- | ------- | ------- | ------- | ------- |
| CPU | `python python/server_encdec.py` | `leandojo-lean4-tacgen-byt5-small` | State | 3.16s | 22.1\% |
| Colab GPU | See [Colab setup](#google-colab) | `llmstep-pythia-2.8b` | State | 1.68s | 27.9\% |
| CUDA GPU | `python python/server_vllm.py` | `llmstep-pythia-2.8b` | State | **0.25s** | **27.9\%** |
| CUDA GPU* | `python python/server_llemma.py`* | `llemma_7b`* | State, **_current file_** 🔥 | N/A | N/A |
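
For intuition, the tactic's exchange with the server is a plain HTTP round trip: `llmstep` sends the current goal (and, with Llemma, the surrounding file's content) and receives candidate next steps. Below is a minimal sketch of such a request in Python; the port and JSON field names are illustrative assumptions, so consult the server scripts for the exact wire format.

```python
# Minimal sketch of a suggestion request. The URL and field names are
# illustrative assumptions; see the server scripts for the actual format.
import requests

payload = {
    "tactic_state": "⊢ ∀ (n : ℕ), n + 0 = n",  # goal as displayed by Lean
    "prefix": "",                              # partially typed tactic, if any
}
response = requests.post("http://localhost:6000", json=payload)
print(response.json())  # e.g. a list of suggested next tactics
```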

Please refer to [our paper](https://arxiv.org/abs/2310.18457) for further information.
Please refer to [our paper](https://arxiv.org/abs/2310.18457) for further information on the benchmarks.

If your GPU does not support [vLLM](https://vllm.readthedocs.io/en/latest/), please use `python python/server.py` to start a server.

`llmstep` aims to be a model-agnostic tool. We welcome contributions of new models.
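
As a rough illustration of what a new model integration involves, the following sketch produces a suggestion with an arbitrary Hugging Face causal language model; the model name and prompt format are placeholders, and a real contribution would reuse the existing server scaffolding rather than this standalone snippet.

```python
# Hypothetical sketch: generate a proofstep suggestion from a Hugging Face
# causal LM. The model name and prompt format are placeholders; each model
# family defines its own.
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL = "your-org/your-model"  # placeholder model name

tokenizer = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForCausalLM.from_pretrained(MODEL)

prompt = "[GOAL]⊢ ∀ (n : ℕ), n + 0 = n[PROOFSTEP]"  # placeholder prompt format
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
suggestion = tokenizer.decode(
    outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
)
print(suggestion)
```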


\*[Llemma](https://arxiv.org/abs/2310.10631) support is currently experimental.


## Implementation
<img src="./docs/llmstep.png" width="700"/>

1 change: 1 addition & 0 deletions python/server_llemma.py
@@ -205,6 +205,7 @@ def get_config(args):

if __name__ == '__main__':
    parser = get_argparser()
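    # Default to the Llemma 7B checkpoint; an hf_model value passed on the
    # command line still overrides this default.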
    parser.set_defaults(hf_model='EleutherAI/llemma_7b')
    args = parser.parse_args()

    config = get_config(args)
