Add Dataclasses and RepoEnv Info refac #50
Conversation
Fix test_human
example_agent/utils.py
Outdated
]
prp = self.prompt_response_pairs[game_step]
if prp and include_prompt_response_pairs:
    json_out["prompt_response_pairs"] = self._format_prompt_response_pairs(
I'm changing the format of `prompt_response_pairs` when the prompt is a list of messages; see `_format_prompt_response_pairs` and tests/test_agents.py:193. I'm not sure I understand the previous logic when there are multiple message turns. Now the messages are concatenated into one prompt.
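For illustration only, a minimal sketch of concatenating a message list into one prompt string; the helper name and message shape are assumptions, not the actual `_format_prompt_response_pairs`:

```python
# Illustrative only: concatenate chat-style messages into a single prompt string.
# The message shape ({"role": ..., "content": ...}) is an assumption.
def concat_messages(messages: list[dict]) -> str:
    return "\n".join(f"{m['role']}: {m['content']}" for m in messages)


# concat_messages([{"role": "system", "content": "You are a debugger."},
#                  {"role": "user", "content": "Fix the failing test."}])
# -> "system: You are a debugger.\nuser: Fix the failing test."
```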
This was for the cases where there are multiple LLM calls in a single env.step. For instance, in some CoT settings, we first call the LLM to generate the reasoning string, then, conditioned on this string, we ask the LLM again to generate an action. I believe it's better to keep the list instead of concatenating, because a list is presented more cleanly by json.dumps when creating prompts.
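A rough sketch of that two-call CoT pattern and why a list of pairs serializes cleanly; `llm`, the message shape, and the pair fields here are hypothetical, not this repo's API:

```python
import json

# Hypothetical two-call CoT step: one LLM call for the reasoning, a second for the action.
def cot_step(llm, observation: str):
    pairs = []

    reasoning_prompt = [{"role": "user", "content": f"Think step by step:\n{observation}"}]
    reasoning = llm(reasoning_prompt)
    pairs.append({"prompt": reasoning_prompt, "response": reasoning})

    action_prompt = [{"role": "user", "content": f"{reasoning}\nNow give the action."}]
    action = llm(action_prompt)
    pairs.append({"prompt": action_prompt, "response": action})

    # A list keeps each LLM call separate, so json.dumps(pairs, indent=2)
    # shows one entry per call instead of one concatenated string.
    return action, pairs
```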
I removed the string format, but moved `token_usage` inside `prompt_response_pairs`, since there is usage for each of the LLM calls.
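The per-call usage could then end up looking roughly like this; the field names are assumptions based on the discussion, not the exact schema:

```python
# Assumed shape: each prompt/response pair carries the token usage of its own LLM call.
json_out = {
    "prompt_response_pairs": [
        {
            "prompt": [{"role": "user", "content": "..."}],
            "response": "...",
            "token_usage": {"prompt": 512, "response": 48},
        },
        # one entry per LLM call within the same env.step
    ]
}
```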
… inside `prompt_response_pairs`
let's go!
Summary of changes:
- New dataclasses for environment info (`EnvInfo`) and LLM responses (`LLMResponse` and `TokenUsage`).
- `step` and `reset` always return an `EnvInfo` object instead of tuples.
- `LLMResponse` and `HistoryTracker` updates, including merging `HistoryTracker.save_prompt_response_pairs` into `HistoryTracker.step`.
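A rough sketch of what such dataclasses could look like; all field names here are illustrative assumptions, not the definitions introduced by this PR:

```python
from dataclasses import dataclass


@dataclass
class TokenUsage:
    prompt: int = 0      # tokens consumed by the prompt (illustrative fields)
    response: int = 0    # tokens in the generated response


@dataclass
class LLMResponse:
    prompt: list | str                     # messages (or string) sent to the LLM
    response: str                          # raw text returned by the LLM
    token_usage: TokenUsage | None = None  # usage for this single LLM call


@dataclass
class EnvInfo:
    observation: str = ""
    score: float = 0.0
    done: bool = False


# step/reset returning a single object instead of a tuple:
# info = env.reset()
# info = env.step(action)
# print(info.observation, info.done)
```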