Skip to content

Commit

Permalink
revert experience maker
Browse files Browse the repository at this point in the history
Rate limit · GitHub

Whoa there!

You have triggered an abuse detection mechanism.

Please wait a few minutes before you try again;
in some cases this may take up to an hour.

openllmai0 committed Jul 8, 2024
1 parent 688cb0d commit a76ed31
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions openrlhf/trainer/ppo_utils/experience_maker.py
Original file line number Diff line number Diff line change
@@ -369,11 +369,11 @@ def _generate_vllm(self, prompts: List[str], **kwargs) -> Tuple[torch.Tensor, to
for output in outputs:
# left padding input
input_len = len(output.prompt_token_ids)
input_ids = [pad_token_id] * (max_input_len - input_len) + list(output.prompt_token_ids)
input_ids = [pad_token_id] * (max_input_len - input_len) + output.prompt_token_ids

# right padding output
output_len = len(output.outputs[0].token_ids)
output_ids = list(output.outputs[0].token_ids) + [pad_token_id] * (max_output_len - output_len)
output_ids = output.outputs[0].token_ids + [pad_token_id] * (max_output_len - output_len)

if output_ids[output_len - 1] != eos_token_id:
output_ids[min(output_len, len(output_ids) - 1)] = eos_token_id

0 comments on commit a76ed31

Please sign in to comment.