Prefill return first token #105

Merged: jwyang-google merged 16 commits into main from prefill-return-first-token on Jun 28, 2024

jwyang-google
Collaborator

@jwyang-google jwyang-google commented Jun 25, 2024

Modify JetStream so that prefill returns the first generated token. Testing with MLPerf LoadGen is still pending.
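To illustrate the idea, here is a minimal, self-contained sketch of a prefill step that returns the first token together with the cached prompt state. It is not JetStream's actual API: `Prefix`, `toy_logits`, `greedy`, `prefill`, and `generate` are hypothetical names, and the "model" is a toy deterministic scoring function used only so the example runs.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Prefix:
    """Hypothetical stand-in for the cached state produced by prefill."""
    tokens: List[int]


def toy_logits(tokens: List[int], vocab_size: int = 8) -> List[float]:
    """Toy deterministic 'model' (an assumption, not a real LLM)."""
    total = sum(tokens)
    return [float((total + 3 * v) % vocab_size) for v in range(vocab_size)]


def greedy(logits: List[float]) -> int:
    """Pick the index of the highest score."""
    return max(range(len(logits)), key=logits.__getitem__)


def prefill(prompt_tokens: List[int]) -> Tuple[Prefix, int]:
    """Process the prompt and return (cached state, first generated token).

    Sampling the first token here means the caller does not need a
    separate generate step before it can emit any output.
    """
    first_token = greedy(toy_logits(prompt_tokens))
    return Prefix(tokens=list(prompt_tokens)), first_token


def generate(prefix: Prefix, first_token: int, max_new_tokens: int) -> List[int]:
    """Continue decoding, starting from the token prefill already produced."""
    out = [first_token]
    context = prefix.tokens + [first_token]  # new list; prefix is not mutated
    while len(out) < max_new_tokens:
        nxt = greedy(toy_logits(context))
        out.append(nxt)
        context.append(nxt)
    return out


prefix, first = prefill([1, 2, 3])
completion = generate(prefix, first, max_new_tokens=3)
```

In this sketch the first element of `completion` is always the token returned by `prefill`, so a caller could stream it to the client as soon as prefill finishes.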

Contributor

@JoeZijunZhou JoeZijunZhou left a comment

The unit tests need to be updated and the CI/CD failures resolved before this can merge.

Contributor

@JoeZijunZhou JoeZijunZhou left a comment

LGTM! Thanks Junwei!

@jwyang-google jwyang-google merged commit c3fe3ce into main Jun 28, 2024
3 checks passed
@jwyang-google jwyang-google deleted the prefill-return-first-token branch June 28, 2024 15:42
vivianrwu pushed a commit to vivianrwu/JetStream that referenced this pull request Jul 2, 2024
Change prefill API to return first token.
2 participants