@Duyi-Wang
Contributor

No description provided.

@Duyi-Wang Duyi-Wang linked an issue Jun 6, 2024 that may be closed by this pull request
@Duyi-Wang Duyi-Wang added the bug and continuous batching labels Jun 6, 2024
Contributor

@pujiang2018 pujiang2018 left a comment

Next version, we may not need FP32 for logits.

@Duyi-Wang
Contributor Author

Duyi-Wang commented Jun 6, 2024

> Next version, we may not need FP32 for logits.

That would reduce communication overhead, but we still need to convert the logits to FP32 when they are passed to Python for sampling.
We can do the conversion and the reorder at the same time.
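
As a rough illustration of fusing the two steps, here is a minimal pure-Python sketch. It assumes the lower-precision logits are bfloat16 and that "reorder" means moving each row of logits to a target position; neither detail is stated in this thread. The names `bf16_bits_to_fp32`, `convert_and_reorder`, and `dest_index` are hypothetical and are not taken from the project's code.

```python
import struct


def bf16_bits_to_fp32(bits: int) -> float:
    """Widen one bfloat16 value, given as its 16-bit pattern, to a float."""
    # bfloat16 is the upper 16 bits of an IEEE-754 float32,
    # so widening is a 16-bit left shift of the bit pattern.
    return struct.unpack("<f", struct.pack("<I", (bits & 0xFFFF) << 16))[0]


def convert_and_reorder(rows_bf16, dest_index):
    """Convert each row to FP32 and write row i to position dest_index[i].

    Both steps happen in the same pass, so no intermediate FP32 copy
    in the original order is ever built.
    """
    if sorted(dest_index) != list(range(len(rows_bf16))):
        raise ValueError("dest_index must be a permutation of the row indices")
    out = [None] * len(rows_bf16)
    for src, dst in enumerate(dest_index):
        out[dst] = [bf16_bits_to_fp32(b) for b in rows_bf16[src]]
    return out


# Two rows of bf16 bit patterns (assumed layout): [1.0, 2.0] and [-0.5, 0.0].
rows = [[0x3F80, 0x4000], [0xBF00, 0x0000]]
reordered = convert_and_reorder(rows, dest_index=[1, 0])
```

A real implementation would presumably do this in the native backend over contiguous buffers; the point here is only that the widening and the row placement can share one loop.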

@Duyi-Wang Duyi-Wang merged commit ba79f6f into intel:main Jun 6, 2024
@Duyi-Wang Duyi-Wang deleted the fix_multi_rank_cb_issue branch June 6, 2024 08:10

Labels

bug, continuous batching

Development

Successfully merging this pull request may close these issues.

Crash when using CB mode with multi-rank

2 participants