[v0.4.0] Release Tracker #3155

Closed
1 of 3 tasks
zhuohan123 opened this issue Mar 2, 2024 · 11 comments
Labels
release (Related to new version release), v0.3.4

Comments

@zhuohan123
Member

zhuohan123 commented Mar 2, 2024

ETA: Before Mar 28th

Major changes

TBD.

PRs to be merged before the release

@zhuohan123 added the release and v0.3.4 labels on Mar 2, 2024
@zhuohan123 pinned this issue on Mar 2, 2024
@nivibilla

@zhuohan123 do you think the second part for int8 inference should also be added into the roadmap?
#1508

@miko7879

miko7879 commented Mar 6, 2024

Would it be possible to request that the following is merged prior to release: #2961

It is preventing us from being able to deploy models using vLLM in certain contexts.

@Xu-Chen

Xu-Chen commented Mar 7, 2024

This amazing MR created by @chu-tianxiang was originally supposed to be merged into the previous version. Hope it can be merged this time. The speed improvement is very obvious.

@zhaoyang-star
Contributor

@zhuohan123 Hope Prefix Caching with FP8 KV cache support (#3234) could be merged.

@GennVa

GennVa commented Mar 14, 2024

@zhuohan123 Would it be possible to merge #3233 in this release?
It is a solution for #2398 and #3118.
Thanks

@robertgshaw2-neuralmagic
Collaborator

Cmake -> #2830

@simon-mo
Collaborator

I'm planning to merge usage reporting PR by Monday.

@JMHenri

JMHenri commented Mar 25, 2024

We are very hopeful to have the JAIS commit in v0.3.4 4c07dd2

@wsp317

wsp317 commented Mar 26, 2024

When do you plan to release v0.3.4?

@simon-mo changed the title from "[v0.3.4] Release Tracker" to "[v0.4.0] Release Tracker" on Mar 26, 2024
@simon-mo
Collaborator

simon-mo commented Mar 26, 2024

We have a hard deadline to release it this week. Sorry about the delay.

@grandiose-pizza
Contributor

> We are very hopeful to have the JAIS commit in v0.3.4 4c07dd2

Hi @JMHenri, please let me know if you face any issues with the JAIS models. Since we don't have a chat_template in the tokenizer, please feel free to reach out if anything comes up.

You can certainly look into this for reference on usage.

@WoosukKwon unpinned this issue on Apr 3, 2024