Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix EleutherAI/gpt-neox-20b does not work in tgi #2346

Merged
merged 1 commit into from
Aug 8, 2024

Conversation

sywangyi
Copy link
Contributor

@sywangyi sywangyi commented Aug 1, 2024

@OlivierDehaene OR @Narsil @danieldk please help review.
found gpt-neox-20b does not work in tgi. there's shape issue in rotary_emb
https://github.com/huggingface/transformers/blob/main/src/transformers/models/gpt_neox/modeling_gpt_neox.py#L221-L233
add similar logic in tgi could fix the crash.

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
@sywangyi
Copy link
Contributor Author

sywangyi commented Aug 8, 2024

@drbh

@drbh
Copy link
Collaborator

drbh commented Aug 8, 2024

Hi @sywangyi, thanks for the contribution! Just tested with EleutherAI/gpt-neox-20b locally and it loads and generates as expected 🙏

@drbh drbh merged commit 689b1ab into huggingface:main Aug 8, 2024
yuanwu2017 pushed a commit to yuanwu2017/tgi-gaudi that referenced this pull request Sep 26, 2024
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants