Skip to content

[Model] Add PLaMo2 #14323

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 43 commits into from
Apr 16, 2025
Merged

[Model] Add PLaMo2 #14323

merged 43 commits into from
Apr 16, 2025

Conversation

Alnusjaponica
Copy link
Contributor

@Alnusjaponica Alnusjaponica commented Mar 6, 2025

This PR adds support for PLaMo2, specifically PLaMo2 1B and PLaMo2 8B. PLaMo2 is a hybrid mamba2 architecture featuring sliding window attention.

In this PR, we have created the inference architecture for the PLaMo2 model.

Please note that some functionalities such as PP/TP, quantization, and LoRA are not yet supported in this PR. We plan to address these features in subsequent follow-up PRs.

Alnusjaponica and others added 5 commits March 4, 2025 22:09
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>

Co-Authored-By: Kento Nozawa <nzw0301@preferred.jp>
Co-Authored-By: Hiroaki Mikami <mhiroaki@preferred.jp>
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>

Co-authored-by: Calvin Metzger <metzger@preferred.jp>
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
Copy link

github-actions bot commented Mar 6, 2025

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

@mergify mergify bot added the ci/build label Mar 6, 2025
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
@Alnusjaponica
Copy link
Contributor Author

This is my first time submitting a PR to this repository, and I just joined the Slack. I believe I do not have permission to unblock additional CIs, so I would appreciate it if you could add me to the Buildkite org. If I have missed anything or if there are areas for improvement, please do let me know.

@Alnusjaponica Alnusjaponica marked this pull request as ready for review March 6, 2025 03:28
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
Alnusjaponica and others added 5 commits March 10, 2025 12:23
Signed-off-by: shemmi <shemmi@preferred.jp>
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
@nzw0301 nzw0301 mentioned this pull request Mar 12, 2025
1 task
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
@DarkLight1337
Copy link
Member

@tlrmchlsmth can you do a quick pass to check that the implementation of this model fits our architecture? No need to test for correctness since they are the model vendor

Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
@Alnusjaponica
Copy link
Contributor Author

Alnusjaponica commented Apr 7, 2025

@DarkLight1337 Thanks for your review. I've updated the tests.
@tlrmchlsmth Now that the public model is updated and I’ve also updated this PR according to the new changes, could you take another look?

Copy link
Member

@DarkLight1337 DarkLight1337 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry for forgetting about this, should be good to go now!

@DarkLight1337 DarkLight1337 enabled auto-merge (squash) April 15, 2025 04:12
@github-actions github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Apr 15, 2025
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
auto-merge was automatically disabled April 15, 2025 05:26

Head branch was pushed to by a user without write access

@Alnusjaponica
Copy link
Contributor Author

Thanks for your approval!
I found some tests fails after I merged the main branch, so I am trying to fix them. I would appreciated it if you could take another look once the test passes.

@Alnusjaponica
Copy link
Contributor Author

It seems unrelated part of the tests are still failing. I'll wait main branch to be fixed.

@DarkLight1337
Copy link
Member

Can you update models tests in

- VLLM_USE_V1=0 pytest -v -s models/test_initialization.py -k 'not llama4'
to run your model separate from the rest of the models? There is a known memory leak issue for Llama 4 in those tests so perhaps that's also the case for your model.

@Alnusjaponica
Copy link
Contributor Author

Alnusjaponica commented Apr 15, 2025

Sure, it passed in my environment, so I thought it was not relevant either. Let me see if other tests pass with the latest main branch at first.

Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
@Alnusjaponica
Copy link
Contributor Author

@DarkLight1337
Separating plamo2 passes the Basic Model Test, so might be the case for our models. Is there anything I can do to fix this?

@DarkLight1337
Copy link
Member

The problem doesn't appear in V1 so let's just merge this.

@vllm-bot vllm-bot merged commit 3badb02 into vllm-project:main Apr 16, 2025
63 of 69 checks passed
@Alnusjaponica
Copy link
Contributor Author

Thanks for the approval! Next, I'll I will submit a follow-up PR to provide support for TP/PP, etc.

lionelvillard pushed a commit to lionelvillard/vllm that referenced this pull request Apr 17, 2025
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
Signed-off-by: shemmi <shemmi@preferred.jp>
Co-authored-by: Kento Nozawa <nzw0301@preferred.jp>
Co-authored-by: Hiroaki Mikami <mhiroaki@preferred.jp>
Co-authored-by: Calvin Metzger <metzger@preferred.jp>
yangw-dev pushed a commit to yangw-dev/vllm that referenced this pull request Apr 21, 2025
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
Signed-off-by: shemmi <shemmi@preferred.jp>
Co-authored-by: Kento Nozawa <nzw0301@preferred.jp>
Co-authored-by: Hiroaki Mikami <mhiroaki@preferred.jp>
Co-authored-by: Calvin Metzger <metzger@preferred.jp>
Signed-off-by: Yang Wang <elainewy@meta.com>
jikunshang pushed a commit to jikunshang/vllm that referenced this pull request Apr 29, 2025
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
Signed-off-by: shemmi <shemmi@preferred.jp>
Co-authored-by: Kento Nozawa <nzw0301@preferred.jp>
Co-authored-by: Hiroaki Mikami <mhiroaki@preferred.jp>
Co-authored-by: Calvin Metzger <metzger@preferred.jp>
lk-chen pushed a commit to lk-chen/vllm that referenced this pull request Apr 29, 2025
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
Signed-off-by: shemmi <shemmi@preferred.jp>
Co-authored-by: Kento Nozawa <nzw0301@preferred.jp>
Co-authored-by: Hiroaki Mikami <mhiroaki@preferred.jp>
Co-authored-by: Calvin Metzger <metzger@preferred.jp>
RichardoMrMu pushed a commit to RichardoMrMu/vllm that referenced this pull request May 12, 2025
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
Signed-off-by: shemmi <shemmi@preferred.jp>
Co-authored-by: Kento Nozawa <nzw0301@preferred.jp>
Co-authored-by: Hiroaki Mikami <mhiroaki@preferred.jp>
Co-authored-by: Calvin Metzger <metzger@preferred.jp>
Signed-off-by: Mu Huai <tianbowen.tbw@antgroup.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ci/build documentation Improvements or additions to documentation ready ONLY add when PR is ready to merge/full CI is needed
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants