[Model] GPT2ForSequenceClassification model #19663

Merged · 10 commits into vllm-project:main · Jun 20, 2025
Conversation

@nie3e (Contributor) commented Jun 15, 2025

Support GPT2ForSequenceClassification model.
Tested on nie3e/sentiment-polish-gpt2-small
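
How the server was launched is not shown in the PR; presumably something along the lines of vLLM's OpenAI-compatible server in classification mode, e.g. (an assumption, not stated in the description):

vllm serve nie3e/sentiment-polish-gpt2-small --task classify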

Test:

curl -X 'POST' \
  'http://127.0.0.1:8000/classify' \
  -H 'Content-Type: application/json' \
  -d '{
  "model": "nie3e/sentiment-polish-gpt2-small",
  "input": ["Przestronny hotel , jasny , z dużymi oknami .", "Położony całkiem blisko centrum ."]
}'

Result:

{"id":"classify-6619cecdb01a4bf8900df136a9b33b15","object":"list","created":1749994841,"model":"nie3e/sentiment-polish-gpt2-small","data":[{"index":0,"label":"POSITIVE","probs":[6.198883056640625e-06,1.7881393432617188e-07,1.0,7.569789886474609e-06],"num_classes":4},{"index":1,"label":"AMBIGUOUS","probs":[0.00013005733489990234,0.004421234130859375,0.005367279052734375,0.990234375],"num_classes":4}],"usage":{"prompt_tokens":17,"total_tokens":17,"completion_tokens":0,"prompt_tokens_details":null}}

Signed-off-by: nie3e <adrcwiek@gmail.com>

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, covering a small and essential subset of tests to quickly catch errors. You can run additional CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

@gemini-code-assist bot left a comment

Summary of Changes

Hello @nie3e, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

I've summarized this pull request which adds support for the GPT2ForSequenceClassification model. This enables users to run GPT-2 models fine-tuned for sequence classification tasks directly within vLLM, leveraging its optimized inference capabilities.

Highlights

  • New Model Class: Introduces GPT2ForSequenceClassification, a new class inheriting from nn.Module that wraps the base GPT2Model and adds classification-specific layers.
  • Classification Logic: The new class includes a pooler method that implements last-token pooling, applies a linear scoring layer, and uses a Softmax activation to produce classification scores (a rough sketch follows this list).
  • Model Registration: The GPT2ForSequenceClassification model is registered in the model registry, making it discoverable and loadable by vLLM.
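
A rough sketch of what such a wrapper can look like, written in plain PyTorch/Transformers for self-containment; the class name, helper names, and structure here are illustrative assumptions, not the exact code from this PR:

# Illustrative sketch (plain PyTorch, not the vLLM implementation):
# wrap a base GPT-2 model, pool the last token's hidden state, score it
# with a linear layer, and apply softmax to get class probabilities.
import torch
import torch.nn as nn
from transformers import GPT2Model


class GPT2ForSequenceClassificationSketch(nn.Module):
    def __init__(self, model_name: str, num_classes: int):
        super().__init__()
        self.gpt2 = GPT2Model.from_pretrained(model_name)
        # Classification head: hidden_size -> num_classes, no bias,
        # mirroring HF's GPT2ForSequenceClassification `score` layer.
        self.score = nn.Linear(self.gpt2.config.hidden_size, num_classes, bias=False)
        self.softmax = nn.Softmax(dim=-1)

    def forward(self, input_ids: torch.Tensor, attention_mask: torch.Tensor) -> torch.Tensor:
        hidden = self.gpt2(input_ids=input_ids, attention_mask=attention_mask).last_hidden_state
        # Last-token pooling: pick the hidden state of the final non-padding token.
        last_idx = attention_mask.sum(dim=1) - 1
        pooled = hidden[torch.arange(hidden.size(0)), last_idx]
        return self.softmax(self.score(pooled))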

@gemini-code-assist bot left a comment

Code Review

This pull request successfully adds support for GPT2ForSequenceClassification. The implementation is clear and aligns well with vLLM's architecture. The pooling mechanism correctly utilizes the last token's representation for classification. One minor improvement is suggested for the nn.Softmax layer initialization to ensure explicitness and future compatibility.
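
The suggested change presumably amounts to passing the normalization dimension explicitly instead of relying on the implicit (and deprecated) default; a minimal illustration:

# Explicit is better: state which dimension softmax normalizes over.
# Omitting `dim` triggers PyTorch's implicit-dimension deprecation warning.
import torch
import torch.nn as nn

softmax = nn.Softmax(dim=-1)      # normalize over the class/logit dimension
logits = torch.randn(2, 4)
print(softmax(logits).sum(dim=-1))  # each row sums to 1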

nie3e and others added 2 commits June 15, 2025 16:31
Adds dim to Softmax

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Signed-off-by: nie3e <adrcwiek@gmail.com>
Signed-off-by: nie3e <adrcwiek@gmail.com>
nie3e and others added 2 commits June 18, 2025 18:05
@nie3e (Contributor, Author) commented Jun 18, 2025

@Isotr0py You are right! Thanks to your help the code looks much simpler!

@Isotr0py (Collaborator) left a comment

LGTM! Thanks for your patience!

@Isotr0py Isotr0py enabled auto-merge (squash) June 18, 2025 16:47
@github-actions bot added the ready (ONLY add when PR is ready to merge/full CI is needed) label on Jun 18, 2025
@nie3e (Contributor, Author) commented Jun 18, 2025

@Isotr0py one thing before I fix the tests: I am not sure whether this model is correctly placed in _CROSS_ENCODER_MODELS. What do you think?

Edit: _TEXT_GENERATION_MODELS might be a good place.

@Isotr0py (Collaborator) commented:

You can put it in _EMBEDDING_MODELS.
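
For context, vLLM's model registry maps architecture names to (module, class-name) pairs; the resulting entry would presumably look roughly like the following sketch (the exact file contents and surrounding entries are not shown in this thread):

# Illustrative sketch of a registry entry (assumption: the dict maps
# architecture name -> (module name, class name) as elsewhere in the registry).
_EMBEDDING_MODELS = {
    # ...existing pooling/classification entries...
    "GPT2ForSequenceClassification": ("gpt2", "GPT2ForSequenceClassification"),
}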

Signed-off-by: nie3e <adrcwiek@gmail.com>
auto-merge was automatically disabled June 19, 2025 08:45

Head branch was pushed to by a user without write access


mergify bot commented Jun 19, 2025

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @nie3e.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

Signed-off-by: nie3e <adrcwiek@gmail.com>

mergify bot commented Jun 19, 2025

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @nie3e.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

Signed-off-by: nie3e <adrcwiek@gmail.com>
Signed-off-by: nie3e <adrcwiek@gmail.com>
@Isotr0py Isotr0py enabled auto-merge (squash) June 19, 2025 11:02
@nie3e (Contributor, Author) commented Jun 19, 2025

@Isotr0py Can we somehow rerun v1-test?

Timed Out · Waited 1m 30s · Ran in 3h 0m

@Isotr0py (Collaborator) commented:

#19872 should fix the CI deadlock; you can merge from main to get the fix.

@Isotr0py Isotr0py merged commit f1e840e into vllm-project:main Jun 20, 2025
72 checks passed
chris-relational pushed a commit to chris-relational/vllm that referenced this pull request Jun 20, 2025
Signed-off-by: nie3e <adrcwiek@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

@DarkLight1337 (Member) commented:

Can you open another PR to update the Supported Models page?

yeqcharlotte pushed a commit to yeqcharlotte/vllm that referenced this pull request Jun 22, 2025
Signed-off-by: nie3e <adrcwiek@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
juncheoll pushed a commit to juncheoll/vllm that referenced this pull request Jun 23, 2025
Signed-off-by: nie3e <adrcwiek@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Signed-off-by: juncheoll <th6re8e@naver.com>
fhl2000 pushed a commit to fhl2000/vllm that referenced this pull request Jun 25, 2025
Signed-off-by: nie3e <adrcwiek@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Signed-off-by: fhl <2410591650@qq.com>
gmarinho2 pushed a commit to gmarinho2/vllm that referenced this pull request Jun 26, 2025
Signed-off-by: nie3e <adrcwiek@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Labels: ready (ONLY add when PR is ready to merge/full CI is needed)
Projects: None yet
4 participants