Skip to content

Conversation

@zucchini-nlp
Copy link
Member

What does this PR do?

As per title, got a ping from our CI bot about it

@github-actions github-actions bot marked this pull request as draft March 19, 2025 12:21
@github-actions
Copy link
Contributor

Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. When it is ready for review, please click the Ready for review button (at the bottom of the PR page).

@zucchini-nlp zucchini-nlp marked this pull request as ready for review March 19, 2025 12:21
@github-actions github-actions bot requested a review from ydshieh March 19, 2025 12:21
@github-actions
Copy link
Contributor

This comment contains run-slow, running the specified jobs: This comment contains run-slow, running the specified jobs:

models: ['models/gemma3']
quantizations: [] ...

@zucchini-nlp
Copy link
Member Author

run-slow: gemma3

@github-actions
Copy link
Contributor

This comment contains run-slow, running the specified jobs: This comment contains run-slow, running the specified jobs:

models: ['models/gemma3']
quantizations: [] ...

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Copy link
Collaborator

@ydshieh ydshieh left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

works for me as long as you are able to get it work on slow CI :-)

Thanks a lot

@zucchini-nlp
Copy link
Member Author

run-slow: gemma3

@github-actions
Copy link
Contributor

This comment contains run-slow, running the specified jobs: This comment contains run-slow, running the specified jobs:

models: ['models/gemma3']
quantizations: [] ...

@ydshieh
Copy link
Collaborator

ydshieh commented Mar 20, 2025

run-slow: gemma3

@github-actions
Copy link
Contributor

This comment contains run-slow, running the specified jobs: This comment contains run-slow, running the specified jobs:

models: ['models/gemma3']
quantizations: [] ...

@zucchini-nlp
Copy link
Member Author

The failing test -> 'tests/models/gemma3/test_modeling_gemma3.py::Gemma3Vision2TextModelTest::test_model_parallel_beam_search'

is not related to the PR and seems was broken always. I don't know if static cache + multi GPU ever worked with VLMs. In Gemma3 we can't turn off static cache, so will check later. Not big issue for now I hope

Will merge to unblock others :)

@zucchini-nlp zucchini-nlp merged commit 42c489f into huggingface:main Mar 20, 2025
13 of 14 checks passed
zucchini-nlp added a commit to zucchini-nlp/transformers that referenced this pull request May 14, 2025
* fix test

* require_read_token and public repo ids

* flash-attn test uncomment

* fix torchscript
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants