
Allow for model kwargs when loading transformers from pretrained #754


Merged: 4 commits into main on May 21, 2025

Conversation

NathanHB (Member)

No description provided.

@NathanHB NathanHB requested a review from Copilot May 20, 2025 14:13
@NathanHB NathanHB linked an issue May 20, 2025 that may be closed by this pull request
@HuggingFaceDocBuilderDev (Collaborator)

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Copilot AI (Contributor) left a comment


Pull Request Overview

This PR enhances the TransformersModelConfig to accept arbitrary keyword arguments when loading a pretrained Hugging Face model, removes the hardcoded generation_size setting, and updates the internal loader to pass through these custom kwargs.

  • Introduce model_loading_kwargs in the config
  • Remove the generation_size field
  • Update _create_auto_model to merge and pass model_loading_kwargs instead of a local kwargs dict (a usage sketch follows this list)
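
For illustration, the new field could be used like this. This is a minimal sketch assuming the post-PR API; the model_name field and the example kwargs are assumptions, not taken from the diff:

from lighteval.models.transformers.transformers_model import TransformersModelConfig

# Hypothetical usage: arbitrary from_pretrained kwargs ride along in the config.
config = TransformersModelConfig(
    model_name="gpt2",  # assumed field name for the checkpoint to load
    model_loading_kwargs={
        "low_cpu_mem_usage": True,   # standard transformers loading kwarg
        "trust_remote_code": False,  # forwarded verbatim to from_pretrained
    },
)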
Comments suppressed due to low confidence (1)

src/lighteval/models/transformers/transformers_model.py:139

  • Removing the generation_size field is a breaking change for users. Consider deprecating it first, or updating the documentation and release notes to guide consumers through the migration (one possible deprecation pattern is sketched below).
generation_size: PositiveInt = 256
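
One possible softer migration path, sketched as illustration only (hypothetical helper, not code from this PR):

import warnings

# Illustrative deprecation shim: accept the old field for one release
# cycle and warn instead of breaking existing callers outright.
def warn_if_generation_size(generation_size=None):
    if generation_size is not None:
        warnings.warn(
            "generation_size is deprecated; set the generation length "
            "(e.g. max_new_tokens) via generation parameters instead.",
            DeprecationWarning,
            stacklevel=2,
        )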

Comment on lines 387 to 388

 if "quantization_config" not in pretrained_config.to_dict():
-    kwargs["quantization_config"] = quantization_config
+    self.config.model_loading_kwargs["quantization_config"] = quantization_config
Copilot AI May 20, 2025


Mutating config.model_loading_kwargs in place can lead to unexpected state if _create_auto_model is called multiple times. Consider merging into a local dict and passing that to from_pretrained.
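
A rough sketch of that suggestion, using the names visible in the diff above; the surrounding loader signature is assumed for the sake of a self-contained example:

from transformers import AutoModelForCausalLM

def create_auto_model(config, pretrained_config, quantization_config, model_name):
    # Copy into a local dict so repeated calls never mutate the shared config.
    kwargs = dict(config.model_loading_kwargs)
    if "quantization_config" not in pretrained_config.to_dict():
        kwargs["quantization_config"] = quantization_config
    return AutoModelForCausalLM.from_pretrained(model_name, **kwargs)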


NathanHB and others added 2 commits May 20, 2025 16:28
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
@NathanHB NathanHB requested a review from Copilot May 20, 2025 15:00
Copilot AI (Contributor) left a comment


Pull Request Overview

This PR introduces support for passing custom keyword arguments when loading pretrained transformer models, enabling more flexible configuration of model loading. It also replaces the fixed "generation_size" parameter with a more general "model_loading_kwargs" field.

  • Removed the fixed generation_size parameter.
  • Added a new model_loading_kwargs field to the configuration.
  • Updated the auto model creation to copy the provided kwargs (see the illustration below).
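
Concretely, the copied kwargs are splatted into the transformers loader. A standalone illustration using public transformers arguments (not lighteval code):

import torch
from transformers import AutoModelForCausalLM

# Any from_pretrained keyword can now be supplied through
# model_loading_kwargs and is forwarded unchanged.
model_loading_kwargs = {"torch_dtype": torch.bfloat16, "low_cpu_mem_usage": True}
model = AutoModelForCausalLM.from_pretrained("gpt2", **model_loading_kwargs)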
Comments suppressed due to low confidence (1)

src/lighteval/models/transformers/transformers_model.py:139

  • The removal of the fixed generation_size parameter may be a breaking change. If this change is intentional, please update the documentation to clarify the impact on users and ensure that all dependent functionalities are adjusted accordingly.
-    generation_size: PositiveInt = 256

@NathanHB NathanHB self-assigned this May 20, 2025
@NathanHB NathanHB added the feature/enhancement label May 20, 2025
@NathanHB NathanHB requested a review from clefourrier May 20, 2025 15:27
@NathanHB NathanHB merged commit ce1dbb5 into main May 21, 2025
5 checks passed
hynky1999 pushed a commit that referenced this pull request May 22, 2025
Labels: feature/enhancement (New feature/request)
Projects: none yet
Development: successfully merging this pull request may close the issue "[FT] better support for model loading args in transformers"
3 participants