
FEAT: add HF tags for models that have been trained with llama-factory #2474

Merged (1 commit, Feb 14, 2024)

Conversation

younesbelkada (Contributor)

Hi @hiyouga!

Great work on the Llama-Factory library! 🚀
I would like to propose a new feature: model tagging.

With transformers 4.37.0 we added model tagging support: models pushed to the Hub are tagged automatically, so you can filter models that carry a specific tag. For example, if you want to filter models that have the tag trl you can use this link: https://huggingface.co/models?other=trl — so for llama-factory: https://huggingface.co/models?other=llama-factory

I see this is already done in the get_modelcard_args method, but add_model_tags lets you push the tag even if you never call trainer.push_to_hub: simply calling model.push_to_hub() will push the model together with the llama-factory tag.
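The intended flow can be sketched with a pure-Python mimic (an illustration, not the transformers implementation — the real add_model_tags lives on PreTrainedModel and push_to_hub performs a network upload):

```python
# Minimal mimic of the tagging flow being proposed: add_model_tags
# stores tags on the model object itself, so a later model.push_to_hub()
# can include them even when trainer.push_to_hub is never called.
class MiniModel:
    def __init__(self):
        self.model_tags = None

    def add_model_tags(self, tags):
        # Record each tag once, skipping duplicates.
        if self.model_tags is None:
            self.model_tags = []
        for tag in tags:
            if tag not in self.model_tags:
                self.model_tags.append(tag)

    def push_to_hub(self):
        # Stand-in for the real network call: return the tags that
        # would end up on the Hub model card.
        return list(self.model_tags or [])


model = MiniModel()
model.add_model_tags(["llama-factory"])
print(model.push_to_hub())  # ['llama-factory']
```

Because the tags travel with the model object rather than the trainer, they survive any code path that ends in model.push_to_hub().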

Let me know if this PR makes sense; otherwise we can close it, since technically the tags are already pushed. This PR just covers the case where users use your trainer but push the model directly rather than the trainer itself.

hiyouga (Owner) commented Feb 13, 2024

Hi @younesbelkada! Thanks for adding this patch; we are happy to adopt the new feature from transformers 4.37.0.

However, we found the implementation of push_to_hub in Hugging Face's Trainer a bit strange: [1]

  1. "model_tags" are only appended when the user passes the "tags" argument to push_to_hub.
  2. It would result in duplicate tags if there were an overlap between "model_tags" and "tags".
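The first point can be illustrated with a simplified pure-Python mimic of the tag-merging block being discussed (an approximation for illustration, not the actual transformers source at the linked lines):

```python
# Simplified mimic of the Trainer.push_to_hub tag-merging behaviour
# described in point 1: model_tags are merged into kwargs only when the
# caller already passes a "tags" argument, so tags added via
# add_model_tags are silently dropped otherwise.
def merge_tags(model_tags, **kwargs):
    if "tags" in kwargs and model_tags is not None:
        # Normalize a single string tag into a list.
        if isinstance(kwargs["tags"], str):
            kwargs["tags"] = [kwargs["tags"]]
        for model_tag in model_tags:
            if model_tag not in kwargs["tags"]:
                kwargs["tags"].append(model_tag)
    return kwargs.get("tags")


print(merge_tags(["llama-factory"]))                  # None -> the model tag is dropped
print(merge_tags(["llama-factory"], tags=["custom"]))  # ['custom', 'llama-factory']
```

The first call shows the surprising case: without an explicit tags argument, the model's own tags never reach the Hub.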

I am curious about the reason behind this design. Looking forward to hearing back from you.

[1] https://github.com/huggingface/transformers/blob/v4.37.2/src/transformers/trainer.py#L3737-L3747

younesbelkada (Contributor, Author)

Hi @hiyouga

Thanks!
I had a look at your comment. I think I made a mistake when designing that logic; indeed, we should always push the tag if model.add_model_tags() is called. I opened huggingface/transformers#29009, which should fix this unintended behaviour.
Regarding your second point, I don't think there should be any duplicated tags; the block:

                if model_tag not in kwargs["tags"]:
                    kwargs["tags"].append(model_tag)

circumvents that. Even if duplicated tags did exist on the model card (which is unlikely given that guard plus https://github.com/huggingface/transformers/blob/main/src/transformers/utils/hub.py#L1142), the frontend will always display them as a single tag. See: https://huggingface.co/ybelkada/test-bert-tags/commit/631bcabcc22cf313dd63441a85b122317fce6680
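The guard quoted above can be demonstrated in isolation (a standalone snippet, not the transformers source):

```python
# Demo of the dedup guard: appending a model tag only when it is absent
# means an overlap between the caller's tags and the model's tags never
# produces duplicates.
kwargs = {"tags": ["llama-factory", "custom"]}
model_tags = ["llama-factory"]  # overlaps with kwargs["tags"]

for model_tag in model_tags:
    if model_tag not in kwargs["tags"]:
        kwargs["tags"].append(model_tag)

print(kwargs["tags"])  # ['llama-factory', 'custom'] -- no duplicate
```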

Let me know what you think!

hiyouga (Owner) commented Feb 14, 2024

@younesbelkada Thanks! I think the above modification is fine, and this PR can now be safely merged.

@hiyouga hiyouga merged commit 8a1b389 into hiyouga:main Feb 14, 2024
1 check passed
@younesbelkada younesbelkada deleted the add-hf-tags branch February 14, 2024 02:33
@hiyouga hiyouga added the solved This problem has been already solved label Feb 25, 2024