Pull requests: huggingface/transformers
Pull requests list
Fix the error where a keyword argument appearing before *args
#41099
opened Sep 23, 2025 by
cyyever
Add support for kernels-ext-npu/RMSNorm acceleration on npu
#41098
opened Sep 23, 2025 by
zheliuyu
2 of 5 tasks
Delay and probably avoid unnecessary graph breaks in _upad_input of modeling_flash_attention_utils.py
#41097
opened Sep 23, 2025 by
cyyever
Fix attention sink implementation in flex attention
#41083
opened Sep 23, 2025 by
SamuelBarryCS
Fix: add num_hidden_layers property to T5GemmaConfig and add test for use_cache
#41077
opened Sep 22, 2025 by
priyankabolem
Fix Qwen3 deterministic generation when do_sample=False and num_beams=1 for Greedy Decoding
#41075
opened Sep 22, 2025 by
Flakes342
4 of 5 tasks
[tests] CausalLMTester automatically infers other test classes from base_model_class 🐛 🔫
#41066
opened Sep 22, 2025 by
gante