-
Notifications
You must be signed in to change notification settings - Fork 27.8k
Pull requests: huggingface/transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add
Qwen2VLImageProcessorFast
into Qwen2VLProcessor
#35987
opened Jan 31, 2025 by
yeliudev
Loading…
1 of 5 tasks
Display warning for unknown quants config instead of an error
#35963
opened Jan 29, 2025 by
SunMarc
Loading…
Add support for partial rotary embeddings in Phi3 model
#35947
opened Jan 28, 2025 by
garg-amit
Loading…
1 of 5 tasks
Fix how we compute the final non-padding token for ForSequenceClassification models
#35911
opened Jan 27, 2025 by
Rocketknight1
Loading…
Fix Gradient Checkpointing for Deberta & Deberta-V2 using PEFT / Adapters
#35898
opened Jan 26, 2025 by
lenglaender
Loading…
1 of 5 tasks
Fix XGLM loss computation (PyTorch and TensorFlow)
#35878
opened Jan 24, 2025 by
damianoamatruda
Loading…
[docs] no hard coding cuda as bnb has multi-backend support
#35867
opened Jan 24, 2025 by
faaany
Loading…
Fix device mismatch error in Whisper model during feature extraction
#35866
opened Jan 24, 2025 by
thedebugger
Loading…
Update doc re list of models supporting TP
#35864
opened Jan 23, 2025 by
kwen2501
Loading…
1 task done
Fix PaliGemma Pad Token Masking During Training #35855
#35859
opened Jan 23, 2025 by
sambhavnoobcoder
Loading…
Optimize Qwen2VL vision model by precomputing cos/sin embeds before ViT blocks
Multimodal
optimization
#35837
opened Jan 22, 2025 by
li-plus
Loading…
1 of 5 tasks
Fix multi gpu loss sync condition, add doc and test
#35743
opened Jan 17, 2025 by
techkang
Loading…
2 of 5 tasks
Make
output_dir
Optional in TrainingArguments
#27866
#35735
opened Jan 16, 2025 by
sambhavnoobcoder
Loading…
tests: revert change of torch_require_multi_gpu to be device agnostic
#35721
opened Jan 16, 2025 by
dvrogozh
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2025-01-30.