Insights: huggingface/accelerate
Overview
5 Pull requests merged by 4 people
- Protect import for device_mesh (#3742, merged Aug 22, 2025)
- Feat: add to_json (#3743, merged Aug 22, 2025)
- fix: CPU RAM efficient loading for nd or HSDP parallelisms (#3740, merged Aug 21, 2025)
- Fix convert LayerNorm without bias to fp8 (#3725, merged Aug 18, 2025)
- feat: add ignored_params support for fsdp2 (#3731, merged Aug 18, 2025)
4 Pull requests opened by 3 people
- [ND Parallel] Update examples, cleanup (#3737, opened Aug 18, 2025)
- [WIP] Upstreaming FSDP2 changes (#3739, opened Aug 19, 2025)
- Specify device_ids in torch.distributed.barrier for PartialState (#3744, opened Aug 22, 2025)
- [Context Parallel] Experimental support for flex attention (#3745, opened Aug 23, 2025)
3 Issues closed by 2 people
- Accelerate launch with ZeRO-3 and model parallelism on the same machine during multi-node training (#3738, closed Aug 22, 2025)
- Socket timeout displayed when PPO is trained on multiple machines (#3658, closed Aug 18, 2025)
- Weight is not tied within `init_empty_weights` (#3668, closed Aug 18, 2025)
1 Issue opened by 1 person
- `accelerate.prepare()` consumes a large amount of NPU memory (#3741, opened Aug 21, 2025)
13 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
- Deferred initialization support (#3666, commented on Aug 17, 2025 • 0 new comments)
- Transformer Engine memory-efficient initialization to convert_model for large models (#3652, commented on Aug 17, 2025 • 0 new comments)
- dtype issue when using accelerate (#3685, commented on Aug 18, 2025 • 0 new comments)
- Can devices be specified in distributed training? (#3726, commented on Aug 20, 2025 • 0 new comments)
- CUDA Out of Memory using 2 GPUs (#3735, commented on Aug 21, 2025 • 0 new comments)
- Advice with GPU computations in collator/pre-process (#3691, commented on Aug 21, 2025 • 0 new comments)
- [DeepSpeed] GPU VRAM usage increases each validation step (#3690, commented on Aug 21, 2025 • 0 new comments)
- What will happen if I use mix_precision but don't add `with accelerator.autocast():`? (#3689, commented on Aug 21, 2025 • 0 new comments)
- Multi-XPU launch error (#3664, commented on Aug 21, 2025 • 0 new comments)
- Logger message "dataset had no length" confusing when `drop_last=True` (#3693, commented on Aug 22, 2025 • 0 new comments)
- Bugs in accelerator.unwrap_model (#3683, commented on Aug 22, 2025 • 0 new comments)
- [WIP] Optimize infer_auto_device_map for multi-GPU allocation (#3321, commented on Aug 22, 2025 • 0 new comments)
- Unwrap before saving/loading sharded model (#3733, commented on Aug 22, 2025 • 0 new comments)