0.20.6
Server deployment
Memory optimization
This release reduces peak server memory usage. Previously, memory grew with the total number of instances ever submitted; this is now fixed. We recommend upgrading if memory usage increases over time.
Logs storage
Fluent Bit + Elasticsearch/OpenSearch
Run logs can now be shipped to your own log storage via Fluent Bit. If Fluent Bit forwards the logs to Elasticsearch or OpenSearch, dstack can also read them back from there to display in the UI and CLI.
See the docs for more details.
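As a rough illustration of what reading run logs back from Elasticsearch/OpenSearch can look like, the sketch below builds a standard Query DSL search body that filters by run and pages forward by timestamp. It is not dstack's implementation: the helper `build_log_query` and the field names `run_name` and `@timestamp` are hypothetical placeholders, and the real index and field names depend on how your Fluent Bit output is configured.

```python
import json
from typing import Any, Dict, List, Optional


def build_log_query(
    run_name: str,
    size: int = 100,
    after_timestamp: Optional[str] = None,
) -> Dict[str, Any]:
    """Build a search body that fetches one run's log entries, oldest first.

    Assumption: "run_name" is a keyword-mapped field and "@timestamp" holds
    the event time. Both names are placeholders, not dstack's schema.
    """
    filters: List[Dict[str, Any]] = [{"term": {"run_name": run_name}}]
    if after_timestamp is not None:
        # Only fetch entries newer than the last one already displayed.
        filters.append({"range": {"@timestamp": {"gt": after_timestamp}}})
    return {
        "size": size,
        "sort": [{"@timestamp": {"order": "asc"}}],
        "query": {"bool": {"filter": filters}},
    }


if __name__ == "__main__":
    body = build_log_query("my-run", size=50, after_timestamp="2025-01-01T00:00:00Z")
    print(json.dumps(body, indent=2))
```

The resulting dictionary can be sent as the JSON body of a `_search` request against the index that Fluent Bit writes to. Using `filter` clauses rather than `must` skips relevance scoring, which is unnecessary when fetching logs in time order.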
Fleets
Since 0.20, dstack requires at least one fleet to exist before you can submit any runs. To make this easier, we’ve simplified default fleet creation during project setup in the UI.
In addition, if your project doesn’t have a fleet, the UI will prompt you to create one.
What's Changed
- Hotfix. Fixed generation fleet fields in project forms by @olgenn in #3486
- Add missing Box imports by @r4victor in #3485
- Use the same metrics endpoint label for 404 requests by @r4victor in #3455
- Refactoring Inspect page by @olgenn in #3457
- Migrate from Slurm by @peterschmidt85 in #3454
- [Internal]: Handle GitHub API errors in `release_notes.py` by @jvstme in #3463
- Display `InstanceAvailability.NO_BALANCE` in CLI by @jvstme in #3460
- Do not return `NO_BALANCE` to older clients by @jvstme in #3462
- Optimize job submissions loading by @r4victor in #3466
- [CLI] Add `--memory` option to `apply` and `offer` by @un-def in #3461
- [runner] Rework and fix user processing by @un-def in #3456
- Optimize fleet instances db queries by @r4victor in #3467
- Kubernetes: adjust offer GPU count by @un-def in #3469
- Add missing job status change event for scaling by @jvstme in #3465
- Fix `find_optimal_fleet_with_offers` log message by @un-def in #3470
- Fix missing instance lock in delete_fleets by @r4victor in #3471
- Optimize list and get fleets by @r4victor in #3472
- feat(logging): add fluent-bit log shipping by @DragonStuff in #3431
- Adjust fluent-bit logging integration by @r4victor in #3478
- Emit events for instance status changes by @jvstme in #3477
- [runner] Restore `--home-dir` option as no-op by @un-def in #3480
- [UI] Default fleet in project wizard by @olgenn in #3464
- Support shared AWS compute caches by @r4victor in #3483
- [UI] Minor re-order in the sidebar by @peterschmidt85 in #3484
Full changelog: 0.20.3...0.20.6