Insights: pytorch/rl
Overview
- 0 Active issues
- 10 Merged pull requests
- 1 Open pull request
- 0 Closed issues
- 0 New issues
10 Pull requests merged by 2 people
- [Feature] LLM collector (#2879, merged Apr 4, 2025)
- [Feature] History API (#2890, merged Apr 4, 2025)
- [BugFix] Fix compile compatibility of PPO losses (#2889, merged Apr 3, 2025)
- [Feature] Pass lists of policy_factory (#2888, merged Apr 3, 2025)
- [Refactor] Fix repeats order (#2887, merged Apr 3, 2025)
- [Test] Fix warnings in tests (#2886, merged Apr 3, 2025)
- [BugFix] Fix .item() warning on tensors that require grad (#2885, merged Apr 3, 2025)
- [Feature] Support lazy tensordict inputs in ppo loss (#2883, merged Apr 2, 2025)
- [Refactor] MaskedCategorical cross_entropy usage for faster loss (#2882, merged Apr 2, 2025)
- [Refactor] Avoid padding in transformer wrapper (#2881, merged Apr 2, 2025)
1 Pull request opened by 1 person
- [Feature] Support lazy tensordict inputs in KL reward transform (#2884, opened Apr 2, 2025)
1 Unresolved conversation
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
- v0 param server (using collectives not object store) (#2865, commented on Mar 28, 2025, 1 new comment)