[RLlib; DreamerV3] How to evaluate the model during training. #48533
Labels
docs
An issue or change related to documentation
P2
Important issue, but not time-critical
rllib
RLlib related issues
rllib-algorithms
An RLlib algorithm/Trainer is not learning.
rllib-evaluation
Bug affecting policy evaluation with RLlib.
I can obtain episode reward mean from the train result, but the fluctuation is very large, and it is difficult to judge when to stop the training iteration, so I hope to use the result of evaluate.
I tried two methods, but both failed (ray=2.38.0).
Method 1 uses evaluation config.
** error message (Same as #47527?) **
** code **
Method 2 uses evaluate function.
** error message **
** code **
The text was updated successfully, but these errors were encountered: