Skip to content

Commit

Permalink
Clarify the chance_recurrent_fn arguments.
Browse files Browse the repository at this point in the history
Fixes #60.

PiperOrigin-RevId: 555288953
  • Loading branch information
fidlej authored and MctxDev committed Aug 15, 2023
1 parent bfb7316 commit d38b186
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion mctx/_src/policies.py
Original file line number Diff line number Diff line change
Expand Up @@ -268,7 +268,7 @@ def stochastic_muzero_policy(
`(DecisionRecurrentFnOutput, afterstate_embedding)`.
chance_recurrent_fn: a callable to be called on the leaf chance nodes and
unvisited actions retrieved by the simulation step, which takes as args
`(params, rng_key, action, afterstate_embedding)` and returns a
`(params, rng_key, chance_outcome, afterstate_embedding)` and returns a
`(ChanceRecurrentFnOutput, state_embedding)`.
num_simulations: the number of simulations.
num_actions: number of environment actions.
Expand Down

0 comments on commit d38b186

Please sign in to comment.