Open
Description
The terms "trajectory" and "episode" conflict. The term "rollout" also appears occasionally in the code and should be clarified or renamed as well.
See this thread: "trajectory" could be used for a possibly incomplete episode, whereas an "episode" should have both an initial and a terminal state.
"Rollout" should perhaps be reserved for MCTS usage.
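
To make the proposed distinction concrete, here is a minimal sketch of how the two terms could relate in code. All names (`Transition`, `Trajectory`, `starts_at_initial_state`, `is_episode`) are hypothetical and do not come from the codebase. The sketch assumes a trajectory is any contiguous sequence of transitions, and that it counts as an episode only when it begins at an initial state and ends at a terminal one.

```python
from dataclasses import dataclass, field
from typing import Any, List


@dataclass
class Transition:
    """One environment step (hypothetical container, not from the codebase)."""
    state: Any
    action: Any
    reward: float
    next_state: Any
    done: bool  # True if next_state is terminal


@dataclass
class Trajectory:
    """A contiguous sequence of transitions; may start or stop mid-episode."""
    transitions: List[Transition] = field(default_factory=list)
    # Assumption: the collector records whether the first state is an initial state.
    starts_at_initial_state: bool = False

    def is_episode(self) -> bool:
        """True only if the trajectory runs from an initial to a terminal state."""
        return (
            self.starts_at_initial_state
            and len(self.transitions) > 0
            and self.transitions[-1].done
        )


# A trajectory cut off before termination is not an episode.
partial = Trajectory(
    [Transition(0, 1, 0.0, 1, False)],
    starts_at_initial_state=True,
)

# A trajectory from the initial state to a terminal state is an episode.
full = Trajectory(
    [Transition(0, 1, 0.0, 1, False), Transition(1, 0, 1.0, 2, True)],
    starts_at_initial_state=True,
)
```

Under this reading, every episode is a trajectory but not the reverse, which would let "trajectory" stay as the general term while "episode" carries the stricter meaning.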