-
Notifications
You must be signed in to change notification settings - Fork 6.8k
Closed
Labels
P2Important issue, but not time-criticalImportant issue, but not time-criticalbugSomething that is supposed to be working; but isn'tSomething that is supposed to be working; but isn'ttriageNeeds triage (eg: priority, bug/not-bug, and owning component)Needs triage (eg: priority, bug/not-bug, and owning component)
Description
What is the problem?
The output fo the DQN model is not within the action space.
Something is wrong when constructing the torch model when dueling is off. The output dimension of the model is equal to whatever is passed in "fcnet_hiddens" instead of being of the size of the action space.
Ray version and other system information (Python version, TensorFlow version, OS):
- ray==0.9.0.dev0
- python 3.6.10
- mac OS
Reproduction (REQUIRED)
import ray
from ray import tune
ray.init()
config = {
"env": "CartPole-v1",
"num_workers": 1,
"train_batch_size": 128,
"learning_starts": 128,
"model": {"fcnet_hiddens": [32]},
"dueling": False ,
"framework": "torch"
}
tune.run("DQN", name="MWE", config=config, stop={"training_iteration": 100})
- I have verified my script runs in a clean environment and reproduces the issue.
- I have verified the issue also occurs with the latest wheels.
Metadata
Metadata
Assignees
Labels
P2Important issue, but not time-criticalImportant issue, but not time-criticalbugSomething that is supposed to be working; but isn'tSomething that is supposed to be working; but isn'ttriageNeeds triage (eg: priority, bug/not-bug, and owning component)Needs triage (eg: priority, bug/not-bug, and owning component)