Behavioral Cloning (BC) / pretraining with recurrent=true #2962

**Describe the bug**
Training crashes with the error below during behavioral cloning or pretraining when recurrent=true is set.

**To Reproduce**
Steps to reproduce the behavior:
1. Set up a new conda environment with python=3.7
2. Pull and install ml-agents: ```pip install .```
3. Set recurrent=true in config/offline_bc_config.yaml
4. Run ```mlagents-learn config/offline_bc_config.yaml --run-id=pyramids-update-rnn-test --train```
5. Press play in the Basic Unity scene from examples.
6. Observe the error within a few seconds.

**Console logs / stack traces**
```
$ mlagents-learn config/offline_bc_config.yaml --run-id=rnn-bc-test --train
Traceback (most recent call last):
  File "/Users/njustesen/anaconda3/envs/ml-test/bin/mlagents-learn", line 8, in <module>
    sys.exit(main())
  File "/Users/njustesen/anaconda3/envs/ml-test/lib/python3.7/site-packages/mlagents/trainers/learn.py", line 408, in main
    run_training(0, run_seed, options, Queue())
  File "/Users/njustesen/anaconda3/envs/ml-test/lib/python3.7/site-packages/mlagents/trainers/learn.py", line 253, in run_training
    tc.start_learning(env)
  File "/Users/njustesen/anaconda3/envs/ml-test/lib/python3.7/site-packages/mlagents/trainers/trainer_controller.py", line 209, in start_learning
    n_steps = self.advance(env_manager)
  File "/Users/njustesen/anaconda3/envs/ml-test/lib/python3.7/site-packages/mlagents/envs/timers.py", line 263, in wrapped
    return func(*args, **kwargs)
  File "/Users/njustesen/anaconda3/envs/ml-test/lib/python3.7/site-packages/mlagents/trainers/trainer_controller.py", line 297, in advance
    trainer.update_policy()
  File "/Users/njustesen/anaconda3/envs/ml-test/lib/python3.7/site-packages/mlagents/trainers/bc/trainer.py", line 138, in update_policy
    run_out = self.policy.update(mini_batch, self.n_sequences)
  File "/Users/njustesen/anaconda3/envs/ml-test/lib/python3.7/site-packages/mlagents/trainers/bc/policy.py", line 95, in update
    run_out = self._execute_model(feed_dict, self.update_dict)
  File "/Users/njustesen/anaconda3/envs/ml-test/lib/python3.7/site-packages/mlagents/trainers/tf_policy.py", line 151, in _execute_model
    network_out = self.sess.run(list(out_dict.values()), feed_dict=feed_dict)
  File "/Users/njustesen/anaconda3/envs/ml-test/lib/python3.7/site-packages/tensorflow_core/python/client/session.py", line 956, in run
    run_metadata_ptr)
  File "/Users/njustesen/anaconda3/envs/ml-test/lib/python3.7/site-packages/tensorflow_core/python/client/session.py", line 1156, in _run
    (np_val.shape, subfeed_t.name, str(subfeed_t.get_shape())))
ValueError: Cannot feed value of shape (0,) for Tensor 'teacher_action:0', which has shape '(?, 1)'
```
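
The `ValueError` comes from TensorFlow's feed check: the value fed for `teacher_action:0` has shape `(0,)`, i.e. an empty, rank-1 array, while the placeholder expects rank 2 with shape `(?, 1)`. The shape suggests the mini-batch of teacher actions was empty when it reached `policy.update`, though that is an inference from the message and not something confirmed in the ml-agents code. The sketch below is not ml-agents or TensorFlow code; `shape_of` and `is_feedable` are hypothetical helpers that only mimic the shape comparison, using plain Python lists in place of arrays.

```python
from typing import Optional, Tuple

# Shape with None standing in for an unknown dimension ('?').
PlaceholderShape = Tuple[Optional[int], ...]

# Shape reported for 'teacher_action:0' in the traceback: (?, 1).
TEACHER_ACTION_SHAPE: PlaceholderShape = (None, 1)


def shape_of(nested) -> Tuple[int, ...]:
    """Shape of a rectangular nested list (hypothetical helper).

    An empty list has shape (0,), matching the shape in the error message.
    """
    shape = []
    while isinstance(nested, list):
        shape.append(len(nested))
        if not nested:
            break
        nested = nested[0]
    return tuple(shape)


def is_feedable(value_shape: Tuple[int, ...], placeholder: PlaceholderShape) -> bool:
    """True if value_shape has the same rank and matches every known dimension."""
    if len(value_shape) != len(placeholder):
        return False
    return all(p is None or p == v for v, p in zip(value_shape, placeholder))


empty_batch = []            # what an empty mini-batch of actions would look like
normal_batch = [[1], [0]]   # two samples, one discrete action each

empty_ok = is_feedable(shape_of(empty_batch), TEACHER_ACTION_SHAPE)
normal_ok = is_feedable(shape_of(normal_batch), TEACHER_ACTION_SHAPE)
```

Here `empty_ok` is `False` because the ranks differ (1 versus 2), while `normal_ok` is `True`. A check like this just before the feed would be one way to confirm whether the recurrent code path is producing an empty mini-batch.
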
**Environment (please complete the following information):**
- macOS Mojave 10.14.6
- _ML-Agents version_: Main branch commit 8a08bc4306dfa897b780afc79ee2bce47c76d47f

