Skip to content

Neuromatch group project: Relevance of sensory modalities for spatial navigation in foraging behaviours of the bee

Notifications You must be signed in to change notification settings

dilayercelik/rl-bee-multimodal-sensing

 
 

Repository files navigation

rl-bee-multimodal-sensing

Reinforcement Learning Model Training and Prediction for Neuromatch group project: Relevance of sensory modalities for spatial navigation in foraging behaviours of the bee

This repository contains Python scripts and utilities for training a custom Reinforcement Learning (RL) model using the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm. The trained model can be used for predicting actions in a custom Gym environment.

Files

The repository contains the following files:

  1. bee.py: Environment (gym) setup - agents action, rewards and space definition for the RL model.
  2. train_model.py: The main script to train the RL model.
  3. render_model.py: A script to generate and display a video of the RL model's predictions.
  4. config.yaml: The configuration file for training the RL model.
  5. model.py: A Python module containing utility functions for initializing and loading the RL model.
  6. utils.py: A Python module containing utility functions for creating directories, saving the configuration, and more.
  7. gym_run.py: Demonstration file for testing gym changes

Usage

  1. Configure the config.yaml file with the desired parameters for training the RL model.
  2. Run the training script:

python train_model.py --config_path config.yaml

  1. After training, you can generate a video of the model's predictions using the render_model.py script:

python render_model.py --config_path config.yaml

  1. You can also use the convinience jupyter-notebooks, useful for working in google colab.

WIP:

Implemented features:

  1. logging, early stopping, model managment
  2. model re-training
  3. walls/obstacles

TODO model wise:

  1. Testing different architectures
  2. Make goal smaller
  3. Reward testing
    1. Adding senses
    2. Time passed punishment
    3. wall hitting punishment
  4. Increasing/gradually shrinking the goal size whilst training
  5. multiple goals

References

License

This project is licensed under the MIT License - see the LICENSE file for details.

About

Neuromatch group project: Relevance of sensory modalities for spatial navigation in foraging behaviours of the bee

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 69.3%
  • Jupyter Notebook 29.9%
  • Shell 0.8%