Multi Armed Bandit

This repository implements two algorithms for the multi-armed bandit problem: epsilon-greedy and softmax. Each tries to maximize the reward collected from arms whose rewards follow Gaussian distributions with given means.
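
As a rough illustration of the two selection rules named above, here is a minimal standalone sketch. It is not the code from this repository: the function names (epsilon_greedy, softmax, run), the parameter values, the arm means, and the unit variance are all assumptions made for the example, and it uses only the standard library so it runs without the dependencies listed below.

```python
import math
import random


def epsilon_greedy(estimates, epsilon, rng):
    """With probability epsilon explore a random arm, otherwise pick the best estimate."""
    if rng.random() < epsilon:
        return rng.randrange(len(estimates))
    return max(range(len(estimates)), key=lambda i: estimates[i])


def softmax(estimates, temperature, rng):
    """Sample an arm with probability proportional to exp(estimate / temperature)."""
    peak = max(estimates)  # subtract the max so exp() cannot overflow
    weights = [math.exp((q - peak) / temperature) for q in estimates]
    threshold = rng.random() * sum(weights)
    cumulative = 0.0
    for arm, weight in enumerate(weights):
        cumulative += weight
        if threshold < cumulative:
            return arm
    return len(weights) - 1  # guard against floating-point round-off


def run(select, param, means, steps=2000, seed=0):
    """Play `steps` rounds on Gaussian arms (unit variance assumed for this sketch)."""
    rng = random.Random(seed)
    estimates = [0.0] * len(means)
    counts = [0] * len(means)
    total = 0.0
    for _ in range(steps):
        arm = select(estimates, param, rng)
        reward = rng.gauss(means[arm], 1.0)
        counts[arm] += 1
        # Incremental sample mean: Q <- Q + (r - Q) / n
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total += reward
    return total / steps, counts


if __name__ == "__main__":
    true_means = [0.1, 0.5, 0.9]  # hypothetical arm means
    for name, select, param in [("epsilon-greedy", epsilon_greedy, 0.1),
                                ("softmax", softmax, 0.2)]:
        average, counts = run(select, param, true_means)
        print("%s: average reward %.3f, pulls per arm %s" % (name, average, counts))
```

Both rules share the same running-mean estimate and differ only in how they trade exploration against exploitation: epsilon-greedy explores uniformly at a fixed rate, while softmax explores in proportion to how promising each arm currently looks.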

Usage

Compilation/Install

git clone https://github.com/Ali92hm/multi-armed-bandit.git

Execution

The library code is in the algorithm folder. To see how to use the algorithms, look at the demo.py script.

python demo.py

Dependencies

Python 2.7
NumPy
Matplotlib

Structure

multi-armed-bandit
├── LICENSE
├── demo.py                     - Demo of the algorithm in use
└── algorithm                   - Algorithm implementation
    ├── base_algorithm.py       - Base class for the algorithms
    ├── epsilon_greedy.py       - Epsilon-greedy algorithm
    └── softmax.py              - Softmax algorithm

Potential Bugs

To do

License

MIT license
