This project is presented as a Jupyter Notebook, giving full visibility into the class definitions and the training of the algorithm. You can watch the program run in the pygame environment by calling rl.play(True, True). This command runs the agent with epsilon-driven exploration disabled, so it follows its learned policy directly rather than taking random exploratory actions.
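As an illustration of that behavior, a greedy rollout might look like the minimal sketch below. The `play` signature, the `env` object, and the `q_table` structure here are assumptions for illustration, not the notebook's actual implementation.

```python
# Hypothetical sketch of a greedy rollout; env and q_table are assumed
# to exist with the interfaces used below.
def play(env, q_table, render=True, greedy=True):
    """Roll out one episode; with greedy=True, epsilon is ignored and
    the agent always takes the highest-valued action."""
    state = env.reset()
    done = False
    while not done:
        # Pick the action with the largest Q-value for the current state.
        action = max(range(env.n_actions), key=lambda a: q_table[state][a])
        state, _reward, done = env.step(action)
        if render:
            env.render()  # draw the grid in the pygame window
```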
The number of states in the model depends on the size of the environment. To reduce this number, we treat certain positions as equivalent, which keeps the model's complexity manageable.
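One way to implement such position equivalence is to map each position to a canonical state before indexing the Q-table, as in the sketch below. The grid coordinates and the equivalence mapping are made-up examples, not the notebook's code.

```python
# Illustrative state reduction: positions declared equivalent share one state.
def state_id(position, equivalent_positions):
    """Map a (row, col) position to its canonical state; positions not
    in the mapping are their own state."""
    return equivalent_positions.get(position, position)

# Example: treat two symmetric dead-end cells as the same state.
equivalent = {(3, 0): (0, 3)}
print(state_id((3, 0), equivalent))  # (0, 3) -- shares one Q-table row
print(state_id((1, 1), equivalent))  # (1, 1) -- unchanged
```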
States: the agent's positions within the environment.
Actions: the agent's possible movements: "up," "down," "left," and "right."
Rewards: the penalties and incentives that govern the agent's behavior.
Goal state: the cell marked "T," the endpoint the agent must reach.
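A minimal sketch of these four components for a small grid world might look like the following; all names, grid contents, and reward values are illustrative assumptions, not the notebook's definitions.

```python
# Four possible moves, expressed as (row, col) offsets.
ACTIONS = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}

# A tiny grid: "S" start, "T" goal, "#" wall, "." free cell.
GRID = [
    ["S", ".", "."],
    [".", "#", "."],
    [".", ".", "T"],
]

# Rewards per cell type; a small step penalty encourages short paths.
REWARDS = {"T": 10.0, "#": -5.0, ".": -0.1, "S": -0.1}
```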
The learning rate (α) significantly influences the algorithm's performance:
- It affects the speed of convergence; too high a value can cause oscillation.
- It balances how much weight new experience carries against previously accumulated estimates.
- It plays a pivotal role in the stability and accuracy of the final solution.
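For concreteness, the tabular Q-learning update below shows where α enters; the variable names, data layout, and values are illustrative assumptions.

```python
# Sketch of the tabular Q-learning update rule.
def q_update(q_table, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Move Q(s, a) a fraction alpha toward the bootstrapped target.
    A large alpha learns fast but can oscillate; a small alpha is
    stable but converges slowly."""
    target = reward + gamma * max(q_table[next_state])
    q_table[state][action] += alpha * (target - q_table[state][action])

# Usage: q_table maps each state to a list of per-action values.
q = {"s0": [0.0, 0.0], "s1": [1.0, 2.0]}
q_update(q, "s0", 0, reward=1.0, next_state="s1")
print(q["s0"][0])  # 0.1 * (1.0 + 0.9 * 2.0) = 0.28
```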
The discount factor (γ) plays a crucial role in reinforcement learning:
- It determines the relative importance of long-term versus short-term rewards.
- It guides the search for an optimal policy and underscores the value of reaching the goal.
- It influences the convergence rate and the temporal consistency of the learning process.
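The small computation below illustrates how γ weights delayed against immediate rewards; the reward sequences are made-up examples.

```python
# Discounted return: sum of gamma^t * r_t over a reward sequence.
def discounted_return(rewards, gamma):
    return sum(gamma ** t * r for t, r in enumerate(rewards))

delayed = [0, 0, 0, 10]   # a large goal reward that arrives late
immediate = [3, 0, 0, 0]  # a small reward that arrives at once

for gamma in (0.5, 0.99):
    print(gamma, discounted_return(delayed, gamma), discounted_return(immediate, gamma))
# With gamma=0.5 the immediate reward wins (3.0 vs 1.25);
# with gamma=0.99 the delayed goal dominates (~9.70 vs 3.0).
```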