You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A Q Learning algorithm where the agent learns to go from 1x1 square to the goal without touching the red obstacle squares in a nxn grid.
It perfects its learning on around 140-150 generations.