Devanik21 / Hexapawn-RL Star 0 Code Issues Pull requests Hexapawn Game Engine Proper 3×3 board with pawn movement Strategic RL Agents Minimax with Alpha-Beta Pruning (depth configurable 1-7) Q-Learning with temporal difference updates Experience replay for efficient learning Epsilon-greedy exploration with decay Multi-level decision hierarchy (immediate threats → strategic planning) reinforcement-learning q-learning epsilon-greedy game-theory deep-rl temporal-difference multi-agent-rl minimax-alpha-beta hexapawn-game Updated Dec 14, 2025 Python