GitHub - rickgorman/gin-rummy-ai: A hand-rolled neuroevolution AI for gin rummy.

Gin Rummy AI

This is an exploration into applying neuroevolution to building an AI for the greatest card game of all time, Gin Rummy.

The simulation is a Python-based, player-free Gin Rummy game. The strategy is evolved via neuroevolution. Test code and a sample playground.py are provided, along with a ~40000 generation persistence file.

Getting Started

To run this code, you'll need to install Python and the following modules:

pip install pylru
pip install texttable

With those installed, open a console and run:

python playground.py

This will launch the algorithm. You may want to observe the output -- this can be done in a second console window like so:

tail -f debug.log.txt

You can kill the script with Ctrl-C, which will trigger persistence handling and save the current generation to disk to be loaded again at next startup. As coded, it will then run a couple of games with the two best strategies and display the turn-by-turn output.
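The Ctrl-C behaviour described above can be sketched roughly as follows. This is a minimal illustration, not the code in playground.py: run_generations, load_state, the evolve_one_generation callback, and the JSON file layout are all hypothetical choices made for the example (written for Python 3).

```python
import json
import os


def run_generations(state, evolve_one_generation, persist_path, max_generations=None):
    """Evolve until interrupted, then save the latest completed state.

    `state` is any JSON-serializable dict with a "generation" key;
    `evolve_one_generation` is a hypothetical callback that takes the
    state and returns the next one.
    """
    try:
        while max_generations is None or state["generation"] < max_generations:
            state = evolve_one_generation(state)
    except KeyboardInterrupt:
        # Ctrl-C lands here; the save in `finally` still runs.
        pass
    finally:
        with open(persist_path, "w") as handle:
            json.dump(state, handle)
    return state


def load_state(persist_path):
    """Resume from a previous run if a persistence file exists."""
    if os.path.exists(persist_path):
        with open(persist_path) as handle:
            return json.load(handle)
    return {"generation": 0}
```

Because the state is only reassigned after a generation finishes, an interrupt in the middle of a generation saves the last completed one.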

Additional per-generation fitness history is logged to the file:

playground_check_intelligence.persist.txt.tally

Format is: generation,%skill win rate,score
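If you would rather inspect the tally file from Python, a small parser along these lines may help. It assumes each line holds exactly three comma-separated numeric fields in the order given above, with an integer generation; read_tally is a hypothetical helper and the sample values are made up.

```python
import csv
import io


def read_tally(handle):
    """Parse tally lines of the form: generation,skill win rate,score."""
    rows = []
    for record in csv.reader(handle):
        if len(record) != 3:
            continue  # skip blank or partially written lines
        try:
            row = (int(record[0]), float(record[1]), float(record[2]))
        except ValueError:
            continue  # skip anything non-numeric
        rows.append(row)
    return rows


# Made-up sample data, only to show the shape of the input.
sample = io.StringIO("1,0.10,0.05\n2,0.25,0.12\n")
print(read_tally(sample))
```

For the real file, pass an open file object instead of the StringIO sample.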

You can download and use http://www.live-graph.org/ to watch the fitness output in real-time.

How it works

The majority of work is done in the GinMatch class, called as part of the fitness test. This class pits two players against each other in a "match" of gin rummy. Technically, a match is a number of games played until one player has 100 points, at which point the match is over and final scoring occurs. As of this writing, the population has not evolved sufficiently to play a full match, and so presently a match consists of a single game.
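As a rough sketch of the match structure described above (not the GinMatch implementation), a match loop might look like this. play_match and the play_game callback are hypothetical, and the final match scoring is left out.

```python
def play_match(play_game, target_points=100, max_games=None):
    """Play games until one player reaches target_points.

    `play_game` is a hypothetical callable returning (winner_index, points),
    with winner_index 0 or 1. Passing max_games=1 mirrors the current
    single-game matches.
    """
    scores = [0, 0]
    games_played = 0
    while max(scores) < target_points:
        if max_games is not None and games_played >= max_games:
            break
        winner, points = play_game()
        scores[winner] += points
        games_played += 1
    return scores, games_played
```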

Ranking Function and Output

As of this writing, the ranking function looks at the percentage of games won WITHOUT a coinflip.

If the deck runs out or the players take too many moves, we use a coinflip to end the game. Games ending in coinflips are not useful for our long-term goal of developing a strong AI, so we don't include them in the ranking function.
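The coinflip fallback could be sketched like this. It is an illustration under assumptions, not the project's game loop: resolve_game, the take_turn callback, and the turn cap of 1000 are all hypothetical.

```python
import random


def resolve_game(take_turn, deck_size, max_turns=1000, rng=random):
    """Run turns until someone wins; otherwise settle by coinflip.

    `take_turn` is a hypothetical callable that receives the turn number
    and returns (winner_index_or_None, cards_drawn). The default turn cap
    is an arbitrary placeholder.
    """
    cards_left = deck_size
    for turn in range(max_turns):
        winner, cards_drawn = take_turn(turn)
        if winner is not None:
            return {"winner": winner, "by_coinflip": False}
        cards_left -= cards_drawn
        if cards_left <= 0:
            break  # deck ran out
    # Deck exhausted or too many moves: pick a winner at random.
    return {"winner": rng.randrange(2), "by_coinflip": True}
```

Recording a by_coinflip flag with each result makes it straightforward for a ranking function to leave those games out.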

The output for each generation looks something like this:

                 LEADERBOARD FOR GENERATION #47182  (population: 16)
+-----------------------------------------------------------------------------------------------------------+
|  ranking      skill       score       skill     coinflip      game        match       match        age    |
|             game win                game wins   game wins    losses       wins       losses               |
|             rate (%)                                                                                      |
+===========================================================================================================+
| 1           0.357       0.333       5           1           9           6           9           0         |
| 2           0.556       0.333       5           6           4           11          4           0         |
| 3           0.385       0.333       5           2           8           7           8           0         |
| ...                                                                                                       |
+-----------------------------------------------------------------------------------------------------------+

Columns are explained as:

ranking
skill game win rate (%): this is how often a strategy wins WITHOUT a coinflip
score: basically skill game win rate averaged by age
skill game wins: number of games won without a coinflip
coinflip game wins: number of games won by coinflip
game losses
match wins: currently matches are 1 game long, so this is mostly filler for later
match losses: same
age (in generations)
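The sample rows are consistent with the following reading of the columns. This is inferred from the numbers shown, not taken from the code: the skill game win rate appears to be skill wins divided by skill wins plus losses (displayed as a fraction rather than a percentage), and match wins appear to be skill wins plus coinflip wins. The helper name skill_win_rate is hypothetical.

```python
def skill_win_rate(skill_wins, losses):
    """Fraction of non-coinflip games won; 0.0 when there are none."""
    decided = skill_wins + losses
    return skill_wins / decided if decided else 0.0


# (skill wins, coinflip wins, game losses) copied from the sample leaderboard.
rows = [(5, 1, 9), (5, 6, 4), (5, 2, 8)]
for skill_wins, coinflip_wins, losses in rows:
    rate = round(skill_win_rate(skill_wins, losses), 3)
    match_wins = skill_wins + coinflip_wins
    print(rate, match_wins)
# prints:
# 0.357 6
# 0.556 11
# 0.385 7
```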

Observables

A pair of observer pattern decorators @notify_observers_before and @notify_observers_after are used to keep things DRY and efficient. This pattern allows a class to keep track of properties that will be exposed as inputs to the neural networks via an organize_data() method, and for observers to be notified of changes only when necessary.
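A minimal sketch of how such decorators could work is shown below. Only the names notify_observers_before, notify_observers_after, and organize_data come from this project; the Observable base class, register_observer, notify, and the Hand example are assumptions made for illustration (Python 3), and observer.py may differ.

```python
import functools


class Observable:
    """Base class that tracks observers (hypothetical, for illustration)."""

    def __init__(self):
        self._observers = []

    def register_observer(self, observer):
        self._observers.append(observer)

    def organize_data(self):
        """Return the values exposed as neural-network inputs."""
        return {}

    def notify(self):
        data = self.organize_data()
        for observer in self._observers:
            observer(data)


def notify_observers_after(method):
    """Run the wrapped method, then push fresh data to observers."""
    @functools.wraps(method)
    def wrapper(self, *args, **kwargs):
        result = method(self, *args, **kwargs)
        self.notify()
        return result
    return wrapper


def notify_observers_before(method):
    """Push current data to observers, then run the wrapped method."""
    @functools.wraps(method)
    def wrapper(self, *args, **kwargs):
        self.notify()
        return method(self, *args, **kwargs)
    return wrapper


class Hand(Observable):
    """Toy observable: exposes its card count as an input value."""

    def __init__(self):
        super().__init__()
        self.cards = []

    def organize_data(self):
        return {"card_count": len(self.cards)}

    @notify_observers_after
    def add_card(self, card):
        self.cards.append(card)
```

Only methods that change observable state need a decorator, so read-only methods cost nothing extra.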

Todo

When a player knocks falsely, his hand should be exposed to the other player.
The cull() function kills all individuals except the ones we're mating for the next generation. It should instead retain the top N individuals.
Multithreading (4-8x speedup potential)
Smarter initial weights (100-1000x speedup potential)
Let the InputPerceptrons pull data from Observables, rather than Observables pushing data to Observers on each change (5% speedup potential)
Faster key generation for memoized() (5-10% speedup)
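For the cull() item above, retaining the top N individuals could look something like this. The signature is hypothetical and does not match the current function in genetic_algorithm.py; fitness is assumed to be a callable that scores an individual, higher being better.

```python
def cull(population, fitness, keep):
    """Return the `keep` fittest individuals, best first."""
    if keep < 0:
        raise ValueError("keep must be non-negative")
    return sorted(population, key=fitness, reverse=True)[:keep]
```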

License

All content licensed under Creative Commons Attribution-ShareAlike 4.0 International.
