This code implements the paper Generate and Test by Mahmood and Sutton (2013). It provides two ways of learning representations: one with a fixed representation, and one that replaces features according to their utility. The code uses the Second Tester to determine utility (see the paper).
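The core idea can be illustrated with a minimal sketch (not the repository's actual implementation): a linear learner on random threshold features, where each feature's utility is a running trace of its outgoing weight magnitude, and the lowest-utility features are periodically replaced with fresh random ones. All names and constants below are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

n_in, n_feat = 10, 50                  # input and feature dimensions (illustrative)
V = rng.normal(size=(n_feat, n_in))    # input-to-feature weights (the "generators")
w = np.zeros(n_feat)                   # feature-to-output weights (learned)
utility = np.zeros(n_feat)             # trace of |w| per feature (second-tester style)
decay, step, replace_frac = 0.99, 0.01, 0.05

def features(x):
    # LTU-style binary features: threshold of a random linear projection
    return (V @ x > 0).astype(float)

for t in range(2000):
    x = rng.normal(size=n_in)
    y = x[0] - 2 * x[1]                # toy target for illustration
    phi = features(x)
    err = y - w @ phi
    w += step * err * phi              # LMS update of output weights
    utility = decay * utility + (1 - decay) * np.abs(w)

    if (t + 1) % 500 == 0:
        # test step: replace the lowest-utility features with new random generators
        k = int(replace_frac * n_feat)
        worst = np.argsort(utility)[:k]
        V[worst] = rng.normal(size=(k, n_in))
        w[worst] = 0.0
        utility[worst] = 0.0
```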
Requirements:

- Python 3.x
- PyTorch 1.7+
- NumPy
- Matplotlib
- tqdm
The file config.json contains the parameters associated with a run and can be modified for different runs. See the section on running on a server for parallel runs.
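As a quick illustration of working with such a file, the sketch below writes and reads back a config.json. The parameter names here (`seed`, `step_size`, `num_features`) are hypothetical; the real keys are whatever config.json in the repository defines.

```python
import json

# Hypothetical parameter names for illustration only;
# consult the repository's config.json for the actual keys.
cfg = {"seed": 0, "step_size": 0.01, "num_features": 100}
with open("config.json", "w") as f:
    json.dump(cfg, f, indent=2)

# A run script would then read the file back:
with open("config.json") as f:
    loaded = json.load(f)
```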
Use the Python scripts directly.

- Fixed representation:

```
python learner_original.py
```

Other flags can be seen with:

```
python learner_original.py -h
```

- Using search:

```
python learner_original.py --search
```

For parallel runs you need to generate temporary configuration files: edit master_config.json, add the parameters of your choice, then use:

```
python generate_config.py
```

This will create a temporary directory with config files corresponding to the runs. Use the --cfg flag to locate them. An example script for the Slurm job loader is given as run.sh. Don't forget to use the --store-losses flag with parallel runs.
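A script like generate_config.py typically expands lists of parameter values from a master config into one config file per run (the Cartesian product). The sketch below shows that pattern; the parameter names and file layout are assumptions, not the repository's actual behaviour.

```python
import itertools
import json
import os
import tempfile

# Assumed master-config contents: each key maps to the list of values to sweep.
master = {"step_size": [0.01, 0.001], "seed": [0, 1, 2]}

out_dir = tempfile.mkdtemp(prefix="configs_")
keys = sorted(master)
combos = list(itertools.product(*(master[k] for k in keys)))
for i, values in enumerate(combos):
    cfg = dict(zip(keys, values))
    # one config file per parameter combination, e.g. config_0.json, config_1.json, ...
    with open(os.path.join(out_dir, f"config_{i}.json"), "w") as f:
        json.dump(cfg, f, indent=2)
```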
The losses are saved as pickle files (one per run) and the results can be visualised as follows.

- For fixed representations:

```
python plot_graph.py -f {feature sizes separated by spaces} -s {seed array}
```

- For search, use the --search flag. If you need to compare fixed-representation and search results, use the --plot_all flag.

For replacement-rate and step-size variation use rrstep.py, and for replacement-rate and decay-rate variation plots use rrdr.py. learner_x.py is LTU + Adam, and learner_xrel.py is for other activations + Adam.

In the plots, the X axis represents the number of examples and the Y axis the loss; -s is using search and -f is the fixed representation.
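For a quick look at pickled losses without plot_graph.py, a minimal sketch like the following averages loss curves over seeds and plots loss against the number of examples. The file names and curve contents here are made up for the example; only the load-average-plot pattern is the point.

```python
import pickle

import numpy as np
import matplotlib
matplotlib.use("Agg")          # headless backend, suitable for servers
import matplotlib.pyplot as plt

# Hypothetical per-seed loss files; real names depend on how the runs were stored.
paths = ["loss_seed0.pkl", "loss_seed1.pkl"]

# Create toy loss curves so this sketch is self-contained.
for i, p in enumerate(paths):
    with open(p, "wb") as f:
        pickle.dump(np.linspace(1.0, 0.1, 100) + 0.01 * i, f)

runs = []
for p in paths:
    with open(p, "rb") as f:
        runs.append(pickle.load(f))
mean_loss = np.mean(runs, axis=0)   # average over seeds

plt.plot(mean_loss, label="fixed representation")
plt.xlabel("number of examples")
plt.ylabel("loss")
plt.legend()
plt.savefig("losses.png")
```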
**Note:** the code will be updated with modules soon.
