Windows

Introduction

This page describes steps to get Reaver working on a fresh Windows 10 install, no additional software necessary.

Note that the main difference between Linux and Windows builds is the headless functionality of the StarCraft II Linux client, meaning that the full game can be launched without any graphical interface. This significantly speeds up the game, especially in parallelized case.

Installing

To install Reaver on Windows please follow the steps as listed, one by one:

ensure you have latest StarCraft II and NVIDIA drivers installed
install Anaconda (not a necessity, but will make steps below easier)
- check Add Anaconda to my PATH environment variable when prompted
open command prompt as Administrator (press Windows -> type cmd -> right click -> Run as Administrator)
create a new environment for Reaver: conda create -n reaver-env python=3.6
activate the environment: activate reaver-env
install TensorFlow + CUDA toolkit: conda install tensorflow-gpu
install PySC2 from source: pip install https://github.com/deepmind/pysc2/archive/master.zip
install Reaver: pip install reaver

Running

open command prompt as Administrator (press Windows -> type cmd -> right click -> Run as Administrator)
navigate to base folder for your experiments (e.g. cd C:\Users\inoryy)
activate the environment: activate reaver-env
run reaver: python -m reaver.run --env MoveToBeacon --envs 4

After the last command you should see four StarCraft II games open and see a train log for the first update:

| T      2 | Fr       512 | Ep      0 | Up      1 | RMe    0.00 | RSd    0.00 | RMa    0.00 | RMi    0.00 | Pl   -0.086 | Vl    0.043 | El 0.0156 | Gr   84.766 | Fps   256 |

Subsequent train logs will appear based on the log_freq flag configuration, which is 100 by default. To see logging information on every update, run:

python -m reaver.run --env MoveToBeacon --envs 4 --log_freq 1

During training you should see your agent experiment with various commands - left/right clicking, queuing commands and so on. It might also seem that your agent is doing nothing - that is also fine, unless it lasts more than 200-300 updates. In these cases what is most likely happening is that the agent is trying out less noticeable commands (e.g. control group assignments, using Stop, Halt, and so on).

With four games running in parallel you should see your agent converge to baseline results (RMe = 25-26) in about 30 minutes, depending on the hardware.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Windows

Introduction

Installing

Running

Clone this wiki locally