The environment was checked on a RHEL 4.8.5 cluster.
If Anaconda is not already installed, follow https://linuxize.com/post/how-to-install-anaconda-on-ubuntu-20-04/
git clone https://github.com/emilytoyber/Minecraft_proj.git
cd Minecraft_proj/
conda env create -f imi_env.yaml
The files state_to_transition.json and pov_cluster_to_transition_with_30K_pov.ipynb, as well as the files in jsons_actions.zip, are created by preprocess_data.ipynb.
To run the training part of the models, activate the relevant environment and cd to the respective directory (agents/MineRL2020/ for imi_env), then run python train.py.
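For example (assuming the environment name defined in imi_env.yaml is imi_env):
conda activate imi_env
cd agents/MineRL2020/
python train.py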
For imitation, run agents/basic_BC.ipynb, which implements the behavioral cloning (BC) algorithm.
For testing, run the respective Colab notebook after cloning the repository into Colab and uploading the relevant trained model (Colab is needed because it provides a virtual frame buffer; other platforms with a virtual frame buffer may also work). basic_BC.ipynb includes both the training and testing of BC; Imitation_test.ipynb includes the test of the imitation agent.
Random_Agent.ipynb runs the random baseline for the environment; scripted_ironpickaxe.ipynb runs the scripted version of our algorithm.
Our main new algorithm uses a DBSCAN+KNN clustering model trained by running POVs_clustering.ipynb. The notebook is currently limited to 30K POVs per player, because the code crashed on the full data due to insufficient resources in Colab; if you have more resources, you may drop the 30K slicing.
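As a rough illustration of the DBSCAN+KNN pattern (not the exact code in POVs_clustering.ipynb), the sketch below clusters flattened POV frames with DBSCAN and then fits a KNN classifier so that unseen frames can be assigned to the discovered clusters; the file name, shapes, and hyperparameters are placeholders:
import numpy as np
from sklearn.cluster import DBSCAN
from sklearn.neighbors import KNeighborsClassifier

povs = np.load("povs.npy")  # hypothetical file: flattened POV frames, shape (N, D)
povs = povs[:30_000]        # the 30K-per-player slicing mentioned above

# DBSCAN assigns each POV a cluster label; -1 marks noise points.
# eps and min_samples are placeholders and need tuning to the data.
labels = DBSCAN(eps=0.5, min_samples=5).fit_predict(povs)

# DBSCAN cannot label unseen points, so a KNN classifier is fit on the
# non-noise POVs and used to map new frames to the discovered clusters.
core = labels != -1
knn = KNeighborsClassifier(n_neighbors=5).fit(povs[core], labels[core])

def pov_to_cluster(frame):
    # frame: a single POV flattened to shape (D,)
    return knn.predict(frame.reshape(1, -1))[0]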
Testing this agent is done by running choose_actions.ipynb.
Run comparison_environment.ipynb after uploading the respective JSON files obtained from the Colab notebooks.
To add new algorithms to the comparison environment, export a JSON file of execution statistics from the algorithm. The JSON is constructed in the following way:
stats['runtime'].append(time() - start)
stats['reward'].append(reward_sum)
stats['reward_at'].append(rewards)
reward_sum is the total reward of the episode; rewards is a list of (step, reward) tuples, where reward is the immediate (non-zero) reward of the action.
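As a concrete illustration, here is a minimal sketch of an episode loop that builds and exports such a stats file; env, agent, the episode count, and the output file name are assumptions standing in for a MineRL-style gym environment, your algorithm, and your chosen path:
import json
from time import time

stats = {'runtime': [], 'reward': [], 'reward_at': []}
for episode in range(10):  # number of episodes is illustrative
    start = time()
    obs = env.reset()  # env: assumed MineRL-style gym environment
    reward_sum, rewards, step, done = 0, [], 0, False
    while not done:
        action = agent.act(obs)  # agent: hypothetical interface to your algorithm
        obs, reward, done, _ = env.step(action)
        reward_sum += reward
        if reward != 0:  # record only steps with an immediate reward
            rewards.append((step, reward))
        step += 1
    stats['runtime'].append(time() - start)
    stats['reward'].append(reward_sum)
    stats['reward_at'].append(rewards)

with open('my_algorithm_stats.json', 'w') as f:  # illustrative file name
    json.dump(stats, f)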
Our project compares different artificial intelligence algorithms trained and tested on environments from the MineRL competitions. Some of the algorithms are submissions by different teams from the 2019-2022 competitions.