Video Generator

(This is still an unfinished model)

The main purpose of this repo is to complete my compulsory course project.

Introduction

The goal of this model is to generate video frame by frame, based on action prompts.

By prompting the model with an action and previous frames, the model will generate the future frames of the video.

At current stage, the architecture of the model will refer to GAIA-1, a generative world model for autonomous driving.

By the way, Genie also demonstrated a good performance that I expected, so I'm considering to combine both models together and do experiment on it.

Installation

Basic

Simply, run:

pip install -r requirements.txt

In addition, the environment is built with Python 3.9 and CUDA 11.8

If you failed to install dependencies in this way, please run the following command:

pip install matplotlib
pip install opencv-python opencv-contrib-python
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118

MineRL

If you need to generate the Minecraft random video data, you have to install MineRL.

First of all, install jdk8:

add-apt-repository ppa:openjdk-r/ppa
apt-get update
apt-get install openjdk-8-jdk

# Verify installation
java -version # this should output "1.8.X_XXX"
# If you are still seeing a wrong Java version, you may use the following line to update it
# sudo update-alternatives --config java

Then, install MineRL:

pip install git+https://github.com/minerllabs/minerl

Train

Basically, run:

python train.py

Please read the code of train.py for more details.

Citations

@article{hu2023gaia,
  title={Gaia-1: A generative world model for autonomous driving},
  author={Hu, Anthony and Russell, Lloyd and Yeo, Hudson and Murez, Zak and Fedoseev, George and Kendall, Alex and Shotton, Jamie and Corrado, Gianluca},
  journal={arXiv preprint arXiv:2309.17080},
  year={2023}
}

@article{bruce2024genie,
  title={Genie: Generative Interactive Environments},
  author={Bruce, Jake and Dennis, Michael and Edwards, Ashley and Parker-Holder, Jack and Shi, Yuge and Hughes, Edward and Lai, Matthew and Mavalankar, Aditi and Steigerwald, Richie and Apps, Chris and others},
  journal={arXiv preprint arXiv:2402.15391},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
logs		logs
networks		networks
outputs		outputs
trainers		trainers
weights		weights
LICENSE		LICENSE
README.md		README.md
dataset.py		dataset.py
generate_data.py		generate_data.py
requirements.txt		requirements.txt
test.py		test.py
train.py		train.py
utilities.py		utilities.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Video Generator

Introduction

Installation

Basic

MineRL

Train

Citations

About

Uh oh!

Releases

Packages

Languages

License

Penrose0v0/VideoGenerator

Folders and files

Latest commit

History

Repository files navigation

Video Generator

Introduction

Installation

Basic

MineRL

Train

Citations

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages