A conda environment is required for installation. To create and activate the environment, run the following commands:
```
conda env create -f environment.yml
source activate multiview_chemvae
```
Jupyter Notebook is also required to run the `*.ipynb` examples in the `experiments` directory.
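If the notebooks do not automatically pick up the conda environment, registering it as a Jupyter kernel may help (this assumes `ipykernel` is available in the environment):

```
python -m ipykernel install --user --name multiview_chemvae
```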
To create the datasets for training, run:

```
python make_char_dataset.py
python make_grammar_dataset.py
python make_features_dataset.py
```
The resulting datasets are named `char_dataset.h5`, `grammar_dataset.h5`, and `features_dataset.h5`, respectively.
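The dataset files are plain HDF5, so a quick way to confirm they were built correctly is to list their contents with `h5py`. The sketch below makes no assumptions about the internal group or dataset names and simply prints whatever the `make_*_dataset.py` scripts wrote:

```python
import h5py

# Print every dataset stored in one of the generated HDF5 files,
# along with its shape and dtype.
def show(name, obj):
    if isinstance(obj, h5py.Dataset):
        print(f"{name}: shape={obj.shape}, dtype={obj.dtype}")

with h5py.File("grammar_dataset.h5", "r") as f:
    f.visititems(show)
```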
To train a model, run:

```
python train.py --model_type <type>
```

where `<type>` is either `Grammar` or `Character`.
Additional flags include:

- `--two_tower`: train a two-tower model
- `--load_model`: load weights from an existing model
- `--latent_dim`: dimensionality of the latent space
- `--epochs`: number of training epochs
For example:

```
python train.py --model_type=Grammar --two_tower --latent_dim=128 --epochs=100
```
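For reference, these flags behave like ordinary `argparse` options. The sketch below is an assumption about how `train.py` might declare them (flag types and defaults are illustrative, not taken from the script):

```python
import argparse

# Hypothetical CLI declaration matching the flags documented above;
# the real train.py may use different defaults or help strings.
parser = argparse.ArgumentParser()
parser.add_argument("--model_type", choices=["Grammar", "Character"], required=True)
parser.add_argument("--two_tower", action="store_true",
                    help="train a two-tower model")
parser.add_argument("--load_model", action="store_true",
                    help="load weights from an existing model")
parser.add_argument("--latent_dim", type=int, default=56,
                    help="dimensionality of the latent space (default assumed)")
parser.add_argument("--epochs", type=int, default=100,
                    help="number of training epochs (default assumed)")
args = parser.parse_args()
print(args)
```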
The `experiments` directory contains the following Jupyter notebooks for evaluating the performance of the models:
- Prior validity: `experiments/prior_validity.ipynb`
- Reconstruction accuracy: `experiments/reconstruction_accuracy.ipynb`
- Property prediction: `experiments/property_prediction.ipynb`
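The notebooks can be run interactively with `jupyter notebook`, or executed headlessly. A small sketch using `jupyter nbconvert` (which ships with Jupyter) is:

```python
import subprocess

# Execute each evaluation notebook non-interactively; nbconvert writes
# the executed copy alongside the original with a ".nbconvert" suffix.
notebooks = [
    "experiments/prior_validity.ipynb",
    "experiments/reconstruction_accuracy.ipynb",
    "experiments/property_prediction.ipynb",
]
for nb in notebooks:
    subprocess.run(
        ["jupyter", "nbconvert", "--to", "notebook", "--execute", nb],
        check=True,
    )
```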