REINVENT

Molecular De Novo design using Recurrent Neural Networks and Reinforcement Learning

Searching chemical space as described in:

Molecular De Novo Design through Deep Reinforcement Learning

Notes

The current version is a PyTorch implementation that differs in several ways from the original implementation described in the paper. This version works better in most situations and is better documented, but for the purpose of reproducing results from the paper refer to Release v1.0.1

Differences from implmentation in the paper:

Written in PyTorch/Python3.6 rather than TF/Python2.7
SMILES are encoded with token index rather than as a onehot of the index. An embedding matrix is then used to transform the token index to a feature vector.
Scores are in the range (0,1).
A regularizer that penalizes high values of total episodic likelihood is included.
Sequences are only considered once, ie if the same sequence is generated twice in a batch only the first instance contributes to the loss.
These changes makes the algorithm more robust towards local minima, means much higher values of sigma can be used if needed.

Requirements

This package requires:

Python 3.6
PyTorch 0.1.12
RDkit
Scikit-Learn (for QSAR scoring function)
tqdm (for training Prior)
pexpect

Usage

To train a Prior starting with a SMILES file called mols.smi:

First filter the SMILES and construct a vocabulary from the remaining sequences. ./data_structs.py mols.smi - Will generate data/mols_filtered.smi and data/Voc. A filtered file containing around 1.1 million SMILES and the corresponding Voc is contained in "data".
Then use ./train_prior.py to train the Prior. A pretrained Prior is included.

To train an Agent using our Prior, use the main.py script. For example:

./main.py --scoring-function activity_model --num-steps 1000

Training can be visualized using the Vizard bokeh app. The vizard_logger.py is used to log information (by default to data/logs) such as structures generated, average score, and network weights.

cd Vizard
./run.sh ../data/logs
Open the browser at http://localhost:5006/Vizard

Name		Name	Last commit message	Last commit date
Latest commit History 68 Commits
Vizard		Vizard
__pycache__		__pycache__
bash_scripts		bash_scripts
data		data
images		images
.DS_Store		.DS_Store
Import QM9.ipynb		Import QM9.ipynb
LICENSE		LICENSE
Plot Results.ipynb		Plot Results.ipynb
Plot_rnn_losses.ipynb		Plot_rnn_losses.ipynb
README.md		README.md
data_structs.py		data_structs.py
debug_data_structs.ipynb		debug_data_structs.ipynb
debug_data_structs.py		debug_data_structs.py
diagnostics.py		diagnostics.py
environment.yml		environment.yml
export		export
generate_smiles.py		generate_smiles.py
inspect_RNN_output.ipynb		inspect_RNN_output.ipynb
levy_test.ipynb		levy_test.ipynb
main.py		main.py
model.py		model.py
molecule.charges		molecule.charges
molecule.energy		molecule.energy
molecule.gradient		molecule.gradient
molecule.molecule.engrad		molecule.molecule.engrad
molecule.out		molecule.out
molecule.wbo		molecule.wbo
molecule.xtbrestart		molecule.xtbrestart
molecule.xtbtopo.mol		molecule.xtbtopo.mol
molecule.xyz		molecule.xyz
mols.smi		mols.smi
multiprocess.py		multiprocess.py
plot_results.ipynb		plot_results.ipynb
qm9_all_bandgaps.npy		qm9_all_bandgaps.npy
reinvent_bandgap_cpu.sh		reinvent_bandgap_cpu.sh
reinvent_og_cpu.sh		reinvent_og_cpu.sh
scoring_functions.py		scoring_functions.py
smiles_to_bandgap.py		smiles_to_bandgap.py
test_bandgap_computation.py		test_bandgap_computation.py
train_agent.py		train_agent.py
train_agent.pyc		train_agent.pyc
train_prior.py		train_prior.py
ulimit		ulimit
utils.py		utils.py
vizard_logger.py		vizard_logger.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

REINVENT

Molecular De Novo design using Recurrent Neural Networks and Reinforcement Learning

Notes

Requirements

Usage

About

Releases

Packages

Languages

License

ankur56/REINVENT

Folders and files

Latest commit

History

Repository files navigation

REINVENT

Molecular De Novo design using Recurrent Neural Networks and Reinforcement Learning

Notes

Requirements

Usage

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages