Relational Graph Conditioned Variational Autoencoder for Molecule Design (RGCVAE)

This repository contains the code used to generate the results reported in the paper: RGCVAE: Relational Graph Conditioned Variational Autoencoder for Molecule Design.

Dependencies

This project uses a conda environment. The root folder contains, for each model, a .yml file for configuring the conda environment and a .txt file for the pip environment. Note that some dependency versions can cause problems while the environment is being configured. For this reason, although the setup.bash file is provided for configuring each project, we recommend configuring the environment manually.

Structure

The project is structured as follows:

data: contains the code that builds the datasets;
results: contains the checkpoints and the results;
model: contains the model code;
utils: contains the utility code.

Usage

Data Download

First, download the necessary files and configure the environment by running the following commands:

sh setup.bash install
conda activate rgcvae

Data Pre-processing

To build the datasets, run the following commands:

cd data
python make_dataset.py --dataset [dataset]

where [dataset] can be one of:

qm9
qm9_long2
zinc
zinc_long2
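
To pre-process all four datasets in one go, the commands can be generated from the list above. The following is a minimal sketch: make_dataset_command is a hypothetical helper that is not part of this repository, and it only builds and prints the commands, which are still meant to be run from inside the data folder.

```python
import shlex

# Dataset names listed in this README.
DATASETS = ("qm9", "qm9_long2", "zinc", "zinc_long2")


def make_dataset_command(dataset):
    """Return the argv list for pre-processing one dataset (hypothetical helper)."""
    if dataset not in DATASETS:
        raise ValueError(f"unknown dataset: {dataset!r}")
    return ["python", "make_dataset.py", "--dataset", dataset]


if __name__ == "__main__":
    # Print one command per dataset; nothing is executed here.
    for name in DATASETS:
        print(shlex.join(make_dataset_command(name)))
```

Rejecting unknown names before the script is launched catches typos such as qm9_long instead of qm9_long2 early.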

Model Training

To train the model, run:

python RGCVAE.py --dataset [dataset] --config '{"generation":0, "log_dir":"./results", "use_mask":false}'
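
The value passed to --config looks like a JSON object, so booleans must be written as false/true rather than Python's False/True. As a sketch under that assumption, the string can be produced from a Python dict with json.dumps instead of being typed by hand. The helper name config_argument and the choice of qm9 as the dataset are illustrative only.

```python
import json
import shlex


def config_argument(config):
    """Serialize a Python dict to a JSON string for --config (hypothetical helper)."""
    # json.dumps writes Python False/True as JSON false/true.
    return json.dumps(config)


# Same keys and values as the training command in this README.
train_config = {"generation": 0, "log_dir": "./results", "use_mask": False}
argv = ["python", "RGCVAE.py", "--dataset", "qm9",
        "--config", config_argument(train_config)]

# shlex.join quotes the JSON so the shell passes it as a single argument.
print(shlex.join(argv))
```

Building the command this way avoids the two most common mistakes with inline JSON on the command line: unbalanced quotes and Python-style booleans.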

Model Test

In order to generate new molecules:

python RGCVAE.py --dataset [dataset] --restore results/[checkpoint].pickle --config '{"generation":1, "log_dir":"./results"}'

To reconstruct the molecules instead:

python RGCVAE.py --dataset [dataset] --restore results/[checkpoint].pickle --config '{"generation":2, "log_dir":"./results"}'
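
The commands in this README differ mainly in the value of the "generation" key: 0 for training, 1 for generation and 2 for reconstruction. The sketch below makes that mapping explicit. Mode and rgcvae_argv are hypothetical names that do not exist in the repository, and the rule that a checkpoint is required outside training is an assumption based on the commands shown here.

```python
import json
from enum import IntEnum


class Mode(IntEnum):
    """Values of the "generation" config key as used in this README."""
    TRAIN = 0
    GENERATE = 1
    RECONSTRUCT = 2


def rgcvae_argv(dataset, mode, checkpoint=None, log_dir="./results", **extra):
    """Build the argv list for RGCVAE.py (hypothetical helper)."""
    mode = Mode(mode)  # accepts either a Mode member or a plain int
    # Assumption: generation and reconstruction restore a trained checkpoint.
    if mode is not Mode.TRAIN and checkpoint is None:
        raise ValueError("generation and reconstruction need a checkpoint")
    argv = ["python", "RGCVAE.py", "--dataset", dataset]
    if checkpoint is not None:
        argv += ["--restore", checkpoint]
    # Extra keyword arguments (for example use_mask=False) become config keys.
    config = {"generation": int(mode), "log_dir": log_dir, **extra}
    return argv + ["--config", json.dumps(config)]


if __name__ == "__main__":
    # The checkpoint path keeps the placeholder used in this README.
    print(rgcvae_argv("zinc", Mode.RECONSTRUCT, "results/[checkpoint].pickle"))
```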

To analyze the results, we used the ComparisonsDGM environment.

Optimization

To optimize a molecule, use the following command:

python RGCVAE.py --dataset zinc_long2 --restore results/[checkpoint].pickle --config '{"generation":1, "use_mask":false, "suffix":"opt", "optimization_step": 20, "number_of_generation":100, "prior_learning_rate":0.3, "use_argmax_nodes":true, "use_argmax_bonds":true}'

Pre-processed datasets, Pre-trained Models and Results

The pre-processed datasets, pre-trained models, and generated molecules will be published soon.

Information

NOTE: Some functions are extracted from the following source code.

License

MIT
