Design and Independent Training of Composable and Reusable Neural Modules

This is the code for the experiments shown in the paper Design and Independent Training of Composable and Reusable Neural Modules. All experiments can be replicated by running the scripts placed under the folder experiments.

The neural network used in the experiments is based on the Neural Module Networks (NMN) architecture. Modules are trained independently and assembled afterwards. This code was inspired by, and borrows some code from, Jacob Andreas' original repository.
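As a toy illustration of this training scheme (a minimal sketch, not the repository's actual modules or API), two small PyTorch modules with fixed interfaces would be trained separately on their own objectives and only then assembled into the full network, with no end-to-end fine-tuning:

import torch
import torch.nn as nn

class FindModule(nn.Module):
    """Toy attention module: maps image features to a spatial attention map."""
    def __init__(self, feat_dim=512):
        super().__init__()
        self.conv = nn.Conv2d(feat_dim, 1, kernel_size=1)

    def forward(self, features):                       # features: (B, 512, 14, 14)
        return torch.sigmoid(self.conv(features))      # attention: (B, 1, 14, 14)

class DescribeModule(nn.Module):
    """Toy answer module: maps attended features to answer logits."""
    def __init__(self, feat_dim=512, n_answers=10):
        super().__init__()
        self.fc = nn.Linear(feat_dim, n_answers)

    def forward(self, features, attention):
        attended = (features * attention).mean(dim=(2, 3))   # (B, 512)
        return self.fc(attended)

# 1. Each module is trained on its own objective (training loops omitted here).
find_module = FindModule()
describe_module = DescribeModule()

# 2. The pre-trained modules are assembled into the full network afterwards.
class AssembledNMN(nn.Module):
    def __init__(self, find_module, describe_module):
        super().__init__()
        self.find = find_module
        self.describe = describe_module

    def forward(self, features):
        return self.describe(features, self.find(features))

nmn = AssembledNMN(find_module, describe_module)
logits = nmn(torch.randn(2, 512, 14, 14))               # (2, n_answers)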

These experiments work on the VQA v1.0 dataset, which should be placed under the data/vqa directory. Image folders must be divided into raw and conv subfolders, e.g. Images/train2014/raw contains the raw image files (which are usually placed directly under Images/train2014), while Images/train2014/conv is where the 14x14x512 VGG16 features will be stored. SPS2 files are already provided in this repository, so it is not necessary to install and run the Stanford Parser.
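The conversion from raw images to conv features is handled by the setup script described below, but the expected layout can be illustrated with a minimal sketch (an assumption, not the repository's actual preprocessing code; the data/vqa paths and the .npy storage format are illustrative): 14x14x512 features are taken from the last convolutional block of VGG16 and written to the conv folder, one file per raw image.

import os
import numpy as np
import torch
from PIL import Image
from torchvision import models, transforms

# VGG16 up to the last conv layer (dropping the final max-pool) yields
# a 14x14x512 feature map for a 224x224 input image.
vgg_conv = models.vgg16(pretrained=True).features[:-1].eval()
preprocess = transforms.Compose([
    transforms.Resize((224, 224)),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

raw_dir = 'data/vqa/Images/train2014/raw'    # illustrative paths
conv_dir = 'data/vqa/Images/train2014/conv'
os.makedirs(conv_dir, exist_ok=True)

with torch.no_grad():
    for fname in os.listdir(raw_dir):
        img = Image.open(os.path.join(raw_dir, fname)).convert('RGB')
        feats = vgg_conv(preprocess(img).unsqueeze(0))       # (1, 512, 14, 14)
        feats = feats.squeeze(0).permute(1, 2, 0).numpy()    # (14, 14, 512)
        np.save(os.path.join(conv_dir, os.path.splitext(fname)[0] + '.npy'), feats)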

ULPGC SIANI

Execution of experiments

All experiments listed here can be replicated by running the corresponding script under the experiments folder. Be sure to run all scripts from the repository's root folder. Before running any experiment, follow these steps:

  1. Ensure that you have placed the VQA data as described above.
  2. Ensure that you have virtualenv installed.
  3. Run 00-setup.sh. This will create a pair of virtual environments and preprocess input images.

01-validate_surrogate.sh

Validation of the surrogate gradient module, testing the correlation of its loss with the final NMN loss. It executes the following steps:

  1. Training of N=100 Find modules, using the sparring module for indirect supervision.
  2. Filtering of the trained Find modules according to uncertainty criteria, and selection of a subset for the correlation plot.
  3. Testing the utility of each module by transferring it to the full NMN and training the remaining modules.
  4. Plotting the correlation found (a minimal plotting sketch follows this list).
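Steps 2-4 amount to checking whether a Find module's surrogate loss predicts the loss of the full NMN built around it. The sketch below is only an illustration of that check (the repository's own plotting code may differ); surrogate_loss and nmn_loss are assumed to hold one value per selected Find module.

import numpy as np
import matplotlib.pyplot as plt

def plot_correlation(surrogate_loss, nmn_loss, out_path='find_correlation.png'):
    """Scatter-plot surrogate loss vs. final NMN loss and report Pearson's r."""
    surrogate_loss = np.asarray(surrogate_loss, dtype=float)
    nmn_loss = np.asarray(nmn_loss, dtype=float)
    r = np.corrcoef(surrogate_loss, nmn_loss)[0, 1]   # Pearson correlation coefficient
    plt.figure()
    plt.scatter(surrogate_loss, nmn_loss)
    plt.xlabel('Surrogate (sparring) loss')
    plt.ylabel('Final NMN loss')
    plt.title(f'Pearson r = {r:.2f}')
    plt.savefig(out_path)
    plt.close()
    return r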

02-end-to-end_baseline.sh

This script runs the hyperparameter search, optimization and evaluation of the end-to-end NMN baseline.

03-direct_modular.sh

Runs modular training of the NMN architecture without any further adjustments. This script runs the hyperparameter search and optimization of each module independently and tests the final configuration on our held-out test set.

04-adjusted_modular.sh

Runs adjusted modular training, in which we make subtle but important changes to the original NMN architecture that improve the compositionality of the modules and therefore the generalisation of the full neural network.

Generating additional plots

If you have already run 02-end-to-end_baseline.sh and 03-direct_modular.sh, you can generate Figures 9 and 10 by running:

python plots/times.py "hyperopt/" --raw-times
python plots/accdist.py "hyperopt/" --nmn-hpo "hyperopt/nmn"

Figure 12 can be generated after having run 04-adjusted_modular.sh by running:

python plots/accdist.py "hyperopt/modular"
