
Towards Discovering Neural Architectures from Scratch

This repository contains the implementation of our paper "Towards Discovering Neural Architectures from Scratch", which represents neural architectures as algebraic terms and uses context-free grammars as the construction mechanism for these terms. For more details, please refer to our paper.

1. Installation

  1. Clone this repository.

  2. Create a conda environment

conda create -n hnas python=3.7

and activate it

conda activate hnas
  3. Install poetry

bash install_dev_utils/poetry.sh

  4. Run poetry install (this can take quite a while) and then run pip install opencv-python. The full setup sequence is sketched below.
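
For convenience, the steps above are summarized as a minimal shell sketch (the clone URL placeholder and checkout directory name are assumptions, not part of the original instructions):

# Minimal setup sketch; <repository-url> is a placeholder for this repository's clone URL.
git clone <repository-url> towards_nas_from_scratch
cd towards_nas_from_scratch

conda create -n hnas python=3.7
conda activate hnas

bash install_dev_utils/poetry.sh
poetry install              # this can take quite a while
pip install opencv-python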

2. Reproducing the paper results

2.1 Search on the cell-based or hierarchical NAS-Bench-201 search space

To reproduce these search experiments, run

python experiments/optimize.py \
--working_directory $working_directory \
--data_path $data_path \
--search_space $search_space \
--objective $objective \
--searcher $searcher \
--surrogate_model $surrogate_model \
--seed $seed \
--pool_strategy evolution \
--pool_size 200 \
--n_init 10 \
--log \
--p_self_crossover 0.5

where $working_directory is the directory the results will be saved to and $data_path is the path to the dataset. The other variables can be set as follows:

variable          options
search_space      nb201_variable_multi_multi (hierarchical) or nb201_fixed_1_none (cell-based)
objective         nb201_cifar10, nb201_cifar100, nb201_ImageNet16-120, nb201_cifarTile, or nb201_addNIST
searcher          bayesian_optimization, random_search, or regularized_evolution
surrogate_model   gp_hierarchical (hWL) or gp (WL) (only active if searcher is set to bayesian_optimization)
seed              777, 888, or 999
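
For example, a Bayesian optimization search with the hWL surrogate on the hierarchical search space and CIFAR-10 could look as follows (an illustrative invocation; results/nb201 and data are placeholder paths):

# Illustrative example; results/nb201 and data are placeholder paths.
python experiments/optimize.py \
  --working_directory results/nb201 \
  --data_path data \
  --search_space nb201_variable_multi_multi \
  --objective nb201_cifar10 \
  --searcher bayesian_optimization \
  --surrogate_model gp_hierarchical \
  --seed 777 \
  --pool_strategy evolution \
  --pool_size 200 \
  --n_init 10 \
  --log \
  --p_self_crossover 0.5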

2.2 Search on the activation function search space

To reproduce this search experiment, run

python experiments/optimize.py \
--working_directory $working_directory \
--data_path $data_path \
--search_space act_cifar10 \
--objective act_cifar10 \
--searcher $searcher \
--surrogate_model $surrogate_model \
--seed $seed \
--pool_strategy evolution \
--pool_size 200 \
--n_init 50 \
--log \
--p_self_crossover 0.5 \
--max_evaluations_total 1000

where $working_directory is the directory the results will be saved to and $data_path is the path to the dataset. The other variables can be set as follows:

variable          options
searcher          bayesian_optimization, random_search, or regularized_evolution
surrogate_model   gp_hierarchical (hWL) or gp (WL) (only active if searcher is set to bayesian_optimization)
seed              777, 888, or 999 (note that we only used seed 777 in our experiments)
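
For example, a Bayesian optimization run with the hWL surrogate on this search space could look as follows (an illustrative invocation; results/act and data are placeholder paths):

# Illustrative example; results/act and data are placeholder paths.
python experiments/optimize.py \
  --working_directory results/act \
  --data_path data \
  --search_space act_cifar10 \
  --objective act_cifar10 \
  --searcher bayesian_optimization \
  --surrogate_model gp_hierarchical \
  --seed 777 \
  --pool_strategy evolution \
  --pool_size 200 \
  --n_init 50 \
  --log \
  --p_self_crossover 0.5 \
  --max_evaluations_total 1000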

2.3 Surrogate experiments

Note that the search experiments (Sections 2.1 and 2.2) have to be run beforehand, or the search data needs to be provided!

To reproduce our surrogate experiments, run

python experiments/surrogate_regression.py \
--working_directory $working_directory \
--search_space $search_space \
--objective $objective \
--surrogate_model $surrogate_model \
--n_train $n_train \
--log

where $working_directory is the directory the data from the search runs was saved to, and to which the surrogate results will be saved. The other variables can be set as follows:

variable          options
search_space      nb201_variable_multi_multi (hierarchical) or nb201_fixed_1_none (cell-based)
objective         nb201_cifar10, nb201_cifar100, nb201_ImageNet16-120, nb201_cifarTile, or nb201_addNIST
surrogate_model   gp_hierarchical (hWL) or gp (WL)
n_train           10, 25, 50, 75, 100, 150, 200, 300, or 400
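
For example, fitting the hWL surrogate on 100 training architectures from the hierarchical CIFAR-10 search runs could look as follows (an illustrative invocation; results/nb201 is a placeholder that must point to the saved search data):

# Illustrative example; results/nb201 must contain prior search results.
python experiments/surrogate_regression.py \
  --working_directory results/nb201 \
  --search_space nb201_variable_multi_multi \
  --objective nb201_cifar10 \
  --surrogate_model gp_hierarchical \
  --n_train 100 \
  --log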

3. Citing

If you would like to learn more about our work, please read our paper. If you find our approach useful for your own work, please cite it as follows:

@misc{Schrodi_Towards_Discovering_Neural_2022,
  doi = {10.48550/ARXIV.2211.01842},
  url = {https://arxiv.org/abs/2211.01842},
  author = {Schrodi, Simon and Stoll, Danny and Ru, Binxin and Sukthanker, Rhea and Brox, Thomas and Hutter, Frank},
  keywords = {Machine Learning (cs.LG), Artificial Intelligence (cs.AI), Computer Vision and Pattern Recognition (cs.CV), Machine Learning (stat.ML), FOS: Computer and information sciences},
  title = {Towards Discovering Neural Architectures from Scratch},
  publisher = {arXiv},
  year = {2022},
  copyright = {arXiv.org perpetual, non-exclusive license}
}
