# install dependencies via pip3 (Python 3 required)
make install
# or directly:
pip3 install -r requirements.txt
# download ParTUT and FastText [32, EN] data files (optional)
make download
# run the gradient descent demo
make descent
# run the evolution demo
make evolve
# run the swarm demo
make swarm
# run the simplex demo
make simplex
# run the gadam demo
make gadam
# run the orchestration demo
make orchestra
# run custom training
python3 -m beyondGD -M ${tagger_config.json} -T ${train_config.json} -D ${data_config.json}
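An example tagger_config.json, defining the model architecture (embedding, LSTM, and scoring layer):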
{
  "embedding": {
    "size": 32,       // embedding dimension
    "dropout": 0.0
  },
  "lstm": {
    "hid_size": 16,   // LSTM hidden size
    "depth": 1,       // number of stacked LSTM layers
    "dropout": 0.5
  },
  "score": {
    "dropout": 0.5    // dropout in the scoring layer
  }
}
The trainer supports the following optimization algorithms: Gradient Descent, Evolution (ES), Particle Swarm Optimization (PSO), the Nelder–Mead method (Simplex), and Gadam (Genetic-Evolutionary Adam). The individual tasks can be orchestrated freely in the training process. A single-task train_config.json looks as follows; an orchestrated multi-task sketch is given after it:
{
  "tasks": [
    {
      "type": "string", // Supports: [descent, evolve, swarm, simplex, gadam]
      "population_size": 400, // Only: [evolve, swarm, simplex, gadam]
      "parameters": {
        // Descent:
        "learning_rate": 5e-2,
        "weight_decay": 1e-6,
        "gradient_clip": 60.0,
        // Evolve:
        "mutation_rate": 0.02,
        "crossover_prob": 0.5,
        "selection_size": 20,
        // Swarm:
        "learning_rate": 0.05,
        "velocity_weight": 1.0,
        "initial_velocity_rate": 0.02,
        "personal_weight": 2.0,
        "global_weight": 2.0,
        // Simplex:
        "expansion_rate": 2.0,
        "contraction_rate": 0.5,
        "shrink_rate": 0.02,
        // Gadam:
        "learning_rate": 5e-2,
        "learning_prob": 1.0,
        "weight_decay": 1e-6,
        "mutation_rate": 0.02,
        "mutation_prob": 0.8,
        "crossover_prob": 0.6,
        "selection_size": 10,
        // General:
        "epoch_num": 50,
        "report_rate": 5,
        "batch_size": 96
      }
    }
  ]
}
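An orchestrated run simply lists several tasks. A minimal sketch, assuming each entry follows the schema above; the epoch counts are illustrative, not tuned, and the assumption that each stage continues from the result of the previous one is our reading of the orchestration description:

{
  "tasks": [
    {
      "type": "descent", // stage 1: warm up the model with gradient descent
      "parameters": {
        "learning_rate": 5e-2,
        "weight_decay": 1e-6,
        "gradient_clip": 60.0,
        "epoch_num": 20,   // illustrative value
        "report_rate": 5,
        "batch_size": 96
      }
    },
    {
      "type": "evolve", // stage 2: refine with the evolutionary optimizer
      "population_size": 400,
      "parameters": {
        "mutation_rate": 0.02,
        "crossover_prob": 0.5,
        "selection_size": 20,
        "epoch_num": 30,   // illustrative value
        "report_rate": 5,
        "batch_size": 96
      }
    }
  ]
}

The data_config.json points the trainer at the embedding and corpus files: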
{
  "embedding": "path/to/fasttext-data.bin",
  "preprocess": true, // pre-embed and encode the training data
  "reduce_train": 0.995, // reduce the amount of training data (percentage)
  "train": "path/to/train.conllu",
  "dev": "path/to/dev.conllu",
  "test": "path/to/test.conllu",
  "load_model": "path/to/existing_model", // load an existing model from a .pickle file
  "save_model": "path/to/trained_model" // save the model after training as a .pickle file
}
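For example, a run that fine-tunes a previously trained tagger can point load_model at an earlier save_model output. All file names below are hypothetical placeholders:

{
  "embedding": "data/fasttext-en-32.bin", // hypothetical FastText binary
  "preprocess": true,
  "reduce_train": 0.995,
  "train": "data/en_partut-ud-train.conllu", // hypothetical ParTUT split
  "dev": "data/en_partut-ud-dev.conllu",
  "test": "data/en_partut-ud-test.conllu",
  "load_model": "models/tagger_v1.pickle", // model saved by a previous run
  "save_model": "models/tagger_v2.pickle"
}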
Required: make download (only available on Linux)
The experiments located in the results directory can be reproduced with the following make commands:
# 00-Baseline-Gradient
make exp0
# 01-Baseline-Evolve
make exp1
# 02-Baseline-Swarm
make exp2
# 03-Baseline-Simplex
make exp3
# 04-Baseline-Gadam
make exp4
# 05-Orchestration
make exp5
# test: pytest
make test
# lint: flake8
make lint
# clean: cache/tmp files
make clean
- 0.1 POS-Tagger preliminary beta
- 0.2 Optimized POS-Tagger
- 0.3 Optimized Gradient Descent Training
- 0.4 Included Genetic Algorithm Training
- 1.0 Created stable Experimenting Environment
- 2.0 Included swarm training approach
- 2.1 Included advanced metrics
- 3.0 Included simplex training
- 3.1 Reworked tasks interface
- 4.0 Reworked into orchestrated training process
- 4.1 Added model load/save function
- 4.2 Reworked simplex optimization
- 5.0 Added new PSO optimizer, discarded old swarm approach
- 6.0 Added new Gadam optimizer
- 6.1 Minor Updates, Code Freeze