Kernel-Supported Neural Networks

Experiments and research related to kernel-supported neural networks, which are an attempt to high-probability accuracy premises based on theoretical guarantees of train-supporting using kernel regression (Nadaraya-Watson) estimators.

Dependencies

We based our implementation on our core library, pydentification (see: https://github.com/cyber-physical-systems-group/pydentification). Most of the feature is implemented in v0.2.0-alpha and the v0.3.0-alpha version contains the code for running the experiments (entrypoints etc.), experimentation code was implemented here and generalized and moved to main library.

Note: from v0.5.0 implementation of nonparametric modules was moved here, to reduce the complexity of the main library and keep only shared components there.

Releases

Version	`pydentification` version	Description
`v1.0-alpha`	`v0.2.0-alpha`	Pre-review code for the paper, with `nonparametric` models parts of the `pydentification`
`v1.1-alpha`	`v0.5.0`	Pre-review code for the paper, with python 3.12 support and `nonparametric` models here
`v1.1-alpha-standalone`	NA	Stand-alone version, which can be run without `pydentification`, otherwise same as `v1.1-alpha`

Source

Source code to run experiments in given in src package. It contains code specific for this research, but heavily relies on pydentification library. The code is organized in the following way:

models - definition of neural models and bounded trainer, with some manual toggles (CPU/GPU, etc.)
plots - plotly utils for visualizing results for static and SISO dynamic systems
shared - code for preparing input using data-modules, reporting to W&B and saving models

Model Parameters

Models are added as assets to each release. Download them and put then into models directory. For v1.0-alpha entire torch models are added, but for v1.1-alpha only the weights are added, since the model is defined in the code. The same models can be used for v1.1-alpha-standalone release.

Datasets

This directory contains datasets used in the paper and presentations related to it. Each dataset is stored as file in csv directory. For Wiener-Hammerstein benchmark the values for Sigma and L are estimated, using smoothing and the characteristic of the nonlinear diode (only nonlinear element in the circuit).

Dataset	Train Samples	Test Samples	Dynamics	Sigma	L
TOY	1000	1000	None	0.05	1
STATIC	10 000	10 000	None	0.1	1/4
DYNAMIC	50 000	50 000	Nonlinear	0.05	1
WIENER-HAMMERSTEIN	100 000	88 000	Nonlinear	<10e-3	~25

TOY

This dataset is toy dataset mostly for visual explanation of the algorithms used. It contains data for single function, which is R^1 to R^1 function, defined as sin(x) exp(-gamma x), where gamma is set to 0.1. Samples on x axis are samples from uniform distribution on interval [0, 12pi] and samples of y axis have additional noise with Gaussian distribution with standard deviation 0.05.

Dataset is stored in csv/toy.csv file with feature column x and target column y.

STATIC

This dataset represents high dimensional static system, R^8 to R^1. The function is defined as sum of 8 Gaussian density functions (not distribution, but explicit PDF) with different C (given in table below). The inputs are samples uniformly on range [-1, 1]. L is 1/4 for all Gaussian functions, so for its sum as well. Noise was added with sigma = 0.1.

C	Value
C1	1.42
C2	-1.32
C3	-1.65
C4	-1.04
C5	0.61
C6	1.98
C7	0.31
C8	-0.31

DYNAMIC

This dataset represents low dimensional dynamic system, R^1 to R^1 with nonlinear dynamics. The function is defined as x' = -k \sigma(x-1) u(t), where sigma is sigmoid function, k is constant equal to 0.9 and u(t) is input signal, which was generated as sine wave. The system was numerically integrated and is stored in CSV files with 3 columns: "t" for time, "u" for inputs and "y" for noised outputs. Initial condition was 1, time is set to 100 000 samples from 0 to 100 seconds and forcing was given as sin(pi/5 t).

WIENER-HAMMERSTEIN BENCHMARK

The Wiener-Hammerstein benchmark is a benchmark for nonlinear system identification. It is widely used in research related to nonlinear systems. Its description and data can be found on this page.

Results

Results for out method on problems described above. $RMSE$ denotes root-mean-square error, $RRR$ denotes RMSE range ratio, which is the fraction of RMSE compared to the range of the target variable. For bounds of Wiener-Hammerstein benchmark, we did not optimize or search for best model, since both the dimensionality of the problem and its Lipschitz constant are too high, for our method to work.

Metric	Sine Function	Sum of Gaussians	Nonlinear Dynamics	Wiener-Hammerstein
`RMSE_NET`	0.1037	0.0103	0.0502	0.0005
`RMSE_KRE`	0.1041	0.0143	0.0505	0.0835
`RMSE_BOUNDS`	0.4207	0.2723	0.7054	>1
`RRR_NET`	5.86%	6.54%	3.64%	0.23%
`RRR_KRE`	5.89%	8.87%	3.67%	34.89%
`RRR_BOUND`	15.67%	163.99%	38.38%	>500%

Name		Name	Last commit message	Last commit date
Latest commit History 91 Commits
.github/workflows		.github/workflows
configs		configs
data/csv		data/csv
notebooks		notebooks
src		src
tests		tests
.flake8		.flake8
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
requirements-dev.txt		requirements-dev.txt
requirements-notebooks.txt		requirements-notebooks.txt
requirements.txt		requirements.txt
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Kernel-Supported Neural Networks

Dependencies

Releases

Source

Model Parameters

Datasets

TOY

STATIC

DYNAMIC

WIENER-HAMMERSTEIN BENCHMARK

Results

About

Releases

Packages

Contributors 2

Languages

License

cyber-physical-systems-group/kernel-supported-neural-networks

Folders and files

Latest commit

History

Repository files navigation

Kernel-Supported Neural Networks

Dependencies

Releases

Source

Model Parameters

Datasets

TOY

STATIC

DYNAMIC

WIENER-HAMMERSTEIN BENCHMARK

Results

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages