This repo contains implementations of several forms of attention (a sketch of the dot-product variant follows the list):
- Location-based attention
- Content-based dot-product attention
- Content-based concatenation attention
- Content-based general attention
- Pointer networks
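
As a reference point, here is a minimal sketch of the content-based dot-product variant. It assumes PyTorch and batch-first tensors; the function name and shapes are illustrative, not the repo's exact API.

```python
import torch
import torch.nn.functional as F

def dot_product_attention(decoder_state, encoder_outputs):
    """Content-based dot-product attention (illustrative sketch).

    decoder_state:   (batch, hidden)          current decoder hidden state
    encoder_outputs: (batch, src_len, hidden) all encoder hidden states
    """
    # Score each encoder state by its dot product with the decoder state
    scores = torch.bmm(encoder_outputs, decoder_state.unsqueeze(2)).squeeze(2)  # (batch, src_len)
    weights = F.softmax(scores, dim=1)                                          # attention distribution
    # Context vector: attention-weighted sum of encoder states
    context = torch.bmm(weights.unsqueeze(1), encoder_outputs).squeeze(1)       # (batch, hidden)
    return context, weights
```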
Each of these sequence-to-sequence models is trained to sort a shuffled array of the numbers 1 to N. The code to generate this data is here.
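
The repo's generation code is the authoritative source for the data format; as a rough sketch of the idea, each training pair could be built like this (the `make_example` helper is hypothetical):

```python
import random

def make_example(n):
    """One (input, target) pair: a shuffled permutation of 1..n and its sorted order."""
    xs = list(range(1, n + 1))
    random.shuffle(xs)
    return xs, sorted(xs)

# e.g. make_example(5) -> ([3, 1, 5, 2, 4], [1, 2, 3, 4, 5])
```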
The attention-based models show a considerable improvement over the no-attention baseline.
All the models and the data loader are defined in `code/`.
- Each model is defined in a separate file. The file containing a model also contains `train` and `test` functions, which are self-explanatory.
- Output logs are stored under `training_outputs/`.
- Attention weights can be visualized using the code in the notebook Visualizing attention (a plotting sketch follows this list).
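
The notebook is the authoritative reference; a minimal matplotlib sketch of the same idea, with illustrative names (`plot_attention`, and `weights` as a 2-D array of attention scores), might look like:

```python
import matplotlib.pyplot as plt

def plot_attention(weights, src_tokens, out_tokens):
    """Heatmap of attention weights: rows are output steps, columns are input positions."""
    fig, ax = plt.subplots()
    im = ax.imshow(weights, cmap="viridis")  # weights: (len(out_tokens), len(src_tokens))
    ax.set_xticks(range(len(src_tokens)))
    ax.set_xticklabels([str(t) for t in src_tokens])
    ax.set_yticks(range(len(out_tokens)))
    ax.set_yticklabels([str(t) for t in out_tokens])
    ax.set_xlabel("input position")
    ax.set_ylabel("output step")
    fig.colorbar(im, ax=ax)
    plt.show()
```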